Using Clustering Labels to Supervise Mashup Service Classification

Authors: Jianwen Xiang, Lin Li, Yang Liu

Tags: 2018, conceptual modeling

With the rapid growth of mashup resources, clustering mashup services by function has become an effective way to improve the quality of mashup service management. Clustering is a learning task that groups individuals or objects into clusters based on their similarity. Its purpose is to maximize the homogeneity of elements within the same cluster and the heterogeneity of elements in different clusters. It is a multivariate statistical method for classification. However, compared with supervised classification, clustering's ability to categorize is much weaker. Existing methods for mashup service clustering mostly focus on directly utilizing key features from WSDL documents. In this paper, we propose a method to improve the categorization ability of clustering by applying the idea of supervised learning to mashup service clustering. First, we perform basic clustering on the WSDL documents of the mashups to obtain a cluster label for each element. Then, using the WSDL documents as training data and the cluster labels from the first step as pseudo-tags, we train a classification learner. Finally, we classify the mashups with this learner to obtain the final clustering results.
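The three steps described above can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not name the clustering algorithm or the classifier, so k-means and a multinomial naive Bayes learner are used here as stand-ins, whitespace-separated keywords stand in for features extracted from WSDL documents, and every function name and the toy data are hypothetical.

```python
import math


def vectorize(docs):
    """Turn each document into a bag-of-words count vector (assumed feature model)."""
    vocab = sorted({t for d in docs for t in d.lower().split()})
    index = {t: i for i, t in enumerate(vocab)}
    vectors = []
    for d in docs:
        v = [0.0] * len(vocab)
        for t in d.lower().split():
            v[index[t]] += 1.0
        vectors.append(v)
    return vectors


def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(vectors, k, iterations=20):
    """Step 1: basic clustering (k-means is an assumption, not named in the abstract)."""
    # Deterministic farthest-point initialisation keeps the sketch reproducible.
    centroids = [vectors[0][:]]
    while len(centroids) < k:
        far = max(vectors, key=lambda v: min(sq_dist(v, c) for c in centroids))
        centroids.append(far[:])
    labels = [0] * len(vectors)
    for _ in range(iterations):
        # Assign every vector to its nearest centroid.
        labels = [min(range(k), key=lambda c: sq_dist(v, centroids[c]))
                  for v in vectors]
        # Move each centroid to the mean of its members.
        for c in range(k):
            members = [v for v, l in zip(vectors, labels) if l == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def train_naive_bayes(vectors, pseudo_labels):
    """Step 2: train a classifier on the cluster labels used as pseudo-tags."""
    model = {}
    for c in sorted(set(pseudo_labels)):
        members = [v for v, l in zip(vectors, pseudo_labels) if l == c]
        counts = [sum(col) + 1.0 for col in zip(*members)]  # Laplace smoothing
        total = sum(counts)
        log_prior = math.log(len(members) / len(vectors))
        model[c] = (log_prior, [math.log(x / total) for x in counts])
    return model


def predict(model, vector):
    """Return the class with the highest log-posterior score."""
    def score(c):
        log_prior, log_lik = model[c]
        return log_prior + sum(x * w for x, w in zip(vector, log_lik))
    return max(model, key=score)


def cluster_then_classify(docs, k):
    """Steps 1 to 3: cluster, train on pseudo-tags, then re-label by classification."""
    vectors = vectorize(docs)
    pseudo_labels = kmeans(vectors, k)
    model = train_naive_bayes(vectors, pseudo_labels)
    final_labels = [predict(model, v) for v in vectors]  # Step 3
    return pseudo_labels, final_labels


# Hypothetical toy descriptions standing in for mashup WSDL content.
docs = [
    "map location route geocode",
    "map route location traffic",
    "photo image upload album",
    "photo album image share",
]
pseudo_labels, final_labels = cluster_then_classify(docs, k=2)
print(pseudo_labels, final_labels)
```

On well-separated toy data like this, the classifier is expected to reproduce the cluster assignment; the final labels can differ from the pseudo-tags only where the learner generalizes differently from the initial clustering, for example on noisier data.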

Read the full paper here: https://link-springer-com.proxy2.hec.ca/chapter/10.1007/978-3-030-01391-2_8