A New XML Clustering for Structural Retrieval

0
59

Authors: Jeong Hee Hwang, Keun Ho Ryu

Tags: 2004, conceptual modeling

XML becomes increasingly important in data exchange and information management. Starting point for retrieving the information and integrating the documents efficiently is clustering the documents that have similar structure. Thus, in this paper, we propose a new XML document clustering method based on similar structure. Our approach first extracts the representative structures of XML documents by sequential pattern mining. And then we cluster XML documents of similar structure using the clustering algorithm for transactional data, assuming that an XML document as a transaction and the frequent structure of documents as the items of the transaction. We also apply our technique to XML retrieval. Our experiments show the efficiency and good performance of the proposed clustering method.

Read the full paper here: https://link.springer.com/chapter/10.1007/978-3-540-30464-7_30