View : 671 Download: 0

Preparations for semantics-based XML mining

Title
Preparations for semantics-based XML mining
Authors
Lee J.-W.Lee K.Kim W.
Ewha Authors
이기호이정원
SCOPUS Author ID
이기호scopus
Issue Date
2001
Journal Title
Proceedings - IEEE International Conference on Data Mining, ICDM
ISSN
1550-4786JCR Link
Citation
Proceedings - IEEE International Conference on Data Mining, ICDM, pp. 345 - 352
Indexed
SCOPUS scopus
Document Type
Conference Paper
Abstract
XML allows users to define elements using arbitrary words and organize them in a nested structure. These features of XML offer both challenges and opportunities in information retrieval, document management, and data mining. In this paper, we propose a new methodology for preparing XML documents for quantitative determination of similarity between XML documents by taking account of XML semantics (i.e., meanings of the elements and nested structures of XML documents). Accurate quantitative determination of similarity between XML documents provides an important basis for a variety of applications of XML document mining and processing. Experiments with XML documents show that our methodology provides a 50-100% improvement in determining similarity, over the traditional vector-space model that considers only term-frequency and 100% accuracy in identifying the category of each document from an on-line bookstore. © 2001 IEEE.
ISBN
0769511198

9780769511191
Appears in Collections:
인공지능대학 > 컴퓨터공학과 > Journal papers
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML


qrcode

BROWSE