DocumentCode :
2860973
Title :
Ontology-Based Structured Cosine Similarity in Speech Document Summarization
Author :
Yuan, Soe-Tsyr ; Sun, Jerry
Author_Institution :
National Chengchi University, Taipei, Taiwan
fYear :
2004
fDate :
20-24 Sept. 2004
Firstpage :
508
Lastpage :
513
Abstract :
Development of algorithms for automated text categorization in massive text document sets is an important research area of data mining and knowledge discovery. Most of the text-clustering methods were grounded in the term-based measurement of distance or similarity, ignoring the structure of terms in documents. In this paper we present a novel method named Structured Cosine Similarity that furnishes document clustering with a new way of modeling on document summarization, considering the structure of terms in documents in order to improve the quality of speech document clustering.
Keywords :
Clustering methods; Data mining; Frequency; Large scale integration; Management information systems; Ontologies; Size measurement; Speech; Sun; Text categorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Intelligence, 2004. WI 2004. Proceedings. IEEE/WIC/ACM International Conference on
Print_ISBN :
0-7695-2100-2
Type :
conf
DOI :
10.1109/WI.2004.10091
Filename :
1410855
Link To Document :
بازگشت