DocumentCode :
2734131
Title :
Deploying Top-k Specific Patterns for Relevance Feature Discovery
Author :
Pipanmaekaporn, Luepol ; Li, Yuefeng ; Geva, Shlomo
Author_Institution :
Fac. of Sci. & Technol., Queensland Univ. of Technol., Brisbane, QLD, Australia
Volume :
3
fYear :
2010
fDate :
Aug. 31 2010-Sept. 3 2010
Firstpage :
318
Lastpage :
321
Abstract :
The quality of discovered patterns is important for relevance feature discovery in text documents because frequent pattern mining often produces many noisy patterns. In this paper, we propose a novel method for the summarization of discovered patterns in text documents, finding a smaller number of specific patterns for representing the large number of discovered patterns for a given topic. We also evaluate the proposed method by implementing a new pattern-based information filtering model. The experimental results show that the proposed method not only outperforms both the term-based approaches and pattern based approaches, but largely reduces the number of feature terms as well.
Keywords :
data mining; information filtering; text analysis; discovered pattern summarization; frequent pattern mining; pattern based approaches; pattern-based information filtering model; relevance feature discovery; term-based approaches; text documents; top-k specific pattern quality; Conferences; Feature extraction; Frequency measurement; Ontologies; Taxonomy; Text mining; Training; Information Filtering; Pattern Mining; Relevance Feature Discovery; Text Mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2010 IEEE/WIC/ACM International Conference on
Conference_Location :
Toronto, ON
Print_ISBN :
978-1-4244-8482-9
Electronic_ISBN :
978-0-7695-4191-4
Type :
conf
DOI :
10.1109/WI-IAT.2010.194
Filename :
5614182
Link To Document :
بازگشت