DocumentCode :
3399784
Title :
Semantics preserved concept based mining model
Author :
Seena, M.S. ; Velayudhan, Remya
Author_Institution :
Dept. of Comput. Sci., Amrita Vishwavidyapeetham, Cochin, India
fYear :
2013
fDate :
10-11 Oct. 2013
Firstpage :
1
Lastpage :
8
Abstract :
Most of the text retrieval and mining methods are still based on the exact word matching and they use term frequency (word or phrases) as basic measure. It captures the importance of the term in the document but may not capture the original semantics of the term, resulting in poor retrieval performance. To overcome the lack of semantic consideration, a new framework has been introduced which relies on the concept based mining model and semantic based approach. The core part of our model is concept extraction, which perform functions such as document cleaning, parts-of-speech tagging, parsing, term and phrase extraction, feasibility analysis and relation miner. Semantic net and synonym dictionary preserve the semantic relationship in the text document. The dataset used here is ACM abstract articles collected from ACM digital library. Large sets of experiments using the proposed model were conducted and the results demonstrate the accuracy of mining model using semantics preserved concepts, feasibility analysis using singular value decomposition and semantic net representation.
Keywords :
data mining; digital libraries; information retrieval; singular value decomposition; text analysis; ACM abstract articles; ACM digital library; concept extraction; document cleaning; feasibility analysis; parsing; part-of-speech tagging; phrase extraction; relation miner; retrieval performance; semantic net representation; semantic-based approach; semantics preserved concept-based mining model; singular value decomposition; synonym dictionary; term extraction; term frequency; text document; text mining method; text retrieval method; word matching; Abstracts; Dictionaries; Large scale integration; Natural language processing; Semantics; Tagging; Vectors; Root verb extraction; Semantic Net; Singular Value Decomposition; Synonym dictionary; Term and phrase extraction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Emerging Trends in Communication, Control, Signal Processing & Computing Applications (C2SPCA), 2013 International Conference on
Conference_Location :
Bangalore
Print_ISBN :
978-1-4799-1082-3
Type :
conf
DOI :
10.1109/C2SPCA.2013.6749408
Filename :
6749408
Link To Document :
بازگشت