DocumentCode
3399784
Title
Semantics preserved concept based mining model
Author
Seena, M.S. ; Velayudhan, Remya
Author_Institution
Dept. of Comput. Sci., Amrita Vishwavidyapeetham, Cochin, India
fYear
2013
fDate
10-11 Oct. 2013
Firstpage
1
Lastpage
8
Abstract
Most of the text retrieval and mining methods are still based on the exact word matching and they use term frequency (word or phrases) as basic measure. It captures the importance of the term in the document but may not capture the original semantics of the term, resulting in poor retrieval performance. To overcome the lack of semantic consideration, a new framework has been introduced which relies on the concept based mining model and semantic based approach. The core part of our model is concept extraction, which perform functions such as document cleaning, parts-of-speech tagging, parsing, term and phrase extraction, feasibility analysis and relation miner. Semantic net and synonym dictionary preserve the semantic relationship in the text document. The dataset used here is ACM abstract articles collected from ACM digital library. Large sets of experiments using the proposed model were conducted and the results demonstrate the accuracy of mining model using semantics preserved concepts, feasibility analysis using singular value decomposition and semantic net representation.
Keywords
data mining; digital libraries; information retrieval; singular value decomposition; text analysis; ACM abstract articles; ACM digital library; concept extraction; document cleaning; feasibility analysis; parsing; part-of-speech tagging; phrase extraction; relation miner; retrieval performance; semantic net representation; semantic-based approach; semantics preserved concept-based mining model; singular value decomposition; synonym dictionary; term extraction; term frequency; text document; text mining method; text retrieval method; word matching; Abstracts; Dictionaries; Large scale integration; Natural language processing; Semantics; Tagging; Vectors; Root verb extraction; Semantic Net; Singular Value Decomposition; Synonym dictionary; Term and phrase extraction;
fLanguage
English
Publisher
ieee
Conference_Titel
Emerging Trends in Communication, Control, Signal Processing & Computing Applications (C2SPCA), 2013 International Conference on
Conference_Location
Bangalore
Print_ISBN
978-1-4799-1082-3
Type
conf
DOI
10.1109/C2SPCA.2013.6749408
Filename
6749408
Link To Document