DocumentCode
249094
Title
Effective pattern discovery by cleaning patterns with pattern co-occurrence matrix and PDCS deploying approach
Author
Gangarde, Rupali ; Kolhe, V.L.
Author_Institution
D.Y. Patil Coll. of Eng., Akurdi Univ. of Pune, Pune, India
fYear
2014
fDate
19-20 Aug. 2014
Firstpage
119
Lastpage
124
Abstract
Text mining is a discovery of interesting knowledge in text documents. Exact and accurate knowledge in the text documents needed for the user to find what they require. Many data mining methods are used to mine useful patterns from text documents. However, using and updating these discovered patterns is still an open research issue. Many term based methods are suggested, but a disadvantage with these methods is that they suffer from the problem of synonymy and polysemy. To overcome these disadvantages pattern mining methods are recommended. Pattern mining methods are not proven to be better than term based methods because of low frequency and pattern misinterpretation problem. Here an effective pattern discovery technique is given which applies a pattern co-occurrence matrix to clean close sequential patterns. Process of pattern deploying is applied with the co-occurrence weight and absolute support (PDCS) as deploying approach to overcome pattern misinterpretation problems and pattern evolving to overcome low frequency problem. It also applies a pattern co-occurrence matrix to clean close sequential patterns. This improves performance by using and updating discovered patterns and finding interesting and relevant information.
Keywords
data mining; pattern classification; text analysis; PDCS deploying approach; data mining methods; knowledge discovery; pattern cooccurrence matrix; pattern discovery technique; pattern mining methods; pattern misinterpretation problems; text documents; text mining; Carbon dioxide; Data mining; Equations; Noise measurement; Phase change materials; Taxonomy; Training; Information Retrieval; Pattern Deploying; Pattern Evolving;
fLanguage
English
Publisher
ieee
Conference_Titel
Networks & Soft Computing (ICNSC), 2014 First International Conference on
Conference_Location
Guntur
Print_ISBN
978-1-4799-3485-0
Type
conf
DOI
10.1109/CNSC.2014.6906647
Filename
6906647
Link To Document