Title :
Combining Information Extratction for Text Mining by Using Morphological Patterns and Knowledge Discovery Using Inductive Logic Programming
Author :
Christy, A. ; Thambidurai, P.
Author_Institution :
Sathyabama Univ., Tamil Nadu
Abstract :
This paper introduces concepts and a rule-based model for information extraction (IE) strategy using unsupervised algorithm and inductive learning in a top-down fashion. We have used the natural language processing techniques for identifying the morphological patterns (features) and for constructing patterns based on which the necessary information is extracted. The extracted information is then used to discover knowledge in the form of if-then rules. We have considered the technical abstracts of two different domains, by relating the information extracted from the abstract part with the information provided in the conclusion part. The information gain is found as the result of knowledge discovery and we have found our system producing an accuracy of 90%.
Keywords :
data mining; feature extraction; inductive logic programming; learning by example; mathematical morphology; natural language processing; text analysis; unsupervised learning; feature extraction; if-then rules; inductive learning; inductive logic programming; information extraction; knowledge discovery; morphological patterns; natural language processing technique; text mining; unsupervised algorithm; Abstracts; Computational intelligence; Data mining; Educational institutions; Feature extraction; Hidden Markov models; Knowledge engineering; Logic programming; Natural language processing; Text mining; Information Extraction; Information gain; Parsing; Recall; feature extraction;
Conference_Titel :
Computational Intelligence and Intelligent Informatics, 2007. ISCIII '07. International Symposium on
Conference_Location :
Agadir
Print_ISBN :
1-4244-1158-0
Electronic_ISBN :
1-4244-1158-0
DOI :
10.1109/ISCIII.2007.367362