DocumentCode :
3656940
Title :
Discover trending domains using fusion of supervised machine learning with natural language processing
Author :
Shilpa Lakhanpal;Ajay Gupta;Rajeev Agrawal
Author_Institution :
Western Michigan University, Kalamazoo, MI, U.S.A
fYear :
2015
fDate :
7/1/2015 12:00:00 AM
Firstpage :
893
Lastpage :
900
Abstract :
In this paper, a new technique is presented for mining key domain areas from scientific publications. A domain refers to a particular branch of scientific knowledge and hence largely defines the theme of any scientific research paper. The proposed technique stems from a fusion of knowledge derived from natural language processing and machine learning. Some words or phrases are extracted based on their meaning inferred by the application of preposition disambiguation. These key words or phrases are then classified as domain areas using supervised learning. Various experiments and their analyses yield concrete results validating the efficacy and application of our methodology. The fusion technique therefore extracts an interesting aspect of research from scientific text and hence propounds a hybrid methodology for deriving meaning from underlying text. This approach thus takes a definitive step in advancing text analytics.
Keywords :
"Natural language processing","Hidden Markov models","Mathematical model","Feature extraction","Supervised learning","Computers","Data mining"
Publisher :
ieee
Conference_Titel :
Information Fusion (Fusion), 2015 18th International Conference on
Type :
conf
Filename :
7266654
Link To Document :
بازگشت