DocumentCode :
726915
Title :
Analyzing the Semantic Relatedness of Paper Abstracts: An Application to the Educational Research Field
Author :
Paraschiv, Ionut Cristian ; Dascalu, Mihai ; Trausan-Matu, Stefan ; Dessus, Philippe
Author_Institution :
Comput. Sci. Dept., Univ. Politeh. of Bucharest, Bucharest, Romania
fYear :
2015
fDate :
27-29 May 2015
Firstpage :
759
Lastpage :
764
Abstract :
Each domain, along with its knowledge base, changes over time, and every timeframe is centered on specific topics that emerge from ongoing research projects. As searching for relevant resources is a time-consuming process, the automatic extraction of the most important and relevant articles from a domain becomes essential in supporting researchers in their day-to-day activities. The proposed analysis extends previous research focused on extracting co-citations between papers, with the purpose of comparing their overall importance within the domain from a semantic perspective. Our method focuses on the semantic analysis of paper abstracts by using Natural Language Processing (NLP) techniques such as Latent Semantic Analysis, Latent Dirichlet Allocation, or ontology-based distances computed over WordNet. Moreover, the defined mechanisms are applied to two different subdomains from the corpora generated around the keywords "e-learning" and "computer". Graph visual representations are used to highlight the keywords of each subdomain, links among concepts and between articles, as well as specific document similarity views and scores reflecting the keyword-abstract overlaps. Finally, conclusions and future improvements are presented, emphasizing the key elements of our research support framework.
Keywords :
citation analysis; computer aided instruction; document handling; knowledge based systems; natural language processing; ontologies (artificial intelligence); NLP techniques; cocitation extraction; computer; document similarity views; e-learning; educational research field; graph visual representations; keyword-abstract overlaps; knowledge base; latent Dirichlet allocation; latent semantic analysis; natural language processing; ontology distances; paper abstracts; semantic relatedness analysis; Computational modeling; Computer science; Computers; Electronic learning; Semantics; Visualization; discourse analysis; extraction of domain key concepts; scientometrics; semantic similarity;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Control Systems and Computer Science (CSCS), 2015 20th International Conference on
Conference_Location :
Bucharest
Print_ISBN :
978-1-4799-1779-2
Type :
conf
DOI :
10.1109/CSCS.2015.146
Filename :
7168510