Title :
Named entity linking in microblog posts using graph-based centrality scoring
Author :
Kalloubi, Fahd ; Nfaoui, El Habib ; El Beqqali, Omar
Author_Institution :
Sidi Mohammed Ben Abdellah Univ., Fez, Morocco
Abstract :
Microblogging platforms have emerged as large collections of short documents, which have been growing in term of size and volume. In fact, providing an effective way to add semantics to this form of communication present a significant research challenge because his noisy and inconsistency nature. Named Entity Linking (NEL) is a subtask of information extraction that aims to ground entity mentions to their corresponding node in a Knowledge Base (KB), which requires a disambiguation step, because many resources can be matched to the same entity that lead to synonymy and polysemy problems. To overcome such problems especially in the context of short text, we present a robust system for automatically extract named entities, disambiguating and linking them to knowledge base resources, based on graph centrality algorithm and Linked Open Data (LOD) paradigm. Also, we evaluate our system using a real Twitter dataset [1] and comparing it with a public tool to show his effectiveness.
Keywords :
graph theory; knowledge based systems; social networking (online); text analysis; KB; LOD paradigm; Twitter dataset; automatic named entity extraction; graph centrality algorithm; graph-based centrality scoring; information extraction; knowledge base resources; linked open data paradigm; microblog posts; microblogging platforms; named entity disambiguation; named-entity linking; polysemy problem; public tool; semantic level; short-document collections; short-text; synonymy problem; Educational institutions; Ontologies; Centrality Algorithm; DBpedia; Linked Open Data; Named Entity Linking; Named Entity Recognition; Natural Language Processing; Semantic Web; Text Annotation;
Conference_Titel :
Intelligent Systems: Theories and Applications (SITA-14), 2014 9th International Conference on
Conference_Location :
Rabat
Print_ISBN :
978-1-4799-3566-6
DOI :
10.1109/SITA.2014.6847286