Title :
Evaluation of named entity recognition tools on microposts
Author :
Dlugolinsky, Stefan ; Ciglan, Marek ; Laclavik, Michal
Author_Institution :
Inst. of Inf., Bratislava, Slovakia
Abstract :
In this paper we evaluate eight well-known Information Extraction (IE) tools on a task of Named Entity Recognition (NER) in microposts. We have chosen six NLP tools and two Wikipedia concept extractors for the evaluation. Our intent was to see how these tools would perform on relatively short texts of microposts. Evaluation dataset has been adopted from the MSM 2013 IE Challenge. This dataset contained manually annotated microposts with classification restricted to four entity types: PER, LOC, ORG and MISC.
Keywords :
Web sites; natural language processing; text analysis; IE tools; LOC; MISC; MSM 2013 IE Challenge; NER; NLP tools; ORG; PER; Wikipedia concept extractors; evaluation dataset; information extraction tools; manually annotated microposts; named entity recognition tools evaluation; Electronic publishing; Encyclopedias; Feature extraction; Internet; Logic gates; Organizations;
Conference_Titel :
Intelligent Engineering Systems (INES), 2013 IEEE 17th International Conference on
Conference_Location :
San Jose
Print_ISBN :
978-1-4799-0828-8
DOI :
10.1109/INES.2013.6632810