Title :
A comparative study of biomedical named entity recognition methods based machine learning approach
Author :
Rais, Mohammed ; Lachkar, Abdelhamid ; Lachkar, Abdelhamid ; El Alaoui Ouatik, Said
Author_Institution :
LSIS Lab., USMBA, Fez, Morocco
Abstract :
Recognizing Biomedical Named Entities (BioNEs) such as genes, proteins, cells, drugs, diseases, etc. play a vital role in many Biomedical Text Mining applications. BioNER fall into five approaches: Dictionary-Based, Rule-Based, Machine-Learning-Based, Statistical-Based, and Hybrid-Based. Methods Based Machine Learning approach, are more effective than those of other approaches, and therefore have been widely used for learning to recognize BioNEs. In this paper, we present a comparative theoretical and experimental study between seven Machine Learning methods, by summarizing their advantages and weaknesses, and comparing their performance on two standard biomedical Corpora (GENIA and JNLPBA). The obtained results show that CRF outperforms all the other Machine-Learning methods on both corpora. That method (CRF) will be integrated in our future works.
Keywords :
data mining; learning (artificial intelligence); medical computing; statistics; BioNER; CRF method; GENIA biomedical corpora; JNLPBA biomedical corpora; biomedical named entity recognition methods; biomedical text mining applications; conditional random field method; dictionary-based approach; hybrid-based approach; machine learning approach; rule-based approach; statistical-based approach; Decision support systems; Entropy; Hidden Markov models; Markov processes; Niobium; Proteins; Support vector machines; BioNER; BioNEs; CRFs; DT; HMM; ME; MEMM; Machine Learning; NB; SVM;
Conference_Titel :
Information Science and Technology (CIST), 2014 Third IEEE International Colloquium in
Conference_Location :
Tetouan
Print_ISBN :
978-1-4799-5978-5
DOI :
10.1109/CIST.2014.7016641