DocumentCode :
234375
Title :
A comparative study of biomedical named entity recognition methods based machine learning approach
Author :
Rais, Mohammed ; Lachkar, Abdelhamid ; Lachkar, Abdelhamid ; El Alaoui Ouatik, Said
Author_Institution :
LSIS Lab., USMBA, Fez, Morocco
fYear :
2014
fDate :
20-22 Oct. 2014
Firstpage :
329
Lastpage :
334
Abstract :
Recognizing Biomedical Named Entities (BioNEs) such as genes, proteins, cells, drugs, diseases, etc. play a vital role in many Biomedical Text Mining applications. BioNER fall into five approaches: Dictionary-Based, Rule-Based, Machine-Learning-Based, Statistical-Based, and Hybrid-Based. Methods Based Machine Learning approach, are more effective than those of other approaches, and therefore have been widely used for learning to recognize BioNEs. In this paper, we present a comparative theoretical and experimental study between seven Machine Learning methods, by summarizing their advantages and weaknesses, and comparing their performance on two standard biomedical Corpora (GENIA and JNLPBA). The obtained results show that CRF outperforms all the other Machine-Learning methods on both corpora. That method (CRF) will be integrated in our future works.
Keywords :
data mining; learning (artificial intelligence); medical computing; statistics; BioNER; CRF method; GENIA biomedical corpora; JNLPBA biomedical corpora; biomedical named entity recognition methods; biomedical text mining applications; conditional random field method; dictionary-based approach; hybrid-based approach; machine learning approach; rule-based approach; statistical-based approach; Decision support systems; Entropy; Hidden Markov models; Markov processes; Niobium; Proteins; Support vector machines; BioNER; BioNEs; CRFs; DT; HMM; ME; MEMM; Machine Learning; NB; SVM;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Science and Technology (CIST), 2014 Third IEEE International Colloquium in
Conference_Location :
Tetouan
Print_ISBN :
978-1-4799-5978-5
Type :
conf
DOI :
10.1109/CIST.2014.7016641
Filename :
7016641
Link To Document :
بازگشت