• DocumentCode
    234375
  • Title

    A comparative study of biomedical named entity recognition methods based machine learning approach

  • Author

    Rais, Mohammed ; Lachkar, Abdelhamid ; Lachkar, Abdelhamid ; El Alaoui Ouatik, Said

  • Author_Institution
    LSIS Lab., USMBA, Fez, Morocco
  • fYear
    2014
  • fDate
    20-22 Oct. 2014
  • Firstpage
    329
  • Lastpage
    334
  • Abstract
    Recognizing Biomedical Named Entities (BioNEs) such as genes, proteins, cells, drugs, diseases, etc. play a vital role in many Biomedical Text Mining applications. BioNER fall into five approaches: Dictionary-Based, Rule-Based, Machine-Learning-Based, Statistical-Based, and Hybrid-Based. Methods Based Machine Learning approach, are more effective than those of other approaches, and therefore have been widely used for learning to recognize BioNEs. In this paper, we present a comparative theoretical and experimental study between seven Machine Learning methods, by summarizing their advantages and weaknesses, and comparing their performance on two standard biomedical Corpora (GENIA and JNLPBA). The obtained results show that CRF outperforms all the other Machine-Learning methods on both corpora. That method (CRF) will be integrated in our future works.
  • Keywords
    data mining; learning (artificial intelligence); medical computing; statistics; BioNER; CRF method; GENIA biomedical corpora; JNLPBA biomedical corpora; biomedical named entity recognition methods; biomedical text mining applications; conditional random field method; dictionary-based approach; hybrid-based approach; machine learning approach; rule-based approach; statistical-based approach; Decision support systems; Entropy; Hidden Markov models; Markov processes; Niobium; Proteins; Support vector machines; BioNER; BioNEs; CRFs; DT; HMM; ME; MEMM; Machine Learning; NB; SVM;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Science and Technology (CIST), 2014 Third IEEE International Colloquium in
  • Conference_Location
    Tetouan
  • Print_ISBN
    978-1-4799-5978-5
  • Type

    conf

  • DOI
    10.1109/CIST.2014.7016641
  • Filename
    7016641