DocumentCode :
3317204
Title :
Named Entity Recognition from Biomedical Text Using SVM
Author :
Ju, Zhenfei ; Jian Wang ; Zhu, Fei
Author_Institution :
Sch. of Comput. Sci. & Technol., Soochow Univ., Suzhou, China
fYear :
2011
fDate :
10-12 May 2011
Firstpage :
1
Lastpage :
4
Abstract :
Nowadays biomedical research is developing rapidly. A large number of biomedical knowledge exists in the form of unstructured text documents in various files. Named Entity Recognition (NER) from biomedical text is one of the basic task s of biomedical text mining, of which purpose is to recognize the name of the specified type from the collection of biomedical text. NER result is usually the processing object of other text mining. NER from biological text is the foundation of bioinformatics research. At present, the best f-measure of biological named entity recognition system has reached more than 80%, but is lower than general NER system which can reach about 90%. Here we use support vector machine (SVM), which is an effective and efficient tool to analyze data and recognize patterns, to recognize biomedical named entity. We get data set from GENIA corpus which is a collection of Medline abstracts. In the experiment, we get precision rate= 84.24% and recall rate=80.76% finally.
Keywords :
bioinformatics; data analysis; data mining; document handling; pattern recognition; support vector machines; GENIA corpus; Medline abstract; NER system; SVM; bioinformatics research; biomedical knowledge; biomedical research; biomedical text mining; data analysis; named entity recognition; pattern recognition; support vector machine; unstructured text document; Biological information theory; Conferences; Data mining; Hidden Markov models; Support vector machines; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedical Engineering, (iCBBE) 2011 5th International Conference on
Conference_Location :
Wuhan
ISSN :
2151-7614
Print_ISBN :
978-1-4244-5088-6
Type :
conf
DOI :
10.1109/icbbe.2011.5779984
Filename :
5779984
Link To Document :
بازگشت