Title :
Identifying named entities in biomedical text based on stacked generalization
Author :
Wang, Haochang ; Zhao, Tiejun
Author_Institution :
Coll. of Comput. & Inf. Technol., Daqing Pet. Inst., Daqing
Abstract :
Biomedical named entity recognition is a basic technique in the biomedical knowledge discovery and its performance has direct effects on further discovery and processing in biomedical texts. In this paper, we present stacked generalization strategy for biomedical named entity recognition including homogeneous classifier ensembles and heterogeneous classifier ensembles based on stacked generalization. Evaluations show that stacked generalization strategy can take advantage of more useful evidences, and make use of compensation and relativity among different classifiers to learn the correlation between individual classifiers predictions and the correct prediction to improve the performances of the system. This method breaks through the limitation of single classifier and achieves promising performances.
Keywords :
data mining; medical information systems; biomedical knowledge discovery; biomedical named entity recognition; classifier ensemble; stacked generalization strategy; Automation; Biomedical computing; Data mining; Educational institutions; Information technology; Intelligent control; Machine learning; Natural language processing; Petroleum; Text recognition; biomedical named entity recognition; classifiers ensemble; stacked generalization;
Conference_Titel :
Intelligent Control and Automation, 2008. WCICA 2008. 7th World Congress on
Conference_Location :
Chongqing
Print_ISBN :
978-1-4244-2113-8
Electronic_ISBN :
978-1-4244-2114-5
DOI :
10.1109/WCICA.2008.4592917