DocumentCode :
2476612
Title :
A hybrid strategy to protein name recognition
Author :
Wang, Haochang ; Zhao, Tiejun
Author_Institution :
Coll. of Comput. & Inf. Technol., Daqing Pet. Inst., Daqing
fYear :
2008
fDate :
25-27 June 2008
Firstpage :
627
Lastpage :
632
Abstract :
This paper presents a comprehensive approach to identifying protein name in biomedical texts. The new method integrated the generalized Winnow algorithm and the heuristic rules to implement of initial detection of protein name. Moreover, the system introduced a statistic method to analyses the reliability of recognized protein boundary, which can be then used for expanding protein boundary which has low confidence. The experimental results show that this algorithm improves the whole performance for protein name recognition and that effective performance can be achieved in identifying boundary of protein name.
Keywords :
bibliographic systems; data mining; feature extraction; medical information systems; proteins; statistical analysis; text analysis; MEDLINE; biomedical text; feature selection; generalized Winnow algorithm; heuristic rule; name entity recognition; protein name boundary recognition; statistical method; text mining; Amino acids; Automatic speech recognition; Automation; Biomedical computing; Educational institutions; Information technology; Intelligent control; Petroleum; Proteins; Text recognition; Generalized Winnow; boundary expansion; feature selection; name entity recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Control and Automation, 2008. WCICA 2008. 7th World Congress on
Conference_Location :
Chongqing
Print_ISBN :
978-1-4244-2113-8
Electronic_ISBN :
978-1-4244-2114-5
Type :
conf
DOI :
10.1109/WCICA.2008.4592995
Filename :
4592995
Link To Document :
بازگشت