DocumentCode :
3360162
Title :
A new method for automatic pattern acquisition to extract information from biomedical texts
Author :
Huang, Minlie ; Zhu, Xiaoyan ; Li, Ming
Author_Institution :
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
Volume :
3
fYear :
2004
fDate :
31 Aug.-4 Sept. 2004
Firstpage :
2214
Abstract :
Recently there have been many information extraction tasks applied to the biomedical domain, some of which contribute to extract protein-protein interactions from biomedical texts. This paper presents a new method for automatic pattern acquisition to extract protein interactions. The system automatically generates patterns by aligning sequences of tags of sentences from unlabeled corpus. To obtain a high tagging accuracy, we propose a morphology-based tagging method with a pre-tagging strategy for Brill´s tagger. Our method differs from the previous pattern acquisition algorithms in the ways: first, it does not need to provide any seed word or pattern before the algorithm runs; second, we do not apply any parsing algorithm. Lastly, our method, which is based on dynamic programming, is fast.
Keywords :
data acquisition; dynamic programming; mathematical morphology; medical computing; pattern recognition; proteins; text analysis; Brill tagger; automatic pattern acquisition; biomedical text; dynamic programming; extract protein-protein interaction; information extraction task; morphology-based tagging method; pre-tagging strategy; Biological processes; Computer science; DNA; Data mining; Dynamic programming; Laboratories; Proteins; Sequences; Tagging; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing, 2004. Proceedings. ICSP '04. 2004 7th International Conference on
Print_ISBN :
0-7803-8406-7
Type :
conf
DOI :
10.1109/ICOSP.2004.1442218
Filename :
1442218
Link To Document :
بازگشت