DocumentCode
3360162
Title
A new method for automatic pattern acquisition to extract information from biomedical texts
Author
Huang, Minlie ; Zhu, Xiaoyan ; Li, Ming
Author_Institution
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
Volume
3
fYear
2004
fDate
31 Aug.-4 Sept. 2004
Firstpage
2214
Abstract
Recently there have been many information extraction tasks applied to the biomedical domain, some of which contribute to extract protein-protein interactions from biomedical texts. This paper presents a new method for automatic pattern acquisition to extract protein interactions. The system automatically generates patterns by aligning sequences of tags of sentences from unlabeled corpus. To obtain a high tagging accuracy, we propose a morphology-based tagging method with a pre-tagging strategy for Brill´s tagger. Our method differs from the previous pattern acquisition algorithms in the ways: first, it does not need to provide any seed word or pattern before the algorithm runs; second, we do not apply any parsing algorithm. Lastly, our method, which is based on dynamic programming, is fast.
Keywords
data acquisition; dynamic programming; mathematical morphology; medical computing; pattern recognition; proteins; text analysis; Brill tagger; automatic pattern acquisition; biomedical text; dynamic programming; extract protein-protein interaction; information extraction task; morphology-based tagging method; pre-tagging strategy; Biological processes; Computer science; DNA; Data mining; Dynamic programming; Laboratories; Proteins; Sequences; Tagging; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing, 2004. Proceedings. ICSP '04. 2004 7th International Conference on
Print_ISBN
0-7803-8406-7
Type
conf
DOI
10.1109/ICOSP.2004.1442218
Filename
1442218
Link To Document