DocumentCode :
983782
Title :
Literature extraction of protein functions using sentence pattern mining
Author :
Chiang, Jung-Hsien ; Yu, Hsu-Chun
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
Volume :
17
Issue :
8
fYear :
2005
Firstpage :
1088
Lastpage :
1098
Abstract :
With the rapid growth of articles of genomics research, it has become a challenge for biomedical researchers to access this ever-increasing quantity of information to understand the newest discovery of functions of proteins they are studying. To facilitate functional annotation of proteins by utilizing the huge amounts of biomedical literature and transforming the knowledge into easily accessible database formats, the text mining technique thus becomes essential. In this paper, we propose the method of sentence pattern mining to extract protein functions from biomedical literature. To recognize variants of function terms correctly, we identify morphological, syntactic, and semantic variation forms. The proposed methods can be used to aid database curators in annotating protein functions and to assist biologists and medical researchers in searching protein functions from biomedical literature.
Keywords :
biology computing; computational linguistics; data mining; genetics; medical information systems; proteins; text analysis; bioinformatics; biomedical literature; genomics; knowledge acquisition; linguistic processing; protein functions; sentence pattern mining; text mining technique; Bioinformatics; Data mining; Databases; Diseases; Genomics; Knowledge acquisition; Ontologies; Organisms; Proteins; Text mining; Index Terms- Text mining; bioinformatics; knowledge acquisition; linguistic processing.;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2005.132
Filename :
1458702
Link To Document :
بازگشت