Title :
Intelligent Agent System for Bio-medical Literature Mining
Author :
Islam, Md Tawhidul ; Bollina, Durgaprasad ; Nayak, Abhaya ; Ranganathan, Shoba
Author_Institution :
Macquarie Univ., Sydney
Abstract :
Advances in World Wide Web technology and research in the bioinformatics and systems biology domains have highlighted the increasing need for automatic information extraction (IE) systems that extract information from scientific literature databases. Extracting scientific information from biomedical articles is a central task in supporting biomarker discovery efforts. In this paper, we propose an algorithm that extracts scientific information on biomarker-related entities such as genes, genomes, diseases, alleles, and cells from text by identifying the focal topic of the document and extracting the most relevant properties of that topic. The topic and its properties are represented as semantic networks and then stored in a database. The IE algorithm extracts the most important biological terms and relations using statistical and pattern-matching NLP techniques. The IE tool is expected to help researchers obtain the latest information on biomarker discovery and related biomedical research advances. We show preliminary results demonstrating that the method has strong potential to support biomarker discovery.
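The abstract does not give the algorithm itself, so the following is only a minimal sketch of the pipeline it describes (focal topic by term statistics, relations by pattern matching, semantic network stored as edges in a database). The lexicon, the relation phrases, the table name `edges`, and all function names are assumptions made for illustration, not the authors' implementation.

```python
import re
import sqlite3
from collections import Counter

# Hypothetical relation phrases; the paper does not list its patterns.
RELATION_VERBS = ("is associated with", "regulates", "interacts with")


def find_terms(text, lexicon):
    """Return lexicon terms found in text, in order of appearance, lowercased."""
    # Longest terms first so multi-word terms win over their substrings.
    ordered = sorted(lexicon, key=len, reverse=True)
    pattern = re.compile(
        r"\b(" + "|".join(re.escape(t) for t in ordered) + r")\b", re.I
    )
    return [m.group(1).lower() for m in pattern.finditer(text)]


def focal_topic(text, lexicon):
    """Statistical step (assumed): the most frequent lexicon term is the topic."""
    counts = Counter(find_terms(text, lexicon))
    return counts.most_common(1)[0][0] if counts else None


def extract_triples(text, lexicon):
    """Pattern-matching step (assumed): '<topic> <relation> <term>' per sentence."""
    topic = focal_topic(text, lexicon)
    triples = []
    if topic is None:
        return topic, triples
    for sentence in re.split(r"(?<=[.!?])\s+", text):
        lowered = sentence.lower()
        for verb in RELATION_VERBS:
            marker = f"{topic} {verb} "
            idx = lowered.find(marker)
            if idx == -1:
                continue
            objects = find_terms(lowered[idx + len(marker):], lexicon)
            if objects:
                triples.append((topic, verb, objects[0]))
    return topic, triples


def store(triples, conn):
    """Persist the semantic network as (subject, relation, object) edges."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS edges "
        "(subject TEXT, relation TEXT, object TEXT)"
    )
    conn.executemany("INSERT INTO edges VALUES (?, ?, ?)", triples)
    conn.commit()


if __name__ == "__main__":
    lexicon = {"brca1", "tp53", "breast cancer"}  # hypothetical mini-lexicon
    text = (
        "BRCA1 is associated with breast cancer. "
        "BRCA1 interacts with TP53. "
        "BRCA1 mutations are studied widely."
    )
    topic, triples = extract_triples(text, lexicon)
    conn = sqlite3.connect(":memory:")
    store(triples, conn)
    print(topic, conn.execute("SELECT * FROM edges").fetchall())
```

A real system would replace the toy lexicon with a curated biomedical vocabulary and the fixed phrases with richer patterns; the sketch only shows how the three stages fit together.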
Keywords :
data mining; information retrieval; medical administrative data processing; multi-agent systems; nonlinear programming; semantic networks; statistical analysis; NLP techniques; World Wide Web technology; automatic information extraction; bioinformatics; biology domain; biomarker discovery efforts; biomedical literature mining; intelligent agent system; pattern matching; semantic networks; statistical matching; Bioinformatics; Biomarkers; Data mining; Databases; Diseases; Genomics; Intelligent agent; Pattern matching; Systems biology; Web sites;
Conference_Title :
Information and Communication Technology, 2007. ICICT '07. International Conference on
Conference_Location :
Dhaka
Print_ISBN :
984-32-3394-8
DOI :
10.1109/ICICT.2007.375342