DocumentCode :
2922955
Title :
Data mining and its applications in bioinformatics: Techniques and methods
Author :
Hu, Xiaohua
Author_Institution :
Coll. of Inf. Sci. & Technol., Drexel Univ., Philadelphia, PA, USA
fYear :
2011
fDate :
8-10 Nov. 2011
Firstpage :
3
Lastpage :
3
Abstract :
In this talk, I will discuss some of the latest data mining techniques and methods and their applications in bioinformatics study, focusing on data integration, text mining and graph-based data mining in bioinformatics research. In data integration, I will present a semantic-based approach for multi source bioinformatics data integration. In our approach, a metamodel is utilized to represent the master search schema, and an effective interface extraction algorithm based on the hierarchical structure of the web and pattern is developed to capture the rich semantic relationships of the online bioinformatics data sources. Our final goal is to develop a meta-search interface for biologists as a single point of access to multiple online bioinformatics databases. In text mining, some of the challenging issues in mining and searching the biomedical literature are addressed, and I will present a unified architecture Bio-SET-DM (Biomedical Literature Searching, Extraction and Text Data Mining), discuss some novel algorithms such as semantic-based language model for literature retrieval, semi-supervised pattern learning for information extraction of biological relationships from biomedical literature. In the third part, graph-based data mining, the focus is on graph-based mining in biological networks. I will discuss how to apply graph-based mining techniques and algorithms in the analysis of modular and hierarchical structure of biological networks, how to identify and evaluate the subnetworks from complicated biological networks, and present the experimental results. To put these pieces together, a unified framework is introduced to integrate the three parts (data integration, text mining and graph-based data mining) in the bioinformatics data mining procedure.
Keywords :
Internet; bioinformatics; data mining; graph theory; information retrieval; learning (artificial intelligence); literature; pattern clustering; text analysis; user interfaces; Bio-SET-DM; bioinformatic research; biological networks; biomedical literature searching; graph based data mining; hierarchical structure; information extraction; interface extraction algorithm; literature retrieval; master search schema; metamodel; metasearch interface; modular structure; multisource bioinformatic data integration; online bioinformatic data sources; online bioinformatic databases; semantic based language model; semantic relationships; semisupervised pattern learning; text data mining; Awards activities; Bioinformatics; Biology; Conferences; Data mining; Educational institutions; Intelligent systems;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Granular Computing (GrC), 2011 IEEE International Conference on
Conference_Location :
Kaohsiung
Print_ISBN :
978-1-4577-0372-0
Type :
conf
DOI :
10.1109/GRC.2011.6122559
Filename :
6122559
Link To Document :
بازگشت