DocumentCode :
3143481
Title :
Data Mining Methods for Protein-Protein Interactions
Author :
Nafar, Zahra ; Golshani, Ashkan
Author_Institution :
Fac. of Sci., Carleton Univ., Ottawa, Ont.
fYear :
2006
fDate :
May 2006
Firstpage :
991
Lastpage :
994
Abstract :
This paper presents recent bioinformatics methods that use data mining techniques to analyze protein-protein interaction data gathered from recent large-scale biological studies, and suggests novel approaches to some of the challenges in this area. Protein-protein interaction data can provide a wealth of information for understanding the biology of a cell. The analysis of these interactions is also important for the discovery of disease-associated proteins, and the data can be used to identify novel cellular sites that are crucial for the development of new and improved pharmaceutical drugs. Knowledge discovery and data mining (KDD) is the process of extracting implicit information from large amounts of data using mathematical and statistical methods. It grows in synergy with computer technology, creating new analytical tools and using them for knowledge discovery in large volumes of data. A multidisciplinary science and technology with links to statistics, machine learning, database systems, computer programming, and visualization, KDD has proved to be a promising solution to various problems in molecular biology and gene analysis. This paper gives an overview of various data mining techniques, with specific examples of their application to protein-protein interaction data analysis. Some of the most widely used data mining techniques for exploring protein interaction data sets are clustering (including supervised and unsupervised), classification, and association rule discovery; others are based on mining interaction information from scientific sources such as PubMed and MedLine. Areas such as prediction and profiling have not been explored much for mining information in protein-protein interactions. We propose methods that employ these novel techniques to analyze protein-protein interaction data.
Keywords :
biological techniques; biology computing; cellular biophysics; data mining; learning (artificial intelligence); pattern classification; pattern clustering; proteins; association rule discovery; bioinformatic method; data mining method; database system; gene analysis; knowledge discovery; machine learning; molecular biology; pharmaceutical drug; protein-protein interaction; Bioinformatics; Cells (biology); Data analysis; Drugs; Large-scale systems; Pharmaceutical technology; Statistical analysis; functional proteomics; genomics; protein interaction network; system biology
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Electrical and Computer Engineering, 2006. CCECE '06. Canadian Conference on
Conference_Location :
Ottawa, Ont.
Print_ISBN :
1-4244-0038-4
Electronic_ISBN :
1-4244-0038-4
Type :
conf
DOI :
10.1109/CCECE.2006.277746
Filename :
4055007