DocumentCode
827167
Title
Data mining in protein interactomics
Author
Chen, Jake Y. ; Sivachenko, Andrey Y.
Author_Institution
Indiana Univ., Indianapolis, IN, USA
Volume
24
Issue
3
fYear
2005
Firstpage
95
Lastpage
102
Abstract
In this article, protein interactomics, an emerging field that studies the total collection of proteins and intracellular protein interactions in an organism, i.e., the study of protein interactomes is introduced. Protein interactomics is concerned with all the expressed proteins in a given tissue or cell type and how proteins physically interact with, or bind to, one another in the protein interaction network. Protein interactomes can provide information about protein functional links and protein functional context not apparent from either protein sequence analysis or protein expression analysis. By studying protein interactomics, biologists can compile biological pathway models to understand functional roles of previously uncharacterized proteins and biological processes in different developmental and environmental conditions. The paper discussed new biological discovery opportunities by presenting six specific data mining challenges in protein interactomics - data generation, data representation, data cleansing, data integration, data analysis/visualization, and knowledge curation.
Keywords
biological tissues; biology computing; cellular biophysics; data analysis; data mining; data structures; data visualisation; molecular biophysics; proteins; biological pathway models; data analysis/visualization; data cleansing; data generation; data integration; data mining; data representation; intracellular protein interactions; knowledge curation; organism; protein binding; protein expression; protein functional context; protein functional links; protein interactomics; protein sequence; tissue; Bioinformatics; Biological processes; Biological system modeling; Biology computing; Cells (biology); Data mining; Genomics; Information analysis; Protein engineering; Sequences; Algorithms; Artificial Intelligence; Computational Biology; Computer Simulation; Database Management Systems; Gene Expression Profiling; Information Storage and Retrieval; Models, Biological; Protein Interaction Mapping; Proteome; Proteomics; Software; Systems Integration; User-Computer Interface;
fLanguage
English
Journal_Title
Engineering in Medicine and Biology Magazine, IEEE
Publisher
ieee
ISSN
0739-5175
Type
jour
DOI
10.1109/MEMB.2005.1436466
Filename
1436466
Link To Document