• DocumentCode
    3582138
  • Title
    An efficient technique for missing value imputation in microarray gene expression data
  • Author
    Valarmathie, P.; Dinakaran, K.
  • Author_Institution
    Dept. of Comput. Sci. & Eng., Saveetha Eng. Coll., Thandalam, India
  • fYear
    2014
  • Firstpage
    73
  • Lastpage
    80
  • Abstract
    In recent years, rapid developments in genomics and proteomics have generated a large amount of biological data. Handling data at this scale has become extremely challenging for traditional data analysis techniques. Bioinformatics, also known as computational biology, is the interdisciplinary science of interpreting biological data using the tools and techniques of information technology and computer science. However, gene expression data generated by high-throughput microarray experiments often contain missing values, which significantly affect the performance of subsequent statistical analysis and clustering algorithms. There is therefore a great need to estimate, or impute, these missing values as accurately as possible. In general, missing values can be handled by several methods, namely ignoring the tuple, filling the missing value with the attribute mean, or filling it with a global constant. In this paper, a new approach called JAD (Java Application Development) imputation is proposed to estimate missing values more accurately. The results show that the proposed JAD imputation method provides a better solution for completing the microarray gene expression human serum data.
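    The three baseline strategies named in the abstract (ignoring the tuple, attribute mean, global constant) can be illustrated with a minimal Python sketch. This is not the JAD imputation method, whose details the abstract does not give. The sketch assumes a genes-by-samples matrix stored as a list of rows, with NaN marking a missing value and each column treated as an attribute; the function names and the toy matrix are hypothetical.

```python
import math

MISSING = float("nan")  # assumed marker for a missing expression value


def is_missing(value):
    """Return True if the value is the NaN missing-value marker."""
    return isinstance(value, float) and math.isnan(value)


def drop_incomplete_rows(matrix):
    """Ignore the tuple: keep only rows (genes) with no missing values."""
    return [row[:] for row in matrix if not any(is_missing(v) for v in row)]


def fill_with_constant(matrix, constant=0.0):
    """Global constant: replace every missing value with one fixed number."""
    return [[constant if is_missing(v) else v for v in row] for row in matrix]


def fill_with_column_mean(matrix):
    """Attribute mean: replace a missing value with the mean of the
    observed values in the same column (attribute)."""
    n_cols = len(matrix[0])
    means = []
    for j in range(n_cols):
        observed = [row[j] for row in matrix if not is_missing(row[j])]
        # A column with no observed values cannot be imputed; leave it missing.
        means.append(sum(observed) / len(observed) if observed else MISSING)
    return [
        [means[j] if is_missing(v) else v for j, v in enumerate(row)]
        for row in matrix
    ]


# Hypothetical toy matrix: 3 genes (rows) x 3 samples (columns).
expr = [
    [1.0, 2.0, 3.0],
    [MISSING, 4.0, 5.0],
    [3.0, MISSING, 7.0],
]

complete_only = drop_incomplete_rows(expr)
constant_filled = fill_with_constant(expr, constant=0.0)
mean_filled = fill_with_column_mean(expr)
```

    Dropping rows discards two of the three genes in this toy matrix, which shows why deletion is costly when missing values are scattered across many rows; the two filling strategies keep every gene but ignore any correlation between genes.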
  • Keywords
    Java; bioinformatics; data analysis; data mining; genomics; pattern clustering; proteomics; statistical analysis; JAD imputation; Java Application Development imputation; bioinformatics; biological data; clustering algorithms; computational biology; data analysis techniques; genomics; high-throughput microarray experiments; microarray gene expression human serum data; missing value imputation; proteomics; statistical analysis; Bioinformatics; Computers; Conferences; Data mining; Gene expression; Genomics; Data Mining; Gene Expression Data; JAD Imputation; Missing Values;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer Communication and Systems, 2014 International Conference on
  • Print_ISBN
    978-1-4799-3671-7
  • Type
    conf
  • DOI
    10.1109/ICCCS.2014.7068171
  • Filename
    7068171