• DocumentCode
    2933073
  • Title

    A simple yet effective data integration approach to tree-based microarray data classification

  • Author

    Liu, Lin ; Li, Yi ; Liu, Bing ; Li, Jiuyong

  • Author_Institution
    Sch. of Comput. & Inf. Sci., Univ. of South Australia, Mawson Lakes, SA, Australia
  • fYear
    2010
  • fDate
    Aug. 31 2010-Sept. 4 2010
  • Firstpage
    1503
  • Lastpage
    1506
  • Abstract
    Different biological labs conduct similar experiments on same diseases. It is highly desirable to have a better model based on more experimental results than that on a single result. In this paper, we propose a method for integrating microarray data from multiple sources for building classification models. To test the method, we use three real world microarray data sets from different labs with different experimental devices and environments. Although microarray data is well known for its inconsistencies across labs, we demonstrate that it is possible to build consistent models using data sets from multiple labs. We report our method, experimental results and observations in the paper.
  • Keywords
    bioinformatics; cellular biophysics; data analysis; decision trees; genetics; genomics; pattern classification; classification models; data integration; decision tree; gene expression levels; microarray data; random forest; tree-based classification models; Accuracy; Cancer; Classification algorithms; Classification tree analysis; Data models; Joints; Radio frequency; Algorithms; Humans; Information Storage and Retrieval; Lung Neoplasms; Neoplasm Proteins; Oligonucleotide Array Sequence Analysis; Pattern Recognition, Automated; Systems Integration; Tumor Markers, Biological;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Engineering in Medicine and Biology Society (EMBC), 2010 Annual International Conference of the IEEE
  • Conference_Location
    Buenos Aires
  • ISSN
    1557-170X
  • Print_ISBN
    978-1-4244-4123-5
  • Type

    conf

  • DOI
    10.1109/IEMBS.2010.5626842
  • Filename
    5626842