• DocumentCode
    2834323
  • Title

    A Probability Based Approach for Processing Dimension Missing Data

  • Author

    Cheng, Yu ; Zhang, Tao

  • Author_Institution
    Dept. of Biomed. Eng., Tsinghua Univ., Beijing, China
  • fYear
    2009
  • fDate
    19-20 Dec. 2009
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Processing missing value is one of the most important task in data mining. A great many applications, such as social commercial record, biological systems and remote sensing network, in which not only data values from particular features but even data dimension information may also be missing. Such missing values are known as dimension missing values-standard operation over these data may result in unrepresentable or uncertain problems. To tackle this problem of dealing with dimension missing data, in this paper, we first propose a probabilistic model to managing such data. Then, instead of enumerating all possible cases to recover the missed dimensions, we develop an effective and efficient bound confidence approach to speed up the retrieval process. A concrete evaluation using real data sets is reported, which shows that our method is effective and efficient on dimension incomplete data.
  • Keywords
    data mining; information retrieval; probability; bound confidence approach; data mining; dimension missing data processing; missing value processing; probability based approach; retrieval process; Automation; Biological systems; Biomedical engineering; Cleaning; Concrete; Data analysis; Data mining; Information retrieval; Remote sensing; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Engineering and Computer Science, 2009. ICIECS 2009. International Conference on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-1-4244-4994-1
  • Type

    conf

  • DOI
    10.1109/ICIECS.2009.5364328
  • Filename
    5364328