• DocumentCode
    3724111
  • Title

    Automated Feature Learning: Mining Unstructured Data for Useful Abstractions

  • Author

    Abhishek Bafna;Jenna Wiens

  • Author_Institution
    EECS, Univ. of Michigan, Ann Arbor, MI, USA
  • fYear
    2015
  • Firstpage
    703
  • Lastpage
    708
  • Abstract
    When the amount of training data is limited, the successful application of machine learning techniques typically hinges on the ability to identify useful features or abstractions. Expert knowledge often plays a crucial role in this feature engineering process. However, manual creation of such abstractions can be labor intensive and expensive. In this paper, we propose a feature learning framework that takes advantage of the vast amount of expert knowledge available in unstructured form on the Web. We explore the use of unsupervised learning techniques and non-Euclidean distance measures to automatically incorporate such expert knowledge when building feature representations. We demonstrate the utility of our proposed approach on the task of learning useful abstractions from a list of over two thousand patient medications. Applied to three clinically relevant patient risk stratification tasks, the classifiers built using the learned abstractions outperform several baselines including one based on a manually curated feature space.
  • Keywords
    "Knowledge engineering","Data models","Taxonomy","Kernel","Correlation","Hospitals","Buildings"
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2015 IEEE International Conference on
  • ISSN
    1550-4786
  • Type

    conf

  • DOI
    10.1109/ICDM.2015.115
  • Filename
    7373376