DocumentCode
3724111
Title
Automated Feature Learning: Mining Unstructured Data for Useful Abstractions
Author
Abhishek Bafna;Jenna Wiens
Author_Institution
EECS, Univ. of Michigan, Ann Arbor, MI, USA
fYear
2015
Firstpage
703
Lastpage
708
Abstract
When the amount of training data is limited, the successful application of machine learning techniques typically hinges on the ability to identify useful features or abstractions. Expert knowledge often plays a crucial role in this feature engineering process. However, manual creation of such abstractions can be labor intensive and expensive. In this paper, we propose a feature learning framework that takes advantage of the vast amount of expert knowledge available in unstructured form on the Web. We explore the use of unsupervised learning techniques and non-Euclidean distance measures to automatically incorporate such expert knowledge when building feature representations. We demonstrate the utility of our proposed approach on the task of learning useful abstractions from a list of over two thousand patient medications. Applied to three clinically relevant patient risk stratification tasks, the classifiers built using the learned abstractions outperform several baselines including one based on a manually curated feature space.
Keywords
"Knowledge engineering","Data models","Taxonomy","Kernel","Correlation","Hospitals","Buildings"
Publisher
ieee
Conference_Titel
Data Mining (ICDM), 2015 IEEE International Conference on
ISSN
1550-4786
Type
conf
DOI
10.1109/ICDM.2015.115
Filename
7373376
Link To Document