DocumentCode :
1831587
Title :
Towards a minimum description length based stopping criterion for semi-supervised time series classification
Author :
Begum, Nurjahan ; Bing Hu ; Rakthanmanon, Thanawin ; Keogh, Eamonn
fYear :
2013
fDate :
14-16 Aug. 2013
Firstpage :
333
Lastpage :
340
Abstract :
In the last decade the plunging costs of sensors/storage have made it possible to obtain vast amounts of medical telemetry. However for this data to be useful, it must be annotated. This annotation, requiring the attention of medical experts is very expensive and time consuming, and remains the critical bottleneck in medical analysis. Semi-supervised learning is an obvious way to mitigate the need for human labor, however, most such algorithms are designed for intrinsically discrete objects, and do not work well in this domain, which requires the ability to deal with real-valued objects arriving in a streaming fashion. In this work we make two contributions. First, we demonstrate that in many cases just a handful of human annotated examples are sufficient to perform accurate classification. Second, we devise a novel parameter-free stopping criterion for semi-supervised learning. We evaluate our work with a comprehensive set of experiments on diverse medical data sources including electrocardiograms. Our experimental results show that our approach can construct accurate classifiers even if given only a single annotated instance.
Keywords :
biomedical telemetry; learning (artificial intelligence); medical information systems; pattern classification; time series; electrocardiograms; human annotated examples; human labor; intrinsically discrete objects; medical analysis; medical data sources; medical experts; medical telemetry; minimum description length based stopping criterion; parameter-free stopping criterion; semisupervised learning; semisupervised time series classification; single annotated instance; streaming fashion; Classification algorithms; Encoding; Euclidean distance; Semisupervised learning; Supervised learning; Time series analysis; Training; MDL; Semi-Supervised Learning; Time Series;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Reuse and Integration (IRI), 2013 IEEE 14th International Conference on
Conference_Location :
San Francisco, CA
Type :
conf
DOI :
10.1109/IRI.2013.6642490
Filename :
6642490
Link To Document :
بازگشت