Title :
Similarity-based searching in multi-parameter time series databases
Author :
Lehman, L.H. ; Saeed, M. ; Moody, G.B. ; Mark, R.G.
Author_Institution :
Div. of Health Sci. & Technol., Harvard-MIT, Cambridge, MA
Abstract :
We present a similarity-based searching and pattern matching algorithm that identifies time series data with similar temporal dynamics in large-scale, multi-parameter databases. We represent time series segments by feature vectors that reflect the dynamical patterns of single and multi-dimensional physiological time series. Features include regression slopes at varying time scales, maximum transient changes, auto-correlation coefficients of individual signals, and cross correlations among multiple signals. We model the dynamical patterns with a Gaussian mixture model (GMM) learned with the Expectation Maximization algorithm, and compute similarity between segments as Mahalanobis distances. We evaluate the use of our algorithm in three applications: search-by-example based data retrieval, event classification, and forecasting, using synthetic and real physiologic time series from a variety of sources.
Keywords :
cardiology; expectation-maximisation algorithm; medical signal processing; pattern matching; regression analysis; search problems; time series; Gaussian mixture model; Mahalanobis distances; autocorrelation coefficients; event classification; expectation maximization algorithm; forecasting; maximum transient changes; multiparameter time series database; pattern matching algorithm; regression slopes; search-by-example based data retrieval; similarity-based searching; temporal dynamics; Autocorrelation; Biomedical monitoring; Clustering algorithms; Diseases; Event detection; Information retrieval; Large-scale systems; Pattern matching; Roentgenium; Spatial databases;
Conference_Titel :
Computers in Cardiology, 2008
Conference_Location :
Bologna
Print_ISBN :
978-1-4244-3706-1
DOI :
10.1109/CIC.2008.4749126