Title :
Probabilistic Sequence Translation-Alignment Model for Time-Series Classification
Author_Institution :
Seoul Nat. Univ. of Sci. & Technol., Seoul, South Korea
Abstract :
We tackle the time-series classification problem using a novel probabilistic model that represents the conditional densities of the observed sequences being time-warped and transformed from an underlying base sequence. We call it probabilistic sequence translation-alignment model (PSTAM) since it aims to capture both feature alignment and mapping between sequences, analogous to translating one language into another in the field of machine translation. To deal with general time-series, we impose the time-monotonicity constraints on the hidden alignment variables in the model parameter space, where by marginalizing them out it allows effective learning of class-specific time-warping and feature transformation simultaneously. Our PSTAM, thus, naturally enjoys the advantages from two typical approaches widely used in sequence classification: 1) benefits from the alignment-based methods that aim to estimate distance measures between non-equal-length sequences via direct comparison of aligned features, and 2) merits of the model-based approaches that can effectively capture the class-specific patterns or trends. Furthermore, the low-dimensional modeling of the latent base sequence naturally provides a way to discover the intrinsic manifold structure possibly retained in the observed data, leading to an unsupervised manifold learning for sequence data. The benefits of the proposed approach are demonstrated on a comprehensive set of evaluations with both synthetic and real-world sequence data sets.
Keywords :
language translation; learning (artificial intelligence); pattern classification; probability; time series; PSTAM; alignment variables; alignment-based methods; class-specific patterns; class-specific time-warping; conditional densities; distance measures; feature alignment; feature mapping; feature transformation; intrinsic manifold structure; low-dimensional modeling; machine translation; model parameter space; nonequal-length sequences; probabilistic model; probabilistic sequence translation-alignment model; sequence classification; sequence data; time-monotonicity constraints; time-series classification problem; unsupervised manifold learning; Computational modeling; Data models; Hidden Markov models; Manifolds; Probabilistic logic; Training; Training data; Time-series classification; probabilistic models; sequence alignment;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
DOI :
10.1109/TKDE.2013.8