Title :
Acoustic segmentation using switching state Kalman filter
Author :
Zheng, Yanli ; Hasegawa-Johnson, Mark
Author_Institution :
Dept. of Electr. & Comput. Eng., Illinois Univ., Urbana, IL, USA
Abstract :
Segmenting the acoustic signal in the TIMIT database by a switching state Kalman filter model is reported in this paper. According to the assumption that the high dimensional acoustic feature vector of the LSF (line spectrum frequency) of the speech signal is probably embedded in a low dimensional space, a two dimensional vector is used to represent the continuous state vector in this model. The parameters of the model are initialized by PPCA (probabilistic principal component analysis) and first order vector autoregression, and are re-estimated by the EM algorithm. We show that this model can be used to classify vowels, nasals, frication and silence by an approximate Viterbi inference.
Keywords :
Kalman filters; feature extraction; maximum likelihood estimation; pattern classification; principal component analysis; signal representation; spectral analysis; speech recognition; EM algorithm; LSF; PPCA; TIMIT database; acoustic segmentation; acoustic signal; approximate Viterbi inference; continuous state vector representation; first order vector autoregression; frication; high dimensional acoustic feature vector; line spectrum frequency; nasals; probabilistic principal component analysis; silence; speech signal; switching state Kalman filter; vowel classification; Algorithm design and analysis; Equations; Filters; Frequency; Hidden Markov models; Inference algorithms; Piecewise linear approximation; Spatial databases; Speech recognition; Viterbi algorithm;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
Print_ISBN :
0-7803-7663-3
DOI :
10.1109/ICASSP.2003.1198890