DocumentCode
394346
Title
Acoustic segmentation using switching state Kalman filter
Author
Zheng, Yanli ; Hasegawa-Johnson, Mark
Author_Institution
Dept. of Electr. & Comput. Eng., Illinois Univ., Urbana, IL, USA
Volume
1
fYear
2003
fDate
6-10 April 2003
Abstract
Segmenting the acoustic signal in the TIMIT database by a switching state Kalman filter model is reported in this paper. According to the assumption that the high dimensional acoustic feature vector of the LSF (line spectrum frequency) of the speech signal is probably embedded in a low dimensional space, a two dimensional vector is used to represent the continuous state vector in this model. The parameters of the model are initialized by PPCA (probabilistic principal component analysis) and first order vector autoregression, and are re-estimated by the EM algorithm. We show that this model can be used to classify vowels, nasals, frication and silence by an approximate Viterbi inference.
Keywords
Kalman filters; feature extraction; maximum likelihood estimation; pattern classification; principal component analysis; signal representation; spectral analysis; speech recognition; EM algorithm; LSF; PPCA; TIMIT database; acoustic segmentation; acoustic signal; approximate Viterbi inference; continuous state vector representation; first order vector autoregression; frication; high dimensional acoustic feature vector; line spectrum frequency; nasals; probabilistic principal component analysis; silence; speech signal; switching state Kalman filter; vowel classification; Algorithm design and analysis; Equations; Filters; Frequency; Hidden Markov models; Inference algorithms; Piecewise linear approximation; Spatial databases; Speech recognition; Viterbi algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1198890
Filename
1198890
Link To Document