• DocumentCode
    394346
  • Title

    Acoustic segmentation using switching state Kalman filter

  • Author

    Zheng, Yanli ; Hasegawa-Johnson, Mark

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Illinois Univ., Urbana, IL, USA
  • Volume
    1
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    Segmenting the acoustic signal in the TIMIT database by a switching state Kalman filter model is reported in this paper. According to the assumption that the high dimensional acoustic feature vector of the LSF (line spectrum frequency) of the speech signal is probably embedded in a low dimensional space, a two dimensional vector is used to represent the continuous state vector in this model. The parameters of the model are initialized by PPCA (probabilistic principal component analysis) and first order vector autoregression, and are re-estimated by the EM algorithm. We show that this model can be used to classify vowels, nasals, frication and silence by an approximate Viterbi inference.
  • Keywords
    Kalman filters; feature extraction; maximum likelihood estimation; pattern classification; principal component analysis; signal representation; spectral analysis; speech recognition; EM algorithm; LSF; PPCA; TIMIT database; acoustic segmentation; acoustic signal; approximate Viterbi inference; continuous state vector representation; first order vector autoregression; frication; high dimensional acoustic feature vector; line spectrum frequency; nasals; probabilistic principal component analysis; silence; speech signal; switching state Kalman filter; vowel classification; Algorithm design and analysis; Equations; Filters; Frequency; Hidden Markov models; Inference algorithms; Piecewise linear approximation; Spatial databases; Speech recognition; Viterbi algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1198890
  • Filename
    1198890