• DocumentCode
    498850
  • Title

    A new robust LPCCS for speech recogniton in channel distortion environments

  • Author

    He, Yongjun ; Han, Jiqing

  • Author_Institution
    Speech Process. Lab., Harbin Inst. of Technol., Harbin, China
  • Volume
    4
  • fYear
    2009
  • fDate
    12-15 July 2009
  • Firstpage
    2358
  • Lastpage
    2362
  • Abstract
    Linear prediction (LP) analysis plays an important role in speech processing. linear prediction cepstral coefficients (LPCCs) are proved to be better features for speech recognition. In LP analysis, the generation of speech is modeled with an autoregressive (AR) random process. In fact, speech from vocal cord goes through vocal tract first, and then goes through channel. However, general LP analysis does not consider the effect of channel and the LPCCs show little robustness in speech recognition under channel distortion. In this paper, distorted speech is modeled by two different AR models which denote vocal tract and channel, respectively. After the poles are computed, K-means is used to cluster all poles and a new method is proposed to choose Channel poles. For each frame of test utterances, instead of designing a filter using channel poles, robust LPCCs are computed directly by removing channel poles from all poles. Experiment shows that this approach can obtain a better performance than general methods.
  • Keywords
    autoregressive processes; filtering theory; speech recognition; K-means method; autoregressive random process; channel distortion environments; channel poles; filter design; linear prediction analysis; linear prediction cepstral Coefficients; speech processing; speech recognition; Cepstral analysis; Cybernetics; Machine learning; Nonlinear distortion; Predictive models; Robustness; Speech analysis; Speech processing; Speech recognition; Testing; Channel distortion; Feature extraction; Linear prediction; Speech recognition; Tracking;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics, 2009 International Conference on
  • Conference_Location
    Baoding
  • Print_ISBN
    978-1-4244-3702-3
  • Electronic_ISBN
    978-1-4244-3703-0
  • Type

    conf

  • DOI
    10.1109/ICMLC.2009.5212203
  • Filename
    5212203