• DocumentCode
    2857842
  • Title

    Acoustic Feature Optimization for Emotion Affected Speech Recognition

  • Author

    Sun, Yanqing ; Zhou, Yu ; Zhao, Qingwei ; Yan, Yonghong

  • Author_Institution
    ThinkIT Speech Lab., Chinese Acad. of Sci., Beijing, China
  • fYear
    2009
  • fDate
    19-20 Dec. 2009
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    This paper tries to deal with the problem of performance degradation in emotion affected speech recognition. The F-ratio analysis method in statistics is utilized to analyze the significance of different frequency bands for speech unit classification. The result is then used to optimize filter bank design for Mel-frequency cepstral coefficients (MFCC) and perceptual linear prediction (PLP) features respectively in emotion affected speech recognition. Under comparable conditions, the modified features get a relative 40.14% decrease for MFCC and 34.93% for PLP in sentence error rate.
  • Keywords
    acoustic signal processing; emotion recognition; feature extraction; optimisation; speech recognition; statistical analysis; F-ratio analysis method; Mel-frequency cepstral coefficients; acoustic feature optimization; emotion affected speech recognition; filter bank design; perceptual linear prediction; performance degradation; speech unit classification; Acoustics; Algorithm design and analysis; Cepstral analysis; Emotion recognition; Filter bank; Mel frequency cepstral coefficient; Pattern recognition; Speech analysis; Speech recognition; Sun;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Engineering and Computer Science, 2009. ICIECS 2009. International Conference on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-1-4244-4994-1
  • Type

    conf

  • DOI
    10.1109/ICIECS.2009.5365821
  • Filename
    5365821