• DocumentCode
    2017976
  • Title

    DCT-based processing of dynamic features for robust speech recognition

  • Author

    Lin, Wen-Chi ; Fan, Hao-Teng ; Hung, Jeih-weih

  • Author_Institution
    Dept of Electr. Eng., Nat. Chi Nan Univ., Nantou, Taiwan
  • fYear
    2010
  • fDate
    Nov. 29 2010-Dec. 3 2010
  • Firstpage
    12
  • Lastpage
    17
  • Abstract
    In this paper, we explore the various properties of cepstral time coefficients (CTC) in speech recognition, and then propose several methods to refine the CTC construction process. It is found that CTC are the filtered version of mel-frequency cepstral coefficients (MFCC), and the used filters are from the discrete cosine transform (DCT) matrix. We modify these DCT-based filters by windowing, removing DC gain, and varying the filter length. The speech recognition task using Aurora-2 digit database show that the proposed methods can enhance the original CTC in improving the recognition accuracy. The resulting relative error reduction is around 20%.
  • Keywords
    discrete cosine transforms; matrix algebra; speech recognition; Aurora-2 digit database; CTC construction process; DCT-based processing; cepstral time coefficient; discrete cosine transform matrix; melfrequency cepstral coefficient; speech recognition; windowing; Discrete cosine transforms; Frequency modulation; Frequency response; Mel frequency cepstral coefficient; Speech; Speech recognition; automatic speech recognition; discrete cosine transform; temporal filter;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
  • Conference_Location
    Tainan
  • Print_ISBN
    978-1-4244-6244-5
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2010.5684893
  • Filename
    5684893