• DocumentCode
    390493
  • Title

    Integration of tone related feature for Chinese speech recognition

  • Author

    Wong, Pui-Fung ; Siu, Man-Hung

  • Author_Institution
    Dept. of Electr. & Electron. Eng., Hong Kong Univ. of Sci. & Technol., Kowloon, China
  • Volume
    1
  • fYear
    2002
  • fDate
    26-30 Aug. 2002
  • Firstpage
    476
  • Abstract
    Chinese is a tonal language that uses tones, in addition to phones for word differentiation. Commonly used front-end features, such as Mel-frequency cepstral coefficients (MFCC), however, are optimized for non-tonal languages such as English and explicitly remove vocal tract information that is important for tone identification. In this paper, we examine the integration of tone-related acoustic features for Chinese recognition.. We propose the use of a cepstrum method (CEP), which uses the same window as in MFCC extraction, for the extraction of pitch-related features. The pitch periods extracted from the CEP algorithm can be used directly for speech recognition and do not require any special treatment for unvoiced frames. In addition, we explore a number of feature transformations and find that the addition of a properly normalized and transformed set of pitch related-features can reduce the recognition error rate from 34.61% to 29.45% on the Chinese 1998 National Performance Assessment (Project 863) corpus.
  • Keywords
    cepstral analysis; feature extraction; speech recognition; Chinese; Mel-frequency cepstral coefficients; cepstrum method; pitch-related features; recognition error rate; speech recognition; tonal language; tone-related acoustic features; word differentiation; Cepstral analysis; Cepstrum; Data mining; Detection algorithms; Error analysis; Error correction; Hidden Markov models; Mel frequency cepstral coefficient; Natural languages; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing, 2002 6th International Conference on
  • Print_ISBN
    0-7803-7488-6
  • Type

    conf

  • DOI
    10.1109/ICOSP.2002.1181095
  • Filename
    1181095