• DocumentCode
    417300
  • Title

    Tone articulation modeling for Mandarin spontaneous speech recognition

  • Author

    Zhou, Jian-lai ; Tian, Ye ; Shi, Yu ; Huang, Chao ; Chang, Eric

  • Author_Institution
    Microsoft Res. Asia, China
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    Tone modeling is an unavoidable problem in Mandarin speech recognition. In continuous speech, the pitch contour exhibits variable patterns, and it is strongly influenced by its tone context. Although several effective methods have been proposed to improve the accuracy for tonal syllables in Mandarin continuous speech recognition, many recognition errors are caused by poor tone discrimination capability of the acoustic model. Furthermore, the case becomes worse for the recognition of spontaneous speech. In this paper, we report our work on tone articulation modeling. Tone context dependent models are used to model unstable pitch patterns caused by co-articulation in continuous speech. Corresponding acoustic features are investigated as well. Our methods are evaluated on two test sets: one is reading-style speech data, the other is spontaneous. The experimental results show that for the test set of casual speech, the proposed method turns out to be more effective than tone context independent model, while they are comparable for the test set of reading-style speech. Several factors which have potential to improve the proposed method are discussed in the final part in this paper.
  • Keywords
    feature extraction; frequency estimation; speech processing; speech recognition; Mandarin spontaneous speech recognition; acoustic features; acoustic model; casual speech; co-articulation; continuous speech; reading-style speech; tone articulation modeling; tone context dependent models; unstable pitch patterns; Chaos; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326156
  • Filename
    1326156