• DocumentCode
    394352
  • Title

    A multilevel framework to model the inherently confounding nature of sentential F0 contours for recognizing Chinese lexical tones

  • Author

    Jin-song Zhang ; Hirose, Keikichi ; Nakamura, Satoshi

  • Author_Institution
    ATR Spoken Language Translation Res. Labs., Kyoto, Japan
  • Volume
    1
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    This paper presents a multilevel framework to cope with the complex variations in Chinese sentential F0 contours in order to recognize lexical tones. The tone nucleus model removes the influence of intrinsic F0 transition loci at the sub-syllable level. The pitch anchoring concept is used to normalize tonal F0 contours at the syllable level. The hypo- and hyper-intonation model accounts for the interplay of tone coarticulation and higher-level prosodic effects. The whole approach achieved significantly higher performance than the conventional method.
  • Keywords
    hidden Markov models; natural languages; speech recognition; Chinese lexical tones recognition; Chinese sentential F0 contours; F0 transition loci; HMM; automatic tone recognition; hyper-intonation model; hypo-intonation model; multilevel framework; nucleus model; pitch anchoring; prosodic effects; sentential F0 contours; sub-syllable level; syllable level; tone coarticulation; tone nucleus model; Application software; Application specific integrated circuits; Automatic speech recognition; Frequency; Informatics; Labeling; Laboratories; Natural languages; Proposals; Speech recognition;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1198896
  • Filename
    1198896