Title :
A multilevel framework to model the inherently confounding nature of sentential F0sentential F0 contours contours for recognizing Chinese lexical tones
Author :
Jin-song Zhang ; Hirose, Keikichi ; Nakamura, Satoshi
Author_Institution :
ATR Spoken Language Translation Res. Labs., Kyoto, Japan
Abstract :
This paper presents a multilevel framework to cope with the complex variations in Chinese sentential F0 contours in order to recognize lexical tones. Tone nucleus model is to get rid of the influence of intrinsic F0 transition loci at sub-syllable level. The pitch anchoring concept is used to normalize tonal F0 contours at syllable level. The hypo- and hyper-intonation model is used to account for the interplay of tone coarticulation and higher level prosodic effects. The whole approach achieved significant higher performance than the conventional method.
Keywords :
hidden Markov models; natural languages; speech recognition; Chinese lexical tones recognition; Chinese sentential F0 contours; F0 transition loci; HMM; automatic tone recognition; hyper-intonation model; hypo-intonation model; multilevel framework; nucleus model; pitch anchoring; prosodic effects; sentential F0 contours; sub-syllable level; syllable level; tone coarticulation; tone nucleus model; Application software; Application specific integrated circuits; Automatic speech recognition; Frequency; Informatics; Labeling; Laboratories; Natural languages; Proposals; Speech recognition;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
Print_ISBN :
0-7803-7663-3
DOI :
10.1109/ICASSP.2003.1198896