DocumentCode
394352
Title
A multilevel framework to model the inherently confounding nature of sentential F0 contours for recognizing Chinese lexical tones
Author
Zhang, Jin-song ; Hirose, Keikichi ; Nakamura, Satoshi
Author_Institution
ATR Spoken Language Translation Res. Labs., Kyoto, Japan
Volume
1
fYear
2003
fDate
6-10 April 2003
Abstract
This paper presents a multilevel framework for coping with the complex variations in Chinese sentential F0 contours in order to recognize lexical tones. The tone nucleus model removes the influence of intrinsic F0 transition loci at the sub-syllable level. The pitch anchoring concept is used to normalize tonal F0 contours at the syllable level. The hypo- and hyper-intonation model accounts for the interplay of tone coarticulation and higher-level prosodic effects. The whole approach achieved significantly higher performance than the conventional method.
Keywords
hidden Markov models; natural languages; speech recognition; Chinese lexical tones recognition; Chinese sentential F0 contours; F0 transition loci; HMM; automatic tone recognition; hyper-intonation model; hypo-intonation model; multilevel framework; nucleus model; pitch anchoring; prosodic effects; sentential F0 contours; sub-syllable level; syllable level; tone coarticulation; tone nucleus model; Application software; Application specific integrated circuits; Automatic speech recognition; Frequency; Informatics; Labeling; Laboratories; Natural languages; Proposals; Speech recognition;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1198896
Filename
1198896