DocumentCode :
417283
Title :
A study on robust segmentation and location of tone nuclei in Chinese continuous speech
Author :
Zhang, Jin-Song ; Hirose, Keikichi
Author_Institution :
ATR Spoken Language Translation Res. Labs., Kyoto, Japan
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
Tone nuclei in continuous speech are regarded as efficient targets for either tone recognition or intonation function decomposition. The paper presents our statistically robust method to segment and locate tone nuclei in continuous speech. The method includes: an iterative segmental K-means segmentation of the tonal F0 contours, which is further aided with t-test based segment amalgamation; a linear discriminant function based tone nucleus discriminator, whose features are selected by the sequential feature selection method. The developed system achieved 97.5% correct tone nuclei on a speaker dependent task. The tone recognizer based on the detected tone nuclei improved tone recognition rate by over 6% more than the baseline ones using the full tonal syllable features.
Keywords :
iterative methods; natural languages; speech recognition; Chinese continuous speech; automatic speech recognition; intonation function decomposition; iterative segmental K-means segmentation; linear discriminant function; sequential feature selection; speaker dependent tone recognition; t-test based segment amalgamation; tonal F0 contours; tonal syllable features; tone nuclei location; tone nuclei segmentation; tone nucleus discriminator; Automatic speech recognition; Cities and towns; Informatics; Iterative methods; Laboratories; Loudspeakers; Natural languages; Robustness; Speech recognition; Target recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1326135
Filename :
1326135
Link To Document :
بازگشت