DocumentCode
417283
Title
A study on robust segmentation and location of tone nuclei in Chinese continuous speech
Author
Zhang, Jin-Song ; Hirose, Keikichi
Author_Institution
ATR Spoken Language Translation Res. Labs., Kyoto, Japan
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
Tone nuclei in continuous speech are regarded as efficient targets for either tone recognition or intonation function decomposition. The paper presents our statistically robust method to segment and locate tone nuclei in continuous speech. The method includes: an iterative segmental K-means segmentation of the tonal F0 contours, which is further aided with t-test based segment amalgamation; a linear discriminant function based tone nucleus discriminator, whose features are selected by the sequential feature selection method. The developed system achieved 97.5% correct tone nuclei on a speaker dependent task. The tone recognizer based on the detected tone nuclei improved tone recognition rate by over 6% more than the baseline ones using the full tonal syllable features.
Keywords
iterative methods; natural languages; speech recognition; Chinese continuous speech; automatic speech recognition; intonation function decomposition; iterative segmental K-means segmentation; linear discriminant function; sequential feature selection; speaker dependent tone recognition; t-test based segment amalgamation; tonal F0 contours; tonal syllable features; tone nuclei location; tone nuclei segmentation; tone nucleus discriminator; Automatic speech recognition; Cities and towns; Informatics; Iterative methods; Laboratories; Loudspeakers; Natural languages; Robustness; Speech recognition; Target recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1326135
Filename
1326135
Link To Document