Title :
A study on robust segmentation and location of tone nuclei in Chinese continuous speech
Author :
Zhang, Jin-Song ; Hirose, Keikichi
Author_Institution :
ATR Spoken Language Translation Res. Labs., Kyoto, Japan
Abstract :
Tone nuclei in continuous speech are regarded as efficient targets for either tone recognition or intonation function decomposition. The paper presents our statistically robust method to segment and locate tone nuclei in continuous speech. The method includes: an iterative segmental K-means segmentation of the tonal F0 contours, which is further aided with t-test based segment amalgamation; a linear discriminant function based tone nucleus discriminator, whose features are selected by the sequential feature selection method. The developed system achieved 97.5% correct tone nuclei on a speaker dependent task. The tone recognizer based on the detected tone nuclei improved tone recognition rate by over 6% more than the baseline ones using the full tonal syllable features.
Keywords :
iterative methods; natural languages; speech recognition; Chinese continuous speech; automatic speech recognition; intonation function decomposition; iterative segmental K-means segmentation; linear discriminant function; sequential feature selection; speaker dependent tone recognition; t-test based segment amalgamation; tonal F0 contours; tonal syllable features; tone nuclei location; tone nuclei segmentation; tone nucleus discriminator; Automatic speech recognition; Cities and towns; Informatics; Iterative methods; Laboratories; Loudspeakers; Natural languages; Robustness; Speech recognition; Target recognition;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1326135