Title :
Using burst onset information to improve stop/affricate phone recognition
Author :
Lin, Chi-yueh ; Wang, Hsiao-Chuan
Author_Institution :
Dept. of Electr. Eng., Nat. Tsing Hua Univ., Hsinchu, Taiwan
Abstract :
Reliably detecting salient phonetic-acoustic cues plays an important role in speech recognition based on speech landmarks. Once these speech landmarks are located, not only phone recognition can be performed but some other useful information can be derived as well. This paper focuses on the topic of detecting burst onset landmark, an important phonetic characteristic in stops and affricates. The proposed burst onset detector is based on random forest, a learning algorithm renowned for its high accuracy and efficiency in classification. By appending intermediate detection results to MFCCs, the expanded feature can bring benefit to the recognition of stop and affricate consonants in continuous speech.
Keywords :
feature extraction; speech processing; speech recognition; burst onset information; feature recognition; phonetic-acoustic cues; speech recognition; Bagging; Classification tree analysis; Decision making; Decision trees; Detectors; Regression tree analysis; Speech analysis; Speech recognition; Testing; Voting; affricate consonant; burst onset; phone recognition; random forest; stop consonant;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495132