Title :
Burst Onset Landmark Detection and Its Application to Speech Recognition
Author :
Lin, Chi-yueh ; Wang, Hsiao-Chuan
Author_Institution :
Dept. of Electr. Eng., Nat. Tsing Hua Univ., Hsinchu, Taiwan
fDate :
7/1/2011 12:00:00 AM
Abstract :
The reliable detection of salient acoustic-phonetic cues in speech signal plays an important role in speech recognition based on speech landmarks. Once speech landmarks are located, not only can phone recognition be performed, but other useful information can also be derived. This paper focuses on the detection of burst onset landmarks, which are crucial to the recognition of stop and affricate consonants. The proposed detector is purely based on a random forest technique, which belongs to an ensemble of tree-structured classifiers. By adopting a special asymmetric bootstrapping method, a series of experiments conducted on the TIMIT database demonstrate that the proposed detector is an efficient and accurate method for detecting burst onsets. When the detection results are appended to mel frequency cepstral coefficient vectors, the augmented feature vectors enhance the recognition correctness of hidden Markov models in recognizing stop and affricate consonants in continuous speech.
Keywords :
hidden Markov models; signal classification; speech recognition; vectors; TIMIT database; affricate consonant recognition; asymmetric bootstrapping method; augmented feature vector; burst onset landmark detection; hidden Markov model; mel frequency cepstral coefficient vector; random forest technique; salient acoustic-phonetic cue detection; speech recognition; speech signal; stop consonant recognition; tree-structured classifier; Detectors; Feature extraction; Hidden Markov models; Speech; Speech recognition; Testing; Training; Affricate consonant; burst onset; random forest; speech recognition; stop consonant;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2010.2089518