Title :
Phonetic Boundary Refinement using Support Vector Machine
Author :
Lo, Hung-Yi ; Wang, Hsin-Min
Author_Institution :
Inst. of Inf. Sci., Academia Sinica, Taipei
Abstract :
In this paper, we propose using support vector machine (SVM) to refine the hypothesized phone transition boundaries given by the HMM-based Viterbi forced alignment. We conducted experiments on the TIMIT speech corpus. The phone transitions were automatically partitioned into 46 clusters according to their acoustic characteristics and the cross-validation using the training data; hence, 46 phone-transition-dependent SVM classifiers were used for phone boundary refinement. The proposed HMM-SVM approach performs as well as the recent discriminative HMM-based segmentation. The best accuracies achieved are 81.23% within a tolerance of 10 ms and 92.47% within a tolerance of 20 ms. The mean boundary distance is 7.73 ms.
Keywords :
hidden Markov models; speech processing; support vector machines; HMM-based Viterbi forced alignment; SVM; TIMIT speech corpus; discriminative HMM-based segmentation; phone transition boundaries; phone-transition-dependent SVM classifiers; phonetic boundary refinement; support vector machine; Hidden Markov models; Humans; Information science; Labeling; Speech recognition; Speech synthesis; Support vector machine classification; Support vector machines; Training data; Viterbi algorithm; Automatic phone alignment; reduced support vector machine; support vector machine;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2007.367224