Title :
Refining Segmental Boundaries using Support Vector Machine
Author :
Namnabat, M. ; Homayounpour, M.M.
Author_Institution :
Dept. of Comput. Eng. & Inf. Technol., Amirkabir Univ. of Technol., Tehran
Abstract :
High accuracy phonetic segmentation is critical to achieve good quality in concatenative speech synthesis. However, the processing and inspection of a large amount of recorded speech will become a labor-intensive and error-prone job. In this paper, a post-refining method based on support vector machines (SVMs), is proposed for auto-segmentation of speech data. Our baseline system is based on a set of hidden Markov models (HMMs). This system performs forced alignment of speech data and phonemic transcription of corresponding text. A de-biasing algorithm first refines initial boundary estimates. SVM models are then used for more refinement of de-biased boundaries. Subsequently, a LBG vector quantization algorithm is used to reduce the amount of speech for training SVM models. This leads to a considerable decrease in necessary time to train SVM models. We achieved a performance of 94.3% for segmentation of phoneme boundaries with less than 15 ms deviation from hand labeled boundaries
Keywords :
hidden Markov models; speech coding; speech synthesis; vector quantisation; HMM; LBG vector quantization algorithm; SVM; concatenative speech synthesis; de-biasing algorithm; hidden Markov models; phonetic segmentation; segmental boundaries; support vector machine; Context modeling; Hidden Markov models; Humans; Kernel; Speech analysis; Speech processing; Speech recognition; Speech synthesis; Support vector machine classification; Support vector machines;
Conference_Titel :
Signal Processing, 2006 8th International Conference on
Conference_Location :
Beijing
Print_ISBN :
0-7803-9736-3
Electronic_ISBN :
0-7803-9736-3
DOI :
10.1109/ICOSP.2006.345517