مرکز منطقه ای اطلاع رساني علوم و فناوري - Refining Segmental Boundaries using Support Vector Machine

DocumentCode :

461631

Title :

Refining Segmental Boundaries using Support Vector Machine

Author :

Namnabat, M. ; Homayounpour, M.M.

Author_Institution :

Dept. of Comput. Eng. & Inf. Technol., Amirkabir Univ. of Technol., Tehran

Volume :

fYear :

2006

fDate :

16-20 2006

Abstract :

High accuracy phonetic segmentation is critical to achieve good quality in concatenative speech synthesis. However, the processing and inspection of a large amount of recorded speech will become a labor-intensive and error-prone job. In this paper, a post-refining method based on support vector machines (SVMs), is proposed for auto-segmentation of speech data. Our baseline system is based on a set of hidden Markov models (HMMs). This system performs forced alignment of speech data and phonemic transcription of corresponding text. A de-biasing algorithm first refines initial boundary estimates. SVM models are then used for more refinement of de-biased boundaries. Subsequently, a LBG vector quantization algorithm is used to reduce the amount of speech for training SVM models. This leads to a considerable decrease in necessary time to train SVM models. We achieved a performance of 94.3% for segmentation of phoneme boundaries with less than 15 ms deviation from hand labeled boundaries

Keywords :

hidden Markov models; speech coding; speech synthesis; vector quantisation; HMM; LBG vector quantization algorithm; SVM; concatenative speech synthesis; de-biasing algorithm; hidden Markov models; phonetic segmentation; segmental boundaries; support vector machine; Context modeling; Hidden Markov models; Humans; Kernel; Speech analysis; Speech processing; Speech recognition; Speech synthesis; Support vector machine classification; Support vector machines;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Signal Processing, 2006 8th International Conference on

Conference_Location :

Beijing

Print_ISBN :

0-7803-9736-3

Electronic_ISBN :

0-7803-9736-3

Type :

conf

DOI :

10.1109/ICOSP.2006.345517

Filename :

4128932

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=461631