Title :
A hybrid architecture for automatic segmentation of speech waveforms
Author :
Mporas, Iosif ; Ganchev, Todor ; Fakotakis, Nikos
Author_Institution :
Dept. Electr. & Comput. Eng., Patras Univ., Rio Patras
fDate :
March 31 2008-April 4 2008
Abstract :
In the present work, we propose a hybrid architecture for automatic alignment of speech waveforms and their corresponding phone sequence. The proposed architecture does not exploit any phone boundary information. Our approach combines the efficiency of embedded training techniques and the high performance of isolated-unit training. Evaluating on the established for the task of phone segmentation TIMIT database, we achieved an accuracy of 83.56%, which corresponds to improving the baseline system´s accuracy by 6.09 %.
Keywords :
speech processing; TIMIT database; automatic alignment; automatic speech waveform segmentation; embedded training techniques; isolated-unit training; phone boundary information; phone sequence; Artificial intelligence; Computer architecture; Databases; Feature extraction; Hidden Markov models; Natural languages; Speech recognition; Text recognition; Viterbi algorithm; Wire; Speech segmentation; embedded training; hidden Markov models; isolated-unit training;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518645