DocumentCode :
3424293
Title :
A hybrid architecture for automatic segmentation of speech waveforms
Author :
Mporas, Iosif ; Ganchev, Todor ; Fakotakis, Nikos
Author_Institution :
Dept. Electr. & Comput. Eng., Patras Univ., Rio Patras
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
4457
Lastpage :
4460
Abstract :
In the present work, we propose a hybrid architecture for automatic alignment of speech waveforms and their corresponding phone sequence. The proposed architecture does not exploit any phone boundary information. Our approach combines the efficiency of embedded training techniques and the high performance of isolated-unit training. Evaluating on the established for the task of phone segmentation TIMIT database, we achieved an accuracy of 83.56%, which corresponds to improving the baseline system´s accuracy by 6.09 %.
Keywords :
speech processing; TIMIT database; automatic alignment; automatic speech waveform segmentation; embedded training techniques; isolated-unit training; phone boundary information; phone sequence; Artificial intelligence; Computer architecture; Databases; Feature extraction; Hidden Markov models; Natural languages; Speech recognition; Text recognition; Viterbi algorithm; Wire; Speech segmentation; embedded training; hidden Markov models; isolated-unit training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4518645
Filename :
4518645
Link To Document :
بازگشت