DocumentCode :
1732978
Title :
An investigation into VTLN for improved transcription of Czech broadcast programs
Author :
Cerva, Petr ; Palecek, Karel ; Silovsky, Jan ; Nouza, Jan
Author_Institution :
Inst. of Inf. Technol. & Electron., Tech. Univ. of Liberec, Liberec, Czech Republic
fYear :
2011
Firstpage :
201
Lastpage :
204
Abstract :
This paper deals with the Vocal Tract Length Normalization (VTLN) method. The aim is to investigate the best way to utilize this technique for improving the recognition accuracy of an LVCSR system that has been developed for broadcast program transcription at our lab in recent years. For this purpose, VTLN is evaluated experimentally in several configurations during testing as well as in a speaker adaptive training scheme. In the former case, we apply VTLN in an unsupervised manner to each test utterance, without knowledge of the transcription of the adaptation data. Our results on different types of broadcast programs show that the resulting VTLN approach reduces the Word Error Rate (WER) of our system significantly, by 7% relative.
Keywords :
broadcasting; speaker recognition; unsupervised learning; Czech broadcast programs; LVCSR system; VTLN; broadcast program transcription; recognition accuracy; speaker adaptive training; testing utterance; unsupervised; vocal tract length normalization; word error rate; Acoustics; Hidden Markov models; Optimized production technology; Speech; Speech recognition; Testing; Training; Transcription of broadcast programs; Unsupervised speaker adaptation; Vocal tract length normalization
fLanguage :
English
Publisher :
IEEE
Conference_Title :
ELMAR, 2011 Proceedings
Conference_Location :
Zadar
ISSN :
1334-2630
Print_ISBN :
978-1-61284-949-2
Electronic_ISBN :
1334-2630
Type :
conf
Filename :
6044296