DocumentCode :
3163627
Title :
A layered approach for dutch large vocabulary continuous speech recognition
Author :
Pelemans, Joris ; Demuynck, Kris ; Wambacq, Patrick
Author_Institution :
Dept. ESAT, Katholieke Univ. Leuven, Leuven, Belgium
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
4421
Lastpage :
4424
Abstract :
In this paper we investigate whether a layered architecture that has already proven its value for small tasks, works for a system with large lexica (400k words) and language models (5-grams) as well. The architecture was designed to decouple phone and word recognition which allows for the integration of more complex linguistic components, especially at the sub-word level. It was tested on the Dutch language which - with its large variety of accents and rich morphology - is ideally suited to benefit from this integration. The results reveal that the architecture is already competitive to an all-in-one approach in which acoustic models, language models and lexicon are all applied simultaneously. Candidates for further improvement to the system based on a conditional phone confusion model are suggested.
Keywords :
speech recognition; accents; acoustic models; decouple phone; dutch large vocabulary continuous speech recognition; language models; lexicon; phone confusion model; rich morphology; word recognition; Acoustics; Context; Context modeling; Decoding; Hidden Markov models; Lattices; Speech; ASR architecture; LVCSR; accented speech; phone confusion matrix; phone lattice decoding;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6288900
Filename :
6288900
Link To Document :
بازگشت