DocumentCode
3163627
Title
A layered approach for dutch large vocabulary continuous speech recognition
Author
Pelemans, Joris ; Demuynck, Kris ; Wambacq, Patrick
Author_Institution
Dept. ESAT, Katholieke Univ. Leuven, Leuven, Belgium
fYear
2012
fDate
25-30 March 2012
Firstpage
4421
Lastpage
4424
Abstract
In this paper we investigate whether a layered architecture that has already proven its value for small tasks, works for a system with large lexica (400k words) and language models (5-grams) as well. The architecture was designed to decouple phone and word recognition which allows for the integration of more complex linguistic components, especially at the sub-word level. It was tested on the Dutch language which - with its large variety of accents and rich morphology - is ideally suited to benefit from this integration. The results reveal that the architecture is already competitive to an all-in-one approach in which acoustic models, language models and lexicon are all applied simultaneously. Candidates for further improvement to the system based on a conditional phone confusion model are suggested.
Keywords
speech recognition; accents; acoustic models; decouple phone; dutch large vocabulary continuous speech recognition; language models; lexicon; phone confusion model; rich morphology; word recognition; Acoustics; Context; Context modeling; Decoding; Hidden Markov models; Lattices; Speech; ASR architecture; LVCSR; accented speech; phone confusion matrix; phone lattice decoding;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location
Kyoto
ISSN
1520-6149
Print_ISBN
978-1-4673-0045-2
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2012.6288900
Filename
6288900
Link To Document