DocumentCode :
2665083
Title :
Performance improvement of automatic speech recognition systems via multiple language models produced by sentence-based clustering
Author :
Podder, Sushil Kumar ; Shaban, Khaled ; Sun, Jiping ; Karray, Fakhri ; Basir, Otman ; Kame, Mohamed
Author_Institution :
PAMI Lab., Waterloo Univ., Ont., Canada
fYear :
2003
fDate :
26-29 Oct. 2003
Firstpage :
362
Lastpage :
367
Abstract :
Grammar-based speech recognition systems exhibit performance degradation as their vocabulary sizes increase. Data clustering is deemed to reduce the proportionality of this problem. We introduce an approach to data clustering for automatic speech recognition systems using kohonen self-organized map. Clustering results are used further to build a language model for each of the clusters using CMU-Cambridge toolkit. The approach was implemented as a prototype for a large vocabulary and continuous speech recognition system and about 8% performance improvement was achieved in comparison with the performance achieved using the language model and dictionary provided by Sphinx3. We present the experimental results along with discussions, analysis and potential future directions.
Keywords :
grammars; pattern clustering; self-organising feature maps; speech recognition; CMU-Cambridge toolkit; automatic speech recognition system; continuous speech recognition system; data clustering; grammar-based speech recognition system; kohonen self-organized map; multiple language model; vocabulary; Automatic speech recognition; Degradation; Dictionaries; Engines; Natural languages; Performance analysis; Speech analysis; Speech recognition; Sun; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 International Conference on
Conference_Location :
Beijing, China
Print_ISBN :
0-7803-7902-0
Type :
conf
DOI :
10.1109/NLPKE.2003.1275932
Filename :
1275932
Link To Document :
بازگشت