DocumentCode :
549998
Title :
Evaluation of lexical models for Hungarian Broadcast speech transcription and spoken term detection
Author :
Tarján, Balázs ; Mihajlik, Péter ; Balog, András ; Fegyó, Tibor
Author_Institution :
Dept. of Telecommun. & Media Inf., Budapest Univ. of Technol. & Econ., Budapest, Hungary
fYear :
2011
fDate :
7-9 July 2011
Firstpage :
1
Lastpage :
5
Abstract :
In this paper, we re-evaluate morph (data-driven subword) and word lexical models used for large vocabulary continuous speech recognition of agglutinative languages. Since such speech recognition systems are applied mostly for information retrieval purposes we use evaluation metrics accordingly. Standard 3-gram language model with one million words vocabulary is used for words whereas statistical morph-based models are applied with smaller vocabularies and with higher order of n-gram models. Fostering real life applicability, the computational time and memory usage of the various approaches is kept below real-time and 1.5 GB, respectively. The lexical modeling approaches are tested on Hungarian Broadcast News and Broadcast Conversation speech. In our setup, although word-based models outperformed morph-based ones in terms of both word error rate and spoken term detection measures, a search-cascade of the word and morph approaches improved the latter results significantly.
Keywords :
error statistics; natural language processing; speech recognition; text analysis; vocabulary; 3-gram language model; Hungarian broadcast news; Hungarian broadcast speech transcription; agglutinative language; broadcast conversation speec; continuous speech recognition; data-driven subword; evaluation metrics; n-gram model; speech recognition system; spoken term detection; statistical morph-based model; vocabulary; word error rate; word lexical model; word-based model; Accuracy; Decoding; Indexes; Real time systems; Speech; Speech recognition; Vocabulary; LVCSR; agglutinative languages; broadcast conversation; broadcast news; speech recognition; spoken term detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cognitive Infocommunications (CogInfoCom), 2011 2nd International Conference on
Conference_Location :
Budapest
Print_ISBN :
978-1-4577-1806-9
Electronic_ISBN :
978-963-8111-78-4
Type :
conf
Filename :
5999466
Link To Document :
بازگشت