Title :
Selection criteria for hypothesis driven lexical adaptation
Author :
Geutner, P. ; Finke, M. ; Waibel, A.
Author_Institution :
Interactive Syst. Labs., Karlsruhe Univ., Germany
Abstract :
Adapting the vocabulary of a speech recognizer to the utterance to be recognized has proven to be successful both in reducing high out-of-vocabulary as well as word error rates. This applies especially to languages that have a rapid vocabulary growth due to a large number of inflections and composita. This paper presents various adaptation methods within the hypothesis driven lexical adaptation (HDLA) framework which allow speech recognition on a virtually unlimited vocabulary. Selection criteria for the adaptation process are either based on morphological knowledge or distance measures at phoneme or grapheme level. Different methods are introduced for determining distances between phoneme pairs and for creating the large fallback lexicon the adapted vocabulary is chosen from. HDLA reduces the out-of-vocabulary-rate by 55% for Serbo-Croatian, 35% for German and 27% for Turkish. The reduced out-of-vocabulary rate also decreases the word error rate by an absolute 4.1% to 25.4% on Serbo-Croatian broadcast news data
Keywords :
natural languages; speech recognition; German; HDLA; Serbo-Croatian; Turkish; adaptation methods; broadcast news data; composita; distance measures; fallback lexicon; grapheme; hypothesis driven lexical adaptation; inflections; morphological knowledge; out-of-vocabulary rate; phoneme; selection criteria; speech recognition; speech recognizer; utterance; virtually unlimited vocabulary; word error rate; Broadcasting; Databases; Degradation; Dictionaries; Error analysis; Interactive systems; Laboratories; Natural languages; Speech recognition; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
Print_ISBN :
0-7803-5041-3
DOI :
10.1109/ICASSP.1999.759742