Title :
Automatic transcription of unknown words in a speech recognition system
Author :
Haeb-Umbach, R. ; Beyerlein, P. ; Thelen, E.
Author_Institution :
Philips GmbH Forschungslab., Aachen, Germany
Abstract :
We address the problem of automatically finding an acoustic representation (i.e. a transcription) of unknown words as a sequence of subword units, given a few sample utterances of the unknown words, and an inventory of speaker-independent subword units. The problem arises if a user wants to add his own vocabulary to a speaker-independent recognition system simply by speaking the words a few times. Two methods are investigated which are both based on a maximum-likelihood formulation of the problem. The experimental results show that both automatic transcription methods provide a good estimate of the acoustic models of unknown words. The recognition error rates obtained with such models in a speaker-independent recognition task are clearly better than those resulting from separate whole-word models. They are comparable with the performance of transcriptions drawn from a dictionary
Keywords :
error analysis; hidden Markov models; maximum likelihood estimation; speech recognition; acoustic models; acoustic representation; automatic transcription; estimate; hidden Markov models; maximum-likelihood formulation; performance; recognition error rates; speaker-independent subword units; speech recognition system; unknown words; utterances; vocabulary; Automatic speech recognition; Dictionaries; Error analysis; Loudspeakers; Maximum likelihood estimation; Speech recognition; Speech synthesis; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
Print_ISBN :
0-7803-2431-5
DOI :
10.1109/ICASSP.1995.479825