Title :
Combined optimisation of baseforms and model parameters in speech recognition based on acoustic subword units
Author :
Holter, Trym ; Svendsen, Torbjorn
Author_Institution :
Dept. of Telecommun., Norwegian Univ. of Sci. & Technol., Norway
Abstract :
A major challenge in speech recognition is creating a lexicon which is robust to inter and intra speaker variations. This is even more so in speech recognisers based on non linguistic units, e.g., acoustic subword units (ASWUs), since no standard pronunciation dictionaries are available. Thus the baseforms describing the vocabulary words in terms of the recognition units need to be generated from training data. We propose an algorithm for ASWU based speech recognition which performs a combined optimisation of the baseforms and the subword models. The resulting system has been tested on the DARPA Resource Management task, and is shown to perform comparably to a baseline phoneme based system
Keywords :
acoustic signal processing; optimisation; resource allocation; speech processing; speech recognition; word processing; ASWU based speech recognition; ASWUs; DARPA Resource Management task; acoustic subword units; baseforms; baseline phoneme based system; combined optimisation; intra speaker variations; lexicon; model parameters; non linguistic units; recognition units; speech recognition; standard pronunciation dictionaries; subword models; training data; vocabulary words; Automatic speech recognition; Dictionaries; Management training; Resource management; Robustness; Speech analysis; Speech recognition; System testing; Training data; Vocabulary;
Conference_Titel :
Automatic Speech Recognition and Understanding, 1997. Proceedings., 1997 IEEE Workshop on
Conference_Location :
Santa Barbara, CA
Print_ISBN :
0-7803-3698-4
DOI :
10.1109/ASRU.1997.659006