Title :
Adaptive training for robust ASR
Author :
Gales, Mark J.F.
Author_Institution :
Dept. of Eng., Cambridge Univ., UK
Abstract :
Adaptive training is a powerful training technique for building speech recognition systems on nonhomogeneous data. The aim is to remove unwanted variability, such as changes in speaker, channel or acoustic environment, from desired changes, the acoustic differences between words. During training, two sets of models are generated: a canonical model set for the desired "true" variability of the speech data, and a set of transforms to represent the unwanted variability. The canonical model set trained in this fashion should be more "amenable" to being adapted to a particular target condition and more "compact". During recognition, a transform to the target domain is trained. This target specific transform is then used with the canonical model set in the recognition process. The paper gives an overview of the underlying theory and assumptions used in adaptive training. Furthermore, the use of adaptive training schemes in current state-of-the-art tasks is described, together with a discussion of how such schemes may be used in the future.
Keywords :
learning (artificial intelligence); speech processing; speech recognition; adaptive training; automatic speech recognition; canonical model set; feature extraction; nonhomogeneous data; robust ASR; target specific transform; true variability; unwanted variability; Acoustical engineering; Automatic speech recognition; Data engineering; Feature extraction; Loudspeakers; Power engineering and energy; Robustness; Speech recognition; Target recognition; Training data;
Conference_Titel :
Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
Print_ISBN :
0-7803-7343-X
DOI :
10.1109/ASRU.2001.1034578