Title :
Experiments using data augmentation for speaker adaptation
Author :
Bellegarda, Jerome R. ; De Souza, Peter V. ; Nahamoo, David ; Padmanabhan, Mukund ; Picheny, Michael A. ; Bahl, Lalit R.
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
Speaker adaptation typically involves customizing some existing (reference) models in order to account for the characteristics of a new speaker. This work considers the slightly different paradigm of customizing some reference data for the purpose of populating the new speaker´s space, and then using the resulting (augmented) data to derive the customized models. The data augmentation technique is based on the metamorphic algorithm first proposed in Bellegarda et al. [1992], assuming that a relatively modest amount of data (100 sentences) is available from each new speaker. This contraint requires that reference speakers be selected with some care. The performance of this method is illustrated on a portion of the Wall Street Journal task
Keywords :
natural languages; speech recognition; Wall Street Journal task; customized models; data augmentation; metamorphic algorithm; reference data; speaker adaptation; Error analysis; Hidden Markov models; Loudspeakers; Natural languages; Prototypes; Speech analysis; Speech recognition; Switches; Testing; Training data; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
Print_ISBN :
0-7803-2431-5
DOI :
10.1109/ICASSP.1995.479788