DocumentCode :
294656
Title :
Experiments using data augmentation for speaker adaptation
Author :
Bellegarda, Jerome R. ; De Souza, Peter V. ; Nahamoo, David ; Padmanabhan, Mukund ; Picheny, Michael A. ; Bahl, Lalit R.
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Volume :
1
fYear :
1995
fDate :
9-12 May 1995
Firstpage :
692
Abstract :
Speaker adaptation typically involves customizing some existing (reference) models in order to account for the characteristics of a new speaker. This work considers the slightly different paradigm of customizing some reference data for the purpose of populating the new speaker´s space, and then using the resulting (augmented) data to derive the customized models. The data augmentation technique is based on the metamorphic algorithm first proposed in Bellegarda et al. [1992], assuming that a relatively modest amount of data (100 sentences) is available from each new speaker. This contraint requires that reference speakers be selected with some care. The performance of this method is illustrated on a portion of the Wall Street Journal task
Keywords :
natural languages; speech recognition; Wall Street Journal task; customized models; data augmentation; metamorphic algorithm; reference data; speaker adaptation; Error analysis; Hidden Markov models; Loudspeakers; Natural languages; Prototypes; Speech analysis; Speech recognition; Switches; Testing; Training data; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
ISSN :
1520-6149
Print_ISBN :
0-7803-2431-5
Type :
conf
DOI :
10.1109/ICASSP.1995.479788
Filename :
479788
Link To Document :
بازگشت