مرکز منطقه ای اطلاع رساني علوم و فناوري - Unsupervised incremental online adaptation to unknown environment and speaker

DocumentCode :

542265

Title :

Unsupervised incremental online adaptation to unknown environment and speaker

Author :

Yook, Dongsuk

Author_Institution :

Speech Information Processing Laboratory, Department of Computer Science and Engineering, Korea University, Seoul, Korea

Volume :

fYear :

2002

fDate :

13-17 May 2002

Abstract :

A Maximum Likelihood Spectral Transformation (MLST) technique is used for robust speech recognition under mismatched training and testing conditions. The linear spectral speech feature vectors of testing utterances are transformed such that the likelihood of the utterances is increased after the transformation. The cepstral vectors are computed from the transformed spectra. The function used for the spectral transformation is designed to handle both convolutional and additive noise. Since the function has small number of parameters to be estimated, MLST requires only a few utterances for adaptation. Furthermore, the computation for parameter estimation and spectral transformation can be done efficiently in linear time. Therefore, the MLST is suitable for rapid online adaptation. To evaluate the efficiency of the MLST technique, it has been implemented for unsupervised incremental online adaptation. The system is tested on speaker-phone telephone speech data, and MLST reduces the error rate by 29.5% when used for speaker and environment adaptation.

Keywords :

Ear;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on

Conference_Location :

Orlando, FL, USA

ISSN :

1520-6149

Print_ISBN :

0-7803-7402-9

Type :

conf

DOI :

10.1109/ICASSP.2002.5743793

Filename :

5743793

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=542265