مرکز منطقه ای اطلاع رساني علوم و فناوري - Speaker adaptation by a linear transformation with optimised parameters

DocumentCode :

3051436

Title :

Speaker adaptation by a linear transformation with optimised parameters

Author :

Jaschul, Johannes

Author_Institution :

Technical University, Munich, Federal Republic of Germany

Volume :

fYear :

1982

fDate :

30072

Firstpage :

1657

Lastpage :

1660

Abstract :

Speaker dependence of automatic speech recognition systems can be reduced by applying speaker-specific transformations to adapt the speech signal of a new speaker to that of the reference speaker. Initial investigations showed that speaker adaptation can be performed by transformations using spectral weighting and spectral warping. These heuristic methods can be substituted by a general linear matrix transformation, the parameters of which are determined by mean square error optimisation. The improvement of the recognition rate achievable by this matrix transformation is very high, but the method needs a large learning set. This can be reduced by restriction of the matrix to a band including the main diagonal in the middle. This banded matrix yields results close to those of the general matrix. Adaptation can be performed speaker-specifically as well as speaker- and class-specifically. As the cost of phoneme class-specific adaptation is very high, a grouping of phonemes is proposed so that one adaptation parameter set is used for all phonemes that belong to any one group.

Keywords :

Automatic speech recognition; Character recognition; Costs; Frequency; Matrices; Mean square error methods; Optimization methods; Speech analysis; Speech recognition; Testing;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '82.

Type :

conf

DOI :

10.1109/ICASSP.1982.1171494

Filename :

1171494

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3051436