Title :
Acoustic factorisation
Author_Institution :
Dept. of Eng., Cambridge Univ., UK
Abstract :
This paper describes a new technique for training a speech recognition system on inhomogeneous training data. The proposed technique, acoustic factorisation, attempts to model explicitly all the factors that affect the acoustic signal. By explicitly modelling all the factors, the trained model set may be used in a more flexible fashion than in standard adaptive training schemes. Since an individual model is trained for each factor, it is possible to factor-in only those factors that are appropriate to a particular target domain, for example the distribution over all training speakers. The target domain specific factors are simply estimated from limited target specific data, for example the target acoustic environment. The paper describes the theory of this new approach for the transforms for a particular speaker and environment. Initial experiments on a large vocabulary speech recognition task are presented.
Keywords :
acoustic signal processing; learning (artificial intelligence); parameter estimation; speech recognition; acoustic factorisation; adaptive training; inhomogeneous training data; specific factor estimation; speech recognition; Acoustic noise; Acoustic testing; Acoustical engineering; Data engineering; Degradation; Loudspeakers; Maximum likelihood linear regression; Speech recognition; Training data; Working environment noise;
Conference_Titel :
Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
Print_ISBN :
0-7803-7343-X
DOI :
10.1109/ASRU.2001.1034593