Title :
Speaker-independent speech recognition using nonlinear predictor codebooks
Author :
Kawabata, Takeshi
Author_Institution :
NTT Basic Res. Lab., Musashino-shi, Tokyo, Japan
Abstract :
A neural spectrum-prediction mechanism is implemented in the predictor codebook for speaker-independent speech recognition. The nonlinear predictor codebook consists of neural predictors generated through LBG (Linde-Buzo-Gray) based predictor quantization procedures. Nonlinear prediction functions insulate each predictor code from the other codes, and accomplish high phoneme separation without decreasing the robustness to speaker variation. The structure of each predictor is equivalent to a three-layer neural network, but it is not trained by error backpropagation. The predictor is first optimized as a linear prediction function. Then, the nonlinear (sigmoid) function is implemented in it. A set of nonlinear predictors is totally optimized by the predictor quantization algorithm.<>
Keywords :
feedforward neural nets; filtering and prediction theory; speech coding; speech recognition; high phoneme separation; neural spectrum-prediction mechanism; nonlinear predictor codebooks; predictor quantization algorithm; robustness; sigmoid function; speaker-independent speech recognition; three-layer neural network;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
Conference_Location :
Minneapolis, MN, USA
Print_ISBN :
0-7803-7402-9
DOI :
10.1109/ICASSP.1993.319406