Title :
Speaker-independent features extracted by a neural network
Author :
Kato, Y. ; Sugiyama, M.
Author_Institution :
ATR Interpreting Telephony Res. Lab., Soraku-gun, Kyoto, Japan
Abstract :
The authors propose an algorithm using a neural network to normalize features that differ between speakers in speaker-independent speech recognition. The algorithm has three procedures: (1) initially training a neural network, (2) calculating the alignment function between the target signal and the network´s output by dynamic time warping, and (3) incrementally training the network for extracting speaker-independent features. The neural network is a fuzzy partition model (FPM) with multiple input-output units to give a probabilistic formulation. The algorithm was evaluated in phrase recognition experiments by FPM-LR recognizers. The FPM was directly combined with a LR parser. The algorithm is compared with a conventional training algorithm in terms of recognition performance. The experimental results show that a neural network can be used as a new speaker-independent feature extractor.<>
Keywords :
feature extraction; fuzzy logic; grammars; learning (artificial intelligence); neural nets; speech recognition; LR parser; algorithm; alignment function; dynamic time warping; fuzzy partition model; neural network; probabilistic formulation; recognition performance; speaker-independent feature extractor; speaker-independent speech recognition; training;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
Conference_Location :
Minneapolis, MN, USA
Print_ISBN :
0-7803-7402-9
DOI :
10.1109/ICASSP.1993.319178