DocumentCode
2018566
Title
Speaker-independent features extracted by a neural network
Author
Kato, Y. ; Sugiyama, M.
Author_Institution
ATR Interpreting Telephony Res. Lab., Soraku-gun, Kyoto, Japan
Volume
1
fYear
1993
fDate
27-30 April 1993
Firstpage
553
Abstract
The authors propose an algorithm using a neural network to normalize features that differ between speakers in speaker-independent speech recognition. The algorithm has three procedures: (1) initially training a neural network, (2) calculating the alignment function between the target signal and the network´s output by dynamic time warping, and (3) incrementally training the network for extracting speaker-independent features. The neural network is a fuzzy partition model (FPM) with multiple input-output units to give a probabilistic formulation. The algorithm was evaluated in phrase recognition experiments by FPM-LR recognizers. The FPM was directly combined with a LR parser. The algorithm is compared with a conventional training algorithm in terms of recognition performance. The experimental results show that a neural network can be used as a new speaker-independent feature extractor.<>
Keywords
feature extraction; fuzzy logic; grammars; learning (artificial intelligence); neural nets; speech recognition; LR parser; algorithm; alignment function; dynamic time warping; fuzzy partition model; neural network; probabilistic formulation; recognition performance; speaker-independent feature extractor; speaker-independent speech recognition; training;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
Conference_Location
Minneapolis, MN, USA
ISSN
1520-6149
Print_ISBN
0-7803-7402-9
Type
conf
DOI
10.1109/ICASSP.1993.319178
Filename
319178
Link To Document