Title :
A fast neural net training algorithm and its application to voiced-unvoiced-silence classification of speech
Author :
Ghiselli-Crippa, Thea ; El-Jaroudi, Amro
Author_Institution :
Dept. of Electr. Eng., Pittsburgh Univ., PA, USA
Abstract :
The authors describe a fast training algorithm for feedforward neural nets and apply it to a two-layer neural network that classifies segments of speech as voiced, unvoiced, or silence. The classification is based on features computed for each speech segment, which serve as input to the network. The network weights are trained with a quasi-Newton error minimization method that uses a positive-definite approximation of the Hessian matrix. When used for voiced-unvoiced-silence classification of speech frames, the network performs favorably compared with current approaches. Experimental results are presented for speaker-dependent speech classification, including an evaluation of how the type of input data used during training affects performance. The results indicate satisfactory performance, with error rates of 3-5% measured against manual classification of the speech frames.
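The abstract does not say which positive-definite Hessian approximation the authors use, so the following is only a minimal sketch of the general idea, under stated assumptions: a damped Gauss-Newton (Levenberg-Marquardt style) step, in which the Hessian of the squared error is approximated by J^T J + mu*I, which is positive definite for any mu > 0. The network shape, the tanh hidden units, the linear outputs, the finite-difference Jacobian, the synthetic three-class data, and every function name below are illustrative assumptions, not details taken from the paper.

```python
import numpy as np

def unpack(w, n_in, n_hid, n_out):
    """Split the flat weight vector into the two weight layers (bias rows included)."""
    k = (n_in + 1) * n_hid
    return w[:k].reshape(n_in + 1, n_hid), w[k:].reshape(n_hid + 1, n_out)

def forward(w, X, shape):
    """Two-layer net: tanh hidden layer, linear output layer (assumed architecture)."""
    W1, W2 = unpack(w, *shape)
    Xb = np.hstack([X, np.ones((len(X), 1))])
    Hb = np.hstack([np.tanh(Xb @ W1), np.ones((len(X), 1))])
    return Hb @ W2

def residuals(w, X, T, shape):
    """Flattened output errors; the training loss is 0.5 * ||residuals||^2."""
    return (forward(w, X, shape) - T).ravel()

def jacobian(w, X, T, shape, eps=1e-6):
    """Forward-difference Jacobian of the residuals (simple, not fast)."""
    r0 = residuals(w, X, T, shape)
    J = np.empty((r0.size, w.size))
    for j in range(w.size):
        wp = w.copy()
        wp[j] += eps
        J[:, j] = (residuals(wp, X, T, shape) - r0) / eps
    return J

def train(X, T, n_hid=4, iters=50, mu=1e-2, seed=0):
    """Damped Gauss-Newton training; returns weights, shape, and loss history."""
    rng = np.random.default_rng(seed)
    shape = (X.shape[1], n_hid, T.shape[1])
    n_w = (shape[0] + 1) * n_hid + (n_hid + 1) * shape[2]
    w = 0.1 * rng.standard_normal(n_w)
    r = residuals(w, X, T, shape)
    losses = [0.5 * r @ r]
    for _ in range(iters):
        J = jacobian(w, X, T, shape)
        B = J.T @ J + mu * np.eye(n_w)        # positive-definite Hessian approximation
        step = np.linalg.solve(B, -(J.T @ r))  # quasi-Newton step
        r_new = residuals(w + step, X, T, shape)
        if 0.5 * r_new @ r_new < losses[-1]:   # accept the step, relax damping
            w, r, mu = w + step, r_new, mu * 0.5
        else:                                  # reject the step, increase damping
            mu *= 10.0
        losses.append(0.5 * r @ r)
    return w, shape, losses

def make_data(n_per_class=30, seed=1):
    """Synthetic 2-D stand-in for per-frame features of three classes (hypothetical)."""
    rng = np.random.default_rng(seed)
    centers = np.array([[0.0, 0.0], [3.0, 0.0], [0.0, 3.0]])
    X = np.vstack([c + 0.5 * rng.standard_normal((n_per_class, 2)) for c in centers])
    labels = np.repeat(np.arange(3), n_per_class)
    return X, np.eye(3)[labels], labels

if __name__ == "__main__":
    X, T, labels = make_data()
    w, shape, losses = train(X, T)
    predicted = forward(w, X, shape).argmax(axis=1)
    print("initial loss:", losses[0], "final loss:", losses[-1])
    print("training accuracy:", (predicted == labels).mean())
```

Because each step is accepted only when it lowers the error, the loss history never increases. A practical implementation would compute the Jacobian analytically by backpropagation rather than by finite differences, and would use real speech-frame features in place of the synthetic clusters.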
Keywords :
neural nets; speech recognition; Hessian matrix; fast training algorithm; feedforward neural nets; network weights; positive-definite approximation; quasi-Newton error minimization; speaker-dependent speech classification; speech segment; two-layer neural network; voiced-unvoiced-silence classification; Approximation algorithms; Computer networks; Convergence; Feedforward neural networks; Least squares methods; Minimization methods; Neural networks; Nonlinear equations; Speech analysis; Speech synthesis
Conference_Title :
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location :
Toronto, Ont.
Print_ISBN :
0-7803-0003-3
DOI :
10.1109/ICASSP.1991.150371