Title :
Robust speech recognition techniques using a radial basis function neural network for mobile applications
Author :
Sankar, Ravi ; Sethi, Netoo Singh
Author_Institution :
Dept. of Electr. Eng., Univ. of South Florida, Tampa, FL, USA
Abstract :
We present the development of robust speech recognition techniques for voice activated dialing (VAD) for a cellular phone in a car. Noise reduction techniques are implemented before modeling the speech parameters in the homomorphic domain. The FFT derived cepstral coefficients are liftered and then Mel-scale warped to generate the feature vector. A radial basis function neural network is used as the final classifier and variation in its structure is studied to evaluate its real-time performance. Results on speaker-dependent and speaker-independent recognition across -5 dB to 25 dB SNR range are presented and a performance evaluation is carried out for different front-end techniques. The system is evaluated using the NOISEX-92 database
Keywords :
acoustic noise; cellular radio; cepstral analysis; fast Fourier transforms; feedforward neural nets; noise abatement; speech processing; speech recognition; telephone sets; -5 to 25 dB; FFT; Mel-scale warped coefficients; NOISEX-92 database; cellular phone; cepstral coefficients; classifier; feature vector; front-end techniques; homomorphic domain; mobile applications; noise reduction techniques; radial basis function neural network; real-time performance evaluation; robust speech recognition techniques; speaker dependent recognition; speaker independent recognition; speech parameters modeling; voice activated dialing; Cepstral analysis; Databases; Low-frequency noise; Noise reduction; Noise robustness; Radial basis function networks; Signal to noise ratio; Speech enhancement; Speech recognition; Working environment noise;
Conference_Titel :
Southeastcon '97. Engineering new New Century., Proceedings. IEEE
Conference_Location :
Blacksburg, VA
Print_ISBN :
0-7803-3844-8
DOI :
10.1109/SECON.1997.598616