DocumentCode :
2704478
Title :
iLBC-Based Transparametrization: A Real Alternative to DSR for Speech Recognition Over Packet Networks
Author :
Carmona, J.L. ; Peinado, Antonio M. ; Perez-Cordoba, Jose L. ; Gomez, Angel M. ; Sanchez, Victor
Author_Institution :
Dpt. Teoria de la Senal, Telematica y Comunicaciones, Granada Univ., Spain
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
This paper proposes a method for the remote recognition of speech coded with the iLBC codec, which is employed by a number of VoIP systems. While the usual way of recognizing coded speech is to first decode the speech signal and use it as input to the recognition engine, our system converts the iLBC parameters directly into recognition features. The main advantage of this approach is that it avoids any decoding post-processing which, although originally conceived to improve speech perception, can be harmful to a recognition system. Our method ensures compatibility between the speech spectra provided by the iLBC codec and those employed for cepstrum computation, and it introduces a robust and suitable packet loss concealment strategy. Our experimental results show that the proposed system performs better than recognition from iLBC-decoded speech and similarly to a distributed speech recognition system, over both clean and degraded transmission channels.
Keywords :
Internet telephony; decoding; speech codecs; speech coding; speech recognition; voice communication; DSR; VoIP systems; decoding post-processing; degraded transmission channel; iLBC-based transparametrization; packet loss concealment strategy; packet networks; remote recognition; speech perception; Automatic speech recognition; Degradation; Engines; Feature extraction; Robustness; Speech processing; NSR; iLBC codec; packet network
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.367231
Filename :
4218262