Title :
The ETSI extended distributed speech recognition (DSR) standards: client side processing and tonal language recognition evaluation
Author :
Sorin, Alexander ; Ramabadran, Tenkasi ; Chazan, Dan ; Hoory, Ron ; McLaughlin, Michael ; Pearce, David ; Wang, Fan CR ; Zhang, Yaxin
Abstract :
We present the work carried out in developing the ETSI extended DSR standards ES 202 211 and ES 202 212 (2003). These standards extend the previous ETSI DSR standards, the basic front-end ES 201 108 and the advanced (noise-robust) front-end ES 202 050, respectively. The extensions enable enhanced tonal language recognition as well as server-side speech reconstruction. This paper discusses the client-side estimation of the pitch and voicing class parameters, whereas a companion paper discusses the server-side speech reconstruction. Experimental results show that the standard extensions improve the tonal language recognition rates of proprietary recognition engines.
Keywords :
natural languages; speech processing; speech recognition; standards; ES 201 108; ES 202 050; ES 202 211; ES 202 212; ETSI distributed speech recognition standards; advanced front-end; basic front-end; client-side pitch estimation; client-side voicing class parameter estimation; enhanced tonal language recognition; extended distributed speech recognition standards; noise robust front-end; server-side speech reconstruction; Delay; Feature extraction; Frequency estimation; Mel frequency cepstral coefficient; Noise robustness; Standards development; Telecommunication standards
Conference_Title :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1325939