DocumentCode :
2770253
Title :
Interpolative variable frame rate transmission of speech features for distributed speech recognition
Author :
Deng, Huiqun ; O'Shaughnessy, Douglas ; Dahan, Jean ; Ganong, William F.
Author_Institution :
Univ. of Quebec, Montreal
fYear :
2007
fDate :
9-13 Dec. 2007
Firstpage :
591
Lastpage :
595
Abstract :
In distributed speech recognition, vector quantization is used to reduce the number of bits needed to code speech features at the user end, which saves the energy spent transmitting feature streams to remote recognizers and reduces data traffic congestion. We observe that the overall bit rate of the transmitted feature streams could be reduced further by not sending redundant frames that the remote server can interpolate from the frames it receives. Interpolation introduces errors, however, and may degrade speech recognition. This paper investigates methods for selecting frames for transmission and the effect of interpolation on recognition. Experiments on a large vocabulary recognizer show that, with spline interpolation, the overall frame rate for transmission can be reduced by about 50% with a relative increase in word error rate of less than 5.2% for both clean and noisy speech.
Keywords :
interpolation; speech coding; speech recognition; splines (mathematics); vector quantisation; distributed speech recognition; interpolative variable frame rate transmission; large vocabulary recognizer; spline; vector quantization; Bit rate; Degradation; Error analysis; Noise reduction; Vocabulary; Data compression
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-1746-9
Electronic_ISBN :
978-1-4244-1746-9
Type :
conf
DOI :
10.1109/ASRU.2007.4430179
Filename :
4430179