Title :
Quantization of cepstral parameters for speech recognition over the World Wide Web
Author :
Digalakis, V. ; Neumeyer, L. ; Perakakis, M.
Author_Institution :
Dept. of Electron. & Comput. Eng., Tech. Univ. of Crete, Hania, Greece
Abstract :
We examine alternative architectures for a client-server model of speech-enabled applications over the World Wide Web. We compare a server-only processing model, where the client encodes and transmits the speech signal to the server, to a model where the recognition front end, implemented as a Java applet runs locally at the client and encodes and transmits the cepstral coefficients to the recognition server over the Internet. We follow a novel encoding paradigm, trying to maximize the recognition performance instead of perceptual reproduction, and we find that by transmitting the cepstral coefficients we can achieve significantly higher recognition performance at a fraction of the bit rate required when encoding the speech signal directly
Keywords :
Internet; cepstral analysis; client-server systems; network servers; object-oriented methods; performance evaluation; quantisation (signal); speech coding; speech recognition; voice communication; Java applet; World Wide Web; bit rate; cepstral coefficients transmission; cepstral parameters quantization; client-server model; encoding paradigm; perceptual reproduction; recognition front end; recognition performance; recognition server; server-only processing model; speech recognition; speech signal encoding; speech-enabled applications; system architectures; Cepstral analysis; Encoding; Java; Quantization; Service oriented architecture; Signal processing; Speech processing; Speech recognition; Web server; Web sites;
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7803-4428-6
DOI :
10.1109/ICASSP.1998.675433