Title :
A 40 bps speech coding scheme
Author :
Lopes, Cristina Videira ; Chadha, Anshuman
Author_Institution :
Sch. of Inf. & Comput. Sci., California Univ., Irvine, CA, USA
Abstract :
We describe a method and an implementation for producing a highly compressed representation of speech, in the order of 40 bps. This compression method uses a speech recognition engine to analyze the speech signal at the morphological level, i.e. the words. The words are then coded using a word-level text compression mechanism. After decompression, the speech message is recovered using text-to-speech synthesis. We report experimental results of our implementation. In particular, we observed that human listeners were able to recover from errors introduced by the speech recognition engine, and that the human perceptual errors were highly dependent on the content of the messages, especially regarding familiarity with the topic.
Keywords :
data compression; speech coding; speech recognition; speech synthesis; text analysis; 40 bit/s; human perceptual errors; morphological level; speech coding; speech recognition engine; text-to-speech synthesis; word-level text compression mechanism; Bit rate; Computer science; Decoding; Engines; Humans; Signal synthesis; Speech analysis; Speech coding; Speech recognition; Speech synthesis;
Conference_Titel :
Global Telecommunications Conference, 2003. GLOBECOM '03. IEEE
Print_ISBN :
0-7803-7974-8
DOI :
10.1109/GLOCOM.2003.1258630