DocumentCode
401263
Title
A 40 bps speech coding scheme
Author
Lopes, Cristina Videira ; Chadha, Anshuman
Author_Institution
Sch. of Inf. & Comput. Sci., California Univ., Irvine, CA, USA
Volume
4
fYear
2003
fDate
1-5 Dec. 2003
Firstpage
2223
Abstract
We describe a method and an implementation for producing a highly compressed representation of speech, in the order of 40 bps. This compression method uses a speech recognition engine to analyze the speech signal at the morphological level, i.e. the words. The words are then coded using a word-level text compression mechanism. After decompression, the speech message is recovered using text-to-speech synthesis. We report experimental results of our implementation. In particular, we observed that human listeners were able to recover from errors introduced by the speech recognition engine, and that the human perceptual errors were highly dependent on the content of the messages, especially regarding familiarity with the topic.
Keywords
data compression; speech coding; speech recognition; speech synthesis; text analysis; 40 bit/s; human perceptual errors; morphological level; speech coding; speech recognition engine; text-to-speech synthesis; word-level text compression mechanism; Bit rate; Computer science; Decoding; Engines; Humans; Signal synthesis; Speech analysis; Speech coding; Speech recognition; Speech synthesis;
fLanguage
English
Publisher
ieee
Conference_Titel
Global Telecommunications Conference, 2003. GLOBECOM '03. IEEE
Print_ISBN
0-7803-7974-8
Type
conf
DOI
10.1109/GLOCOM.2003.1258630
Filename
1258630
Link To Document