• DocumentCode
    401263
  • Title

    A 40 bps speech coding scheme

  • Author

    Lopes, Cristina Videira ; Chadha, Anshuman

  • Author_Institution
    Sch. of Inf. & Comput. Sci., California Univ., Irvine, CA, USA
  • Volume
    4
  • fYear
    2003
  • fDate
    1-5 Dec. 2003
  • Firstpage
    2223
  • Abstract
    We describe a method and an implementation for producing a highly compressed representation of speech, in the order of 40 bps. This compression method uses a speech recognition engine to analyze the speech signal at the morphological level, i.e. the words. The words are then coded using a word-level text compression mechanism. After decompression, the speech message is recovered using text-to-speech synthesis. We report experimental results of our implementation. In particular, we observed that human listeners were able to recover from errors introduced by the speech recognition engine, and that the human perceptual errors were highly dependent on the content of the messages, especially regarding familiarity with the topic.
  • Keywords
    data compression; speech coding; speech recognition; speech synthesis; text analysis; 40 bit/s; human perceptual errors; morphological level; speech coding; speech recognition engine; text-to-speech synthesis; word-level text compression mechanism; Bit rate; Computer science; Decoding; Engines; Humans; Signal synthesis; Speech analysis; Speech coding; Speech recognition; Speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Global Telecommunications Conference, 2003. GLOBECOM '03. IEEE
  • Print_ISBN
    0-7803-7974-8
  • Type

    conf

  • DOI
    10.1109/GLOCOM.2003.1258630
  • Filename
    1258630