• DocumentCode
    390657
  • Title

    Low bit and variable rate speech coding using local cosine transform

  • Author

    Enqing, Dong ; Heming, Zhao ; YongLi, Li

  • Author_Institution
    Dept. of Commun. & Electron. Eng., Soochow Univ., Su Zhou, China
  • Volume
    1
  • fYear
    2002
  • fDate
    28-31 Oct. 2002
  • Firstpage
    423
  • Abstract
    An average1.6kb/s low bit and variable rate speech coder based on local cosine transform (LCT) algorithm for a two-way conversational speech is designed for the first time in the paper. The result of the voice activity detector (VAD) based on support vector machine (SVM) and the classification method of the voicing modes of the GSM half rate standard for active speech are adopted in the design of the variable bit rate coder. The moderately voiced mode and the strongly voiced mode of the voicing modes are combined as a voicing mode, the new combined voicing mode is named as a moderately and strongly voiced mode. A few segment vector quantizers of the LCT coefficients for each voicing mode and silence voicing frame (background noise) are employed, and LGB algorithm is applied to design the codebooks. A tree fast search technique is used to select the vector of the LCT coefficients for each segment. The evaluation using subject informal listening tests and a few object parameters indicates that the speech quality (intelligibility and naturalness) of the designed speech coder is better than that of the FS1015 standard coder. The new coder has higher robust than the FS1015 standard coder, which is suitable for speech coding in any environments.
  • Keywords
    speech coding; speech intelligibility; support vector machines; transforms; vector quantisation; local cosine transform; object parameters; segment vector quantizers; speech coding; speech quality; subject informal listening tests; support vector machine; two-way conversational speech; voice activity detector; Algorithm design and analysis; Background noise; Bit rate; Code standards; Detectors; GSM; Speech analysis; Speech coding; Support vector machine classification; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    TENCON '02. Proceedings. 2002 IEEE Region 10 Conference on Computers, Communications, Control and Power Engineering
  • Print_ISBN
    0-7803-7490-8
  • Type

    conf

  • DOI
    10.1109/TENCON.2002.1181304
  • Filename
    1181304