• DocumentCode
    57324
  • Title

    Scalable Speech Coding for IP Networks: Beyond iLBC

  • Author

    Seto, Kota ; Ogunfunmi, Tokunbo

  • Author_Institution
    Dept. of Electr. Eng., Santa Clara Univ., Santa Clara, CA, USA
  • Volume
    21
  • Issue
    11
  • fYear
    2013
  • fDate
    Nov. 2013
  • Firstpage
    2337
  • Lastpage
    2345
  • Abstract
    High quality speech at low bit rates makes code excited linear prediction (CELP) the dominant choice for a narrowband coding technique despite the susceptibility to packet loss. One of the few techniques which received attention after the introduction of CELP coding technique is the internet low bitrate codec (iLBC) because of inherent high robustness to packet loss. Addition of rate flexibility and scalability makes the iLBC an attractive choice for voice communication over IP networks. In this paper, performance improvement schemes of multi-rate iLBC and its scalable structure are proposed, and the proposed codec enhanced from the previous work is re-designed based on the subjective listening quality instead of the objective quality. In particular, perceptual weighting and the modified discrete cosine transform (MDCT) with short overlap in weighted signal domain are employed along with the improved packet loss concealment (PLC) algorithm. The subjective evaluation results show that the speech quality of the proposed codec is equivalent to that of state-of-the-art codec, G.718, under both a clean channel condition and lossy channel conditions. This result is significant considering that development of the proposed codec is still in early stage.
  • Keywords
    IP networks; Internet telephony; discrete cosine transforms; linear predictive coding; speech codecs; speech coding; voice communication; CELP coding technique; G.718 codec; IP networks; Internet low bitrate codec; MDCT; clean channel condition; code-excited linear prediction; codec speech quality; improved PLC algorithm; improved packet loss concealment algorithm; lossy channel condition; modified discrete cosine transform; multirate iLBC; narrowband coding technique; packet loss; perceptual weighting; quality speech; rate flexibility; rate scalability; scalable speech coding; subjective listening quality; voice communication; weighted signal domain; Bit rate; Codecs; Discrete cosine transforms; Encoding; Packet loss; Speech; Discrete cosine transform (DCT); internet low bitrate codec (iLBC); packet loss; scalable coding; speech coding; voice over Internet protocol (VoIP);
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2013.2274694
  • Filename
    6567952