DocumentCode :
57324
Title :
Scalable Speech Coding for IP Networks: Beyond iLBC
Author :
Seto, Kota ; Ogunfunmi, Tokunbo
Author_Institution :
Dept. of Electr. Eng., Santa Clara Univ., Santa Clara, CA, USA
Volume :
21
Issue :
11
fYear :
2013
fDate :
Nov. 2013
Firstpage :
2337
Lastpage :
2345
Abstract :
High quality speech at low bit rates makes code excited linear prediction (CELP) the dominant choice for a narrowband coding technique despite the susceptibility to packet loss. One of the few techniques which received attention after the introduction of CELP coding technique is the internet low bitrate codec (iLBC) because of inherent high robustness to packet loss. Addition of rate flexibility and scalability makes the iLBC an attractive choice for voice communication over IP networks. In this paper, performance improvement schemes of multi-rate iLBC and its scalable structure are proposed, and the proposed codec enhanced from the previous work is re-designed based on the subjective listening quality instead of the objective quality. In particular, perceptual weighting and the modified discrete cosine transform (MDCT) with short overlap in weighted signal domain are employed along with the improved packet loss concealment (PLC) algorithm. The subjective evaluation results show that the speech quality of the proposed codec is equivalent to that of state-of-the-art codec, G.718, under both a clean channel condition and lossy channel conditions. This result is significant considering that development of the proposed codec is still in early stage.
Keywords :
IP networks; Internet telephony; discrete cosine transforms; linear predictive coding; speech codecs; speech coding; voice communication; CELP coding technique; G.718 codec; IP networks; Internet low bitrate codec; MDCT; clean channel condition; code-excited linear prediction; codec speech quality; improved PLC algorithm; improved packet loss concealment algorithm; lossy channel condition; modified discrete cosine transform; multirate iLBC; narrowband coding technique; packet loss; perceptual weighting; quality speech; rate flexibility; rate scalability; scalable speech coding; subjective listening quality; voice communication; weighted signal domain; Bit rate; Codecs; Discrete cosine transforms; Encoding; Packet loss; Speech; Discrete cosine transform (DCT); internet low bitrate codec (iLBC); packet loss; scalable coding; speech coding; voice over Internet protocol (VoIP);
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2013.2274694
Filename :
6567952
Link To Document :
بازگشت