Title :
Fast Algorithms for Low-Delay SBR Filterbanks in MPEG-4 AAC-ELD
Author :
Chivukula, Ravi K. ; Reznik, Yuriy A. ; Devarajan, Venkat ; Jayendra-Lakshman, Mythreya
Author_Institution :
Qualcomm Inc., San Diego, CA, USA
fDate :
3/1/2012
Abstract :
The MPEG committee has recently completed development of a new audio coding standard, “MPEG-4 Advanced Audio Coding-Enhanced Low Delay” (AAC-ELD). AAC-ELD targets high-quality, full-duplex communication applications such as audio and video conferencing. It combines low delay spectral band replication (LD-SBR) technology with a low delay AAC core encoder to achieve high coding efficiency and low algorithmic delay. In this paper, we present fast algorithms for computing the LD-SBR filterbanks in AAC-ELD. The proposed algorithms map the complex exponential modulation portion of the filterbanks to discrete cosine transforms of types IV and II. The proposed mapping also allows some multiplications to be merged with the windowing stage that precedes or follows the modulation step, which further reduces computational complexity. Our presentation includes a detailed explanation and flow-graphs of the algorithms, a complexity analysis, and comparisons with alternative implementations.
Keywords :
audio coding; channel bank filters; computational complexity; discrete cosine transforms; graph theory; teleconferencing; DCT; MPEG-4 AAC-ELD; advanced audio coding-enhanced low delay; audio conferencing; complex exponential modulation; flow-graphs; full-duplex communication applications; high coding efficiency; low algorithmic delays; low delay spectral band replication technology; low-delay SBR filterbanks; video conferencing; windowing stage; Algorithm design and analysis; Decoding; Delay; Modulation; Speech; Transform coding; Advanced audio coding (AAC); DCT-IV; Moving Picture Experts Group (MPEG); discrete cosine transform (DCT); factorization; fast algorithms; filterbank; low delay audio coding; spectral band replication (SBR)
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2011.2170971