DocumentCode
3715945
Title
A quasi-orthogonal, invertible, and perceptually relevant time-frequency transform for audio coding
Author
Olivier Derrien;Thibaud Necciarf;Peter Balazs
Author_Institution
Université
fYear
2015
Firstpage
799
Lastpage
803
Abstract
We describe ERB-MDCT, an invertible real-valued time-frequency transform based on MDCT, which is widely used in audio coding (e.g. MP3 and AAC). ERB-MDCT was designed similarly to ERBLet, a recent invertible transform with a resolution evolving across frequency to match the perceptual ERB frequency scale, while the frequency scale in most invertible transforms (e.g. MDCT) is uniform. ERB-MDCT has mostly the same frequency scale as ERBLet, but the main improvement is that atoms are quasi-orthogonal, i.e. its redundancy is close to 1. Furthermore, the energy is more sparse in the time-frequency plane. Thus, it is more suitable for audio coding than ERBLet.
Keywords
"Redundancy","Transforms","Audio coding","Signal resolution","Time-frequency analysis","Bandwidth"
Publisher
ieee
Conference_Titel
Signal Processing Conference (EUSIPCO), 2015 23rd European
Electronic_ISBN
2076-1465
Type
conf
DOI
10.1109/EUSIPCO.2015.7362493
Filename
7362493
Link To Document