• DocumentCode
    3715945
  • Title

    A quasi-orthogonal, invertible, and perceptually relevant time-frequency transform for audio coding

  • Author

    Olivier Derrien;Thibaud Necciarf;Peter Balazs

  • Author_Institution
    Université
  • fYear
    2015
  • Firstpage
    799
  • Lastpage
    803
  • Abstract
    We describe ERB-MDCT, an invertible real-valued time-frequency transform based on MDCT, which is widely used in audio coding (e.g. MP3 and AAC). ERB-MDCT was designed similarly to ERBLet, a recent invertible transform with a resolution evolving across frequency to match the perceptual ERB frequency scale, while the frequency scale in most invertible transforms (e.g. MDCT) is uniform. ERB-MDCT has mostly the same frequency scale as ERBLet, but the main improvement is that atoms are quasi-orthogonal, i.e. its redundancy is close to 1. Furthermore, the energy is more sparse in the time-frequency plane. Thus, it is more suitable for audio coding than ERBLet.
  • Keywords
    "Redundancy","Transforms","Audio coding","Signal resolution","Time-frequency analysis","Bandwidth"
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference (EUSIPCO), 2015 23rd European
  • Electronic_ISBN
    2076-1465
  • Type

    conf

  • DOI
    10.1109/EUSIPCO.2015.7362493
  • Filename
    7362493