• DocumentCode
    68328
  • Title

    Primary-Ambient Extraction Using Ambient Spectrum Estimation for Immersive Spatial Audio Reproduction

  • Author

    Jianjun He ; Woon-Seng Gan ; Ee-Leng Tan

  • Author_Institution
    Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • Volume
    23
  • Issue
    9
  • fYear
    2015
  • fDate
    Sept. 2015
  • Firstpage
    1431
  • Lastpage
    1444
  • Abstract
    The diversity of today´s playback systems requires a flexible, efficient, and immersive reproduction of sound scenes in digital media. Spatial audio reproduction based on primary-ambient extraction (PAE) fulfills this objective, where accurate extraction of primary and ambient components from sound mixtures in channel-based audio is crucial. Severe extraction error was found in existing PAE approaches when dealing with sound mixtures that contain a relatively strong ambient component, a commonly encountered case in the sound scenes of digital media. In this paper, we propose a novel ambient spectrum estimation (ASE) framework to improve the performance of PAE. The ASE framework exploits the equal magnitude of the uncorrelated ambient components in two channels of a stereo signal, and reformulates the PAE problem into the problem of estimating either ambient phase or magnitude. In particular, we take advantage of the sparse characteristic of the primary components to derive sparse solutions for ASE based PAE, together with an approximate solution that can significantly reduce the computational cost. Our objective and subjective experimental results demonstrate that the proposed ASE approaches significantly outperform existing approaches, especially when the ambient component is relatively strong.
  • Keywords
    audio signal processing; time-frequency analysis; ASE framework; Immersive Spatial Audio Reproduction; PAE approaches; Primary-Ambient Extraction; channel-based audio; novel ambient spectrum estimation framework; stereo signal; Estimation; Loudspeakers; Media; Rendering (computer graphics); Spectral analysis; Speech; Time-frequency analysis; Ambient spectrum estimation (ASE); computational efficiency; primary-ambient extraction (PAE); sparsity; spatial audio;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    2329-9290
  • Type

    jour

  • DOI
    10.1109/TASLP.2015.2434272
  • Filename
    7109833