DocumentCode
43741
Title
Linear Estimation Based Primary-Ambient Extraction for Stereo Audio Signals
Author
Jianjun He ; Ee-Leng Tan ; Woon-Seng Gan
Author_Institution
Digital Signal Process. Lab., Nanyang Technol. Univ., Singapore, Singapore
Volume
22
Issue
2
fYear
2014
fDate
Feb. 2014
Firstpage
505
Lastpage
517
Abstract
Audio signals for moving pictures and video games are often linear combinations of primary and ambient components. In spatial audio analysis-synthesis, these mixed signals are usually decomposed into primary and ambient components to facilitate flexible spatial rendering and enhancement. Existing approaches such as principal component analysis (PCA) and least squares (LS) are widely used to perform this decomposition from stereo signals. However, the performance of these approaches in primary-ambient extraction (PAE) has not been well studied and no comparative analysis among the existing approaches has been carried out so far. In this paper, we generalize the existing approaches into a linear estimation framework. Under this framework, we propose a series of performance measures to identify the components that contribute to the extraction error. Based on the generalized linear estimation framework and our proposed performance measures, a comparative study and experimental testing of the linear estimation based PAE approaches including existing PCA, LS, and three proposed variant LS approaches are presented.
Keywords
audio signal processing; least squares approximations; principal component analysis; PCA; flexible spatial rendering; generalized linear estimation framework; least squares approach; primary-ambient extraction; principal component analysis; spatial audio analysis-synthesis; stereo audio signal; Accuracy; Correlation; Distortion measurement; Estimation; Interference; Principal component analysis; Vectors; Least squares (LS); linear estimation; performance measure; primary-ambient extraction (PAE); principal component analysis (PCA); spatial audio;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
Publisher
ieee
ISSN
2329-9290
Type
jour
DOI
10.1109/TASLP.2013.2297015
Filename
6698287
Link To Document