Title :
Multichannel audio synthesis by subband-based spectral conversion and parameter adaptation
Author :
Mouchtaris, Athanasios ; Narayanan, Shrikanth S. ; Kyriakakis, Chris
Author_Institution :
Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA, USA
fDate :
3/1/2005 12:00:00 AM
Abstract :
Multichannel audio can immerse a group of listeners in a seamless aural environment. Previously, we proposed a system capable of synthesizing the multiple channels of a virtual multichannel recording from a smaller set of reference recordings. This problem was termed multichannel audio resynthesis and the application was to reduce the excessive transmission requirements of multichannel audio. In this paper, we address the more general problem of multichannel audio synthesis, i.e., how to completely synthesize a multichannel audio recording from a specific stereophonic or monophonic recording, which would significantly enhance the recording´s acoustic impression. We approach this problem by extending the model employed for the resynthesis problem. This is accomplished by adapting the resynthesis conversion parameters to the statistical properties of the recording that we wish to enhance. This parameter adaptation is similar to the task adaptation employed in speech recognition, when a specific model is applied to a different environment (speaker, language or channel). One particular approach to this problem is shown here to be quite advantageous toward solving the multichannel audio synthesis problem as well.
Keywords :
audio recording; audio signal processing; signal synthesis; statistical analysis; monophonic recording; multichannel audio synthesis; parameter adaptation; resynthesis conversion parameter; seamless aural environment; speech recognition; stereophonic recording; subband-based spectral conversion; virtual multichannel recording; Audio recording; Audio systems; Digital recording; IIR filters; Microphones; Modeling; Natural languages; Signal processing; Speech recognition; Systems engineering and theory; Audio recording; Gaussian mixture model; audio resynthesis; audio systems; multichannel audio; virtual microphones;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
DOI :
10.1109/TSA.2004.841061