DocumentCode :
1239878
Title :
Multichannel audio synthesis by subband-based spectral conversion and parameter adaptation
Author :
Mouchtaris, Athanasios ; Narayanan, Shrikanth S. ; Kyriakakis, Chris
Author_Institution :
Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA, USA
Volume :
13
Issue :
2
fYear :
2005
fDate :
3/1/2005 12:00:00 AM
Firstpage :
263
Lastpage :
274
Abstract :
Multichannel audio can immerse a group of listeners in a seamless aural environment. Previously, we proposed a system capable of synthesizing the multiple channels of a virtual multichannel recording from a smaller set of reference recordings. This problem was termed multichannel audio resynthesis and the application was to reduce the excessive transmission requirements of multichannel audio. In this paper, we address the more general problem of multichannel audio synthesis, i.e., how to completely synthesize a multichannel audio recording from a specific stereophonic or monophonic recording, which would significantly enhance the recording´s acoustic impression. We approach this problem by extending the model employed for the resynthesis problem. This is accomplished by adapting the resynthesis conversion parameters to the statistical properties of the recording that we wish to enhance. This parameter adaptation is similar to the task adaptation employed in speech recognition, when a specific model is applied to a different environment (speaker, language or channel). One particular approach to this problem is shown here to be quite advantageous toward solving the multichannel audio synthesis problem as well.
Keywords :
audio recording; audio signal processing; signal synthesis; statistical analysis; monophonic recording; multichannel audio synthesis; parameter adaptation; resynthesis conversion parameter; seamless aural environment; speech recognition; stereophonic recording; subband-based spectral conversion; virtual multichannel recording; Audio recording; Audio systems; Digital recording; IIR filters; Microphones; Modeling; Natural languages; Signal processing; Speech recognition; Systems engineering and theory; Audio recording; Gaussian mixture model; audio resynthesis; audio systems; multichannel audio; virtual microphones;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/TSA.2004.841061
Filename :
1395971
Link To Document :
بازگشت