مرکز منطقه ای اطلاع رساني علوم و فناوري - Multichannel audio synthesis by subband-based spectral conversion and parameter adaptation

DocumentCode :

1239878

Title :

Multichannel audio synthesis by subband-based spectral conversion and parameter adaptation

Author :

Mouchtaris, Athanasios ; Narayanan, Shrikanth S. ; Kyriakakis, Chris

Author_Institution :

Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA, USA

Volume :

Issue :

fYear :

2005

fDate :

3/1/2005 12:00:00 AM

Firstpage :

263

Lastpage :

274

Abstract :

Multichannel audio can immerse a group of listeners in a seamless aural environment. Previously, we proposed a system capable of synthesizing the multiple channels of a virtual multichannel recording from a smaller set of reference recordings. This problem was termed multichannel audio resynthesis and the application was to reduce the excessive transmission requirements of multichannel audio. In this paper, we address the more general problem of multichannel audio synthesis, i.e., how to completely synthesize a multichannel audio recording from a specific stereophonic or monophonic recording, which would significantly enhance the recording´s acoustic impression. We approach this problem by extending the model employed for the resynthesis problem. This is accomplished by adapting the resynthesis conversion parameters to the statistical properties of the recording that we wish to enhance. This parameter adaptation is similar to the task adaptation employed in speech recognition, when a specific model is applied to a different environment (speaker, language or channel). One particular approach to this problem is shown here to be quite advantageous toward solving the multichannel audio synthesis problem as well.

Keywords :

audio recording; audio signal processing; signal synthesis; statistical analysis; monophonic recording; multichannel audio synthesis; parameter adaptation; resynthesis conversion parameter; seamless aural environment; speech recognition; stereophonic recording; subband-based spectral conversion; virtual multichannel recording; Audio recording; Audio systems; Digital recording; IIR filters; Microphones; Modeling; Natural languages; Signal processing; Speech recognition; Systems engineering and theory; Audio recording; Gaussian mixture model; audio resynthesis; audio systems; multichannel audio; virtual microphones;

fLanguage :

English

Journal_Title :

Speech and Audio Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1063-6676

Type :

jour

DOI :

10.1109/TSA.2004.841061

Filename :

1395971

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1239878