DocumentCode :
667470
Title :
Virtual autoencoder based recommendation system for individualizing head-related transfer functions
Author :
Yuancheng Luo ; Zotkin, Dmitry N. ; Duraiswami, Ramani
Author_Institution :
Dept. of Comput. Sci., Univ. of Maryland, College Park, MD, USA
fYear :
2013
fDate :
20-23 Oct. 2013
Firstpage :
1
Lastpage :
4
Abstract :
We propose a virtual autoencoder based recommendation system for learning a user´s Head-related Transfer Functions (HRTFs) without subjecting a listener to impulse response or anthropometric measurements. Autoencoder neural-networks generalize principal component analysis (PCA) and learn non-linear feature spaces that supports both out-of-sample embedding and reconstruction; this may be applied to developing a more expressive low-dimensional HRTF representation. One application is to individualize HRTFs by tuning along the autoencoder feature spaces. We demonstrate this new approach by developing a virtual (black-box) user that can localize sound from query HRTFs reconstructed from those spaces. Standard optimization methods tune the autoencoder features based on the virtual user´s feedback. Experiments with CIPIC HRTFs show that the virtual user can localize along out-of-sample directions and that optimization in the autoencoder feature space improves upon initial non-individualized HRTFs. Other applications of the representation are also discussed.
Keywords :
codecs; neural nets; optimisation; principal component analysis; telecommunication computing; transient response; CIPIC HRTF; PCA; anthropometric measurements; autoencoder features; autoencoder neural-networks; black box; head-related transfer functions; impulse response; low-dimensional HRTF representation; nonlinear feature spaces; out-of-sample embedding; principal component analysis; standard optimization methods; virtual autoencoder based recommendation system; virtual user feedback; Acoustics; Conferences; Optimization; Principal component analysis; Signal processing; Training; Vectors; Autoencoder; Gaussian Process Regression; Head-related Transfer Function;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Applications of Signal Processing to Audio and Acoustics (WASPAA), 2013 IEEE Workshop on
Conference_Location :
New Paltz, NY
ISSN :
1931-1168
Type :
conf
DOI :
10.1109/WASPAA.2013.6701816
Filename :
6701816
Link To Document :
بازگشت