Title :
Virtual autoencoder based recommendation system for individualizing head-related transfer functions
Author :
Yuancheng Luo ; Zotkin, Dmitry N. ; Duraiswami, Ramani
Author_Institution :
Dept. of Comput. Sci., Univ. of Maryland, College Park, MD, USA
Abstract :
We propose a virtual autoencoder based recommendation system for learning a user´s Head-related Transfer Functions (HRTFs) without subjecting a listener to impulse response or anthropometric measurements. Autoencoder neural-networks generalize principal component analysis (PCA) and learn non-linear feature spaces that supports both out-of-sample embedding and reconstruction; this may be applied to developing a more expressive low-dimensional HRTF representation. One application is to individualize HRTFs by tuning along the autoencoder feature spaces. We demonstrate this new approach by developing a virtual (black-box) user that can localize sound from query HRTFs reconstructed from those spaces. Standard optimization methods tune the autoencoder features based on the virtual user´s feedback. Experiments with CIPIC HRTFs show that the virtual user can localize along out-of-sample directions and that optimization in the autoencoder feature space improves upon initial non-individualized HRTFs. Other applications of the representation are also discussed.
Keywords :
codecs; neural nets; optimisation; principal component analysis; telecommunication computing; transient response; CIPIC HRTF; PCA; anthropometric measurements; autoencoder features; autoencoder neural-networks; black box; head-related transfer functions; impulse response; low-dimensional HRTF representation; nonlinear feature spaces; out-of-sample embedding; principal component analysis; standard optimization methods; virtual autoencoder based recommendation system; virtual user feedback; Acoustics; Conferences; Optimization; Principal component analysis; Signal processing; Training; Vectors; Autoencoder; Gaussian Process Regression; Head-related Transfer Function;
Conference_Titel :
Applications of Signal Processing to Audio and Acoustics (WASPAA), 2013 IEEE Workshop on
Conference_Location :
New Paltz, NY
DOI :
10.1109/WASPAA.2013.6701816