DocumentCode :
178635
Title :
Online NON-negative Tensor Deconvolution for source detection in 3DTV audio
Author :
Mitsufuji, Yuki ; Liuni, M. ; Baker, Anthony ; Roebel, A.
Author_Institution :
Sony Corp. Tokyo, Tokyo, Japan
fYear :
2014
fDate :
4-9 May 2014
Firstpage :
3082
Lastpage :
3086
Abstract :
The following article describes research on source detection in multi channel (3DTV) audio streams. The problem is extremely complex due to the fact that multiple layers can be present in scenes (background music, ambience, commentator). In this work a new algorithm is developed that exploits the information from the different audio channels to detect, and possibly localize and separate independent audio sources. An algorithm based on online Non-negative Tensor Deconvolution is realized, to deal with sound sources with time dependent positions in the channel matrix. The evaluation is made on 3DTV 5.1 film soundtracks and on synthetic mixes of 3DTV 5.1 audio with target sounds from a sound effects database: a significant improvement of the detection performance is shown, compared with other decomposition techniques.
Keywords :
audio signal processing; audio streaming; deconvolution; three-dimensional television; 3DTV audio; audio channel detection; multichannel audio stream; online nonnegative tensor deconvolution; sound source; source detection; Conferences; Databases; Dictionaries; Optimized production technology; Source separation; Tensile stress; Training; 3DTV audio; Dictionary training; event detection; nonnegative tensor deconvolution; source separation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/ICASSP.2014.6854167
Filename :
6854167
Link To Document :
بازگشت