DocumentCode
1320708
Title
A smart background music mixing algorithm for portable digital imaging devices
Author
Jin Ah Kang ; Chan Jun Chun ; Hong Kook Kim ; Myeong Bo Kim ; Sang Ryong Kim
Author_Institution
Sch. of Inf. & Commun., Gwangju Inst. of Sci. & Technol. (GIST), Gwangju, South Korea
Volume
57
Issue
3
fYear
2011
fDate
8/1/2011 12:00:00 AM
Firstpage
1258
Lastpage
1263
Abstract
In this paper, we propose a smart background music (BGM) mixing algorithm for portable digital imaging devices to enable users to enjoy video content with BGM. The proposed algorithm automatically adjusts the BGM output energy based on the activity and energy of foreground audio (FGA) contained in a video file. To this end, the proposed algorithm classifies each segment of FGA as speech, non-speech, or a mixed signal. After that, it estimates a scale factor for mixing FGA and BGM according to the signal classification result and the energy of FGA. In addition, a fade-in and fade-out process is incorporated in the proposed algorithm in order to improve the perceptual quality of output audio at the boundaries where signal classification is changed. In order to demonstrate the effectiveness of the proposed algorithm, we implement it on a portable digital imaging device in real time and compare the user´s preference of the proposed algorithm with those of conventional algorithms that mixes FGA with BGM based on voice activity detection or a predefined fixed scale factor. It is shown from the experiments that the proposed algorithm is pretty much preferred by around 79%, compared to the conventional algorithms.
Keywords
audio signal processing; image classification; image segmentation; music; speech processing; video signal processing; BGM; FGA segment; fixed scale factor; foreground audio; portable digital imaging device; smart background music mixing algorithm; speech signal classification; video content; video file; voice activity detection; Algorithm design and analysis; Classification algorithms; Clocks; Digital images; Performance evaluation; Signal processing algorithms; Speech; Portable digital imaging device; audio content classification; audio mixing; backgroundmusic; fade-in andfade-out;
fLanguage
English
Journal_Title
Consumer Electronics, IEEE Transactions on
Publisher
ieee
ISSN
0098-3063
Type
jour
DOI
10.1109/TCE.2011.6018882
Filename
6018882
Link To Document