• DocumentCode
    1320708
  • Title

    A smart background music mixing algorithm for portable digital imaging devices

  • Author

    Jin Ah Kang ; Chan Jun Chun ; Hong Kook Kim ; Myeong Bo Kim ; Sang Ryong Kim

  • Author_Institution
    Sch. of Inf. & Commun., Gwangju Inst. of Sci. & Technol. (GIST), Gwangju, South Korea
  • Volume
    57
  • Issue
    3
  • fYear
    2011
  • fDate
    8/1/2011 12:00:00 AM
  • Firstpage
    1258
  • Lastpage
    1263
  • Abstract
    In this paper, we propose a smart background music (BGM) mixing algorithm for portable digital imaging devices to enable users to enjoy video content with BGM. The proposed algorithm automatically adjusts the BGM output energy based on the activity and energy of the foreground audio (FGA) contained in a video file. To this end, the proposed algorithm classifies each segment of FGA as speech, non-speech, or a mixed signal. It then estimates a scale factor for mixing FGA and BGM according to the signal classification result and the energy of FGA. In addition, a fade-in and fade-out process is incorporated in the proposed algorithm to improve the perceptual quality of the output audio at the boundaries where the signal classification changes. To demonstrate the effectiveness of the proposed algorithm, we implement it on a portable digital imaging device in real time and compare users' preference for the proposed algorithm with their preference for conventional algorithms that mix FGA with BGM based on voice activity detection or a predefined fixed scale factor. The experiments show that the proposed algorithm is preferred over the conventional algorithms, with a preference of around 79%.
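
    The pipeline described in the abstract (classify each FGA segment, derive a BGM scale factor from the class and the FGA energy, then smooth gain changes at class boundaries) can be sketched roughly as follows. This is only an illustration and not the authors' implementation: the per-class gain values, the `ref_energy` constant, the energy-to-gain mapping, the linear ramp, and all function names are assumptions chosen for the example, and the segment labels are taken as given rather than computed by a classifier.

```python
# Hypothetical per-class BGM gain ceilings (assumed values, not from the paper).
CLASS_GAIN = {"speech": 0.2, "mixed": 0.5, "non-speech": 1.0}


def segment_energy(samples):
    """Mean-square energy of one audio segment (0.0 for an empty segment)."""
    if not samples:
        return 0.0
    return sum(s * s for s in samples) / len(samples)


def bgm_scale(label, fga_energy, ref_energy=0.01):
    """Assumed mapping: start from the class ceiling and lower the BGM gain
    as the foreground energy grows relative to ref_energy."""
    attenuation = ref_energy / (ref_energy + fga_energy)
    return CLASS_GAIN[label] * attenuation


def mix_segments(fga_segments, bgm_segments, labels):
    """Mix FGA and BGM segment by segment.

    Within each segment the BGM gain ramps linearly from the previous
    segment's gain to the new target, a simple stand-in for the
    fade-in/fade-out step applied where the classification changes.
    """
    out = []
    prev_gain = None
    for fga, bgm, label in zip(fga_segments, bgm_segments, labels):
        target = bgm_scale(label, segment_energy(fga))
        start = target if prev_gain is None else prev_gain
        n = len(fga)
        for i in range(n):
            gain = start + (target - start) * (i + 1) / n
            out.append(fga[i] + gain * bgm[i])
        prev_gain = target
    return out
```

    For example, calling `mix_segments` with a silent non-speech segment followed by a speech segment lowers the BGM gain gradually across the second segment instead of switching it abruptly at the boundary.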
  • Keywords
    audio signal processing; image classification; image segmentation; music; speech processing; video signal processing; BGM; FGA segment; fixed scale factor; foreground audio; portable digital imaging device; smart background music mixing algorithm; speech signal classification; video content; video file; voice activity detection; Algorithm design and analysis; Classification algorithms; Clocks; Digital images; Performance evaluation; Signal processing algorithms; Speech; Portable digital imaging device; audio content classification; audio mixing; background music; fade-in and fade-out
  • fLanguage
    English
  • Journal_Title
    Consumer Electronics, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    0098-3063
  • Type

    jour

  • DOI
    10.1109/TCE.2011.6018882
  • Filename
    6018882