DocumentCode :
729723
Title :
Visualizing video sounds with sound word animation
Author :
Fangzhou Wang ; Nagano, Hidehisa ; Kashino, Kunio ; Igarashi, Takeo
Author_Institution :
Univ. of Tokyo, Tokyo, Japan
fYear :
2015
fDate :
June 29 2015-July 3 2015
Firstpage :
1
Lastpage :
6
Abstract :
Text captions are important means to provide sound information in videos when the sound is not accessible. However, conventional text captions are far less expressive for non-verbal sounds since they are designed to visualize speech sound. To address this problem, we propose a method for automatically transforming non-verbal video sounds to animated sound words, and positioning them near the sound source objects in the video for visualization. This provides natural visual representation of non-verbal sounds with rich information about the sound category and dynamics. We conducted a user study with over 300 participants using an online crowdsourcing service. The results showed that animated sound words could not only effectively and naturally visualize the dynamics of sound while clarify the position of the sound source, but also contribute to making video watching more enjoyable and increasing the visual impact of the video.
Keywords :
computer animation; data visualisation; video signal processing; natural visual representation; nonverbal video sounds; sound word animation; video sound visualization; video watching; Algorithm design and analysis; Animation; Attenuation; Engines; Image segmentation; Support vector machines; Visualization; Sound word; entertainment; environmental sound processing; sound visualization; video annotation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo (ICME), 2015 IEEE International Conference on
Conference_Location :
Turin
Type :
conf
DOI :
10.1109/ICME.2015.7177422
Filename :
7177422
Link To Document :
بازگشت