DocumentCode
2828784
Title
Image and audio sequence visualization and interaction mechanisms for structured video browsing and editing
Author
Toklu, Candemir ; Liou, Shih-Ping
Author_Institution
Dept. of Multimedia & Video Technol., Siemens Corp. Res. Inc., Princeton, NJ, USA
Volume
2
fYear
2000
fDate
10-13 Sept. 2000
Firstpage
263
Abstract
We discuss extensions to our video browsing tool described by Hjelsvold et al. (see Handbook of Internet and Multimedia Systems and Applications, 1998). We propose adding audio-related visual information to the tool: the audio track of the video is represented by its spectrogram image and pitch curve, which enriches the video- and audio-related information available to the user. This representation also facilitates correcting automatically computed audio event boundaries and introducing speaker segments. We also provide a real-time approach for segmenting audio into events (silence, speech, and non-speech) to further enhance the audio information space.
Keywords
audio signals; image representation; image sequences; online front-ends; spectral analysis; video signal processing; audio event boundaries; audio related information enhancement; audio segmentation; audio sequence visualization; audio track representation; image sequence visualization; interaction mechanisms; non-speech event; pitch curve; real-time approach; silence; speaker segments; spectrogram image; speech event; structured video browsing; structured video editing; video browsing tool; video related information enhancement; Bandwidth; Computer networks; Educational institutions; Humans; Image segmentation; Image sequences; Spectrogram; Speech analysis; Speech enhancement; Visualization;
fLanguage
English
Publisher
IEEE
Conference_Titel
Image Processing, 2000. Proceedings. 2000 International Conference on
Conference_Location
Vancouver, BC, Canada
ISSN
1522-4880
Print_ISBN
0-7803-6297-7
Type
conf
DOI
10.1109/ICIP.2000.899296
Filename
899296