DocumentCode :
2182013
Title :
A system for effortless content annotation to unfold the semantics in videos
Author :
Lienhart, Rainer
Author_Institution :
Microprocessor Res. Lab., Intel Corp., Santa Clara, CA, USA
fYear :
2000
fDate :
2000
Firstpage :
45
Lastpage :
49
Abstract :
We propose and investigate a new but simple and natural extension of the way people record video. This extension allows one to unfold the semantics of video clips and thus enables a completely new set of applications for raw video footage. Two microphones are connected to a camcorder: a headworn speech input microphone and an environmental microphone. During recording the cameraman speaks out loud content-descriptive annotations and/or editing commands. Due to the two-microphones setup the sound of annotations and editing commands can be removed from the environmental audio by adaptive filtering enabling people to play back the video as if there had been no annotations. Simultaneously, these annotations are transcribed to ASCII by means of a standard speech recognition engine. The viability of this approach is demonstrated by means of an important application for video libraries: the automatic abstraction of raw video footage
Keywords :
content-based retrieval; microphones; natural languages; speech recognition; telecommunication computing; video databases; video signal processing; ASCII; adaptive filtering; automatic abstraction; camcorder; cameraman; content annotation; content-descriptive annotations; editing commands; environmental microphone; headworn speech input microphone; microphones; raw video footage; semantics; speech recognition engine; video clips; video libraries; videos; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Content-based Access of Image and Video Libraries, 2000. Proceedings. IEEE Workshop on
Conference_Location :
Hilton Head Island, SC
Print_ISBN :
0-7695-0695-X
Type :
conf
DOI :
10.1109/IVL.2000.853838
Filename :
853838
Link To Document :
بازگشت