DocumentCode :
11383
Title :
Content Based Lecture Video Retrieval Using Speech and Video Text Information
Author :
Haojin Yang ; Meinel, Christoph
Author_Institution :
Hasso-Plattner-Inst. for Software Syst. Eng. GmbH (HPI), Potsdam, Germany
Volume :
7
Issue :
2
fYear :
2014
fDate :
April-June 2014
Firstpage :
142
Lastpage :
154
Abstract :
In the last decade e-lecturing has become more and more popular. The amount of lecture video data on the World Wide Web (WWW) is growing rapidly. Therefore, a more efficient method for video retrieval in WWW or within large lecture video archives is urgently needed. This paper presents an approach for automated video indexing and video search in large lecture video archives. First of all, we apply automatic video segmentation and key-frame detection to offer a visual guideline for the video content navigation. Subsequently, we extract textual metadata by applying video Optical Character Recognition (OCR) technology on key-frames and Automatic Speech Recognition (ASR) on lecture audio tracks. The OCR and ASR transcript as well as detected slide text line types are adopted for keyword extraction, by which both video- and segment-level keywords are extracted for content-based video browsing and search. The performance and the effectiveness of proposed indexing functionalities is proven by evaluation.
Keywords :
content-based retrieval; feature extraction; image segmentation; indexing; object detection; optical character recognition; speech recognition; video retrieval; ASR; ASR transcript; OCR technology; OCR transcript; WWW; World Wide Web; automated video indexing; automatic speech recognition; content based lecture video retrieval; e-lecturing; electronic lecturing; indexing functionalities; key-frame detection; keyword extraction; lecture video archives; optical character recognition; segment-level keywords; slide text line types; speech information; textual metadata extraction; video content navigation; video search; video segmentation; video text information; video-level keywords; visual guideline; Image segmentation; Indexing; Optical character recognition software; Semantics; Speech; Video signal processing; Visualization; Lecture videos; automatic video indexing; content-based video search; lecture video archives;
fLanguage :
English
Journal_Title :
Learning Technologies, IEEE Transactions on
Publisher :
ieee
ISSN :
1939-1382
Type :
jour
DOI :
10.1109/TLT.2014.2307305
Filename :
6750040
Link To Document :
بازگشت