DocumentCode
638189
Title
A LDA-based method for automatic tagging of Youtube videos
Author
Morchid, Mohamed ; Linares, Georges
Author_Institution
Lab. d´Inf. d´Avignon, Univ. of Avignon, Avignon, France
fYear
2013
fDate
3-5 July 2013
Firstpage
1
Lastpage
4
Abstract
This article presents a method for automatic tagging of Youtube videos. The proposed method combines an automatic speech recognition (ASR) system, that extracts the spoken contents, and a keyword extraction component that aims at finding a small set of tags representing a video. In order to improve the robustness of the tagging system to the recognition errors, a video transcription is represented in a topic space obtained by a Latent Dirichlet Allocation (LDA), in which each dimension is automatically characterized by a list of weighted terms. Tags are extracted by combining the weighted word list of the best LDA classes. We evaluate this method by employing the user-provided tags of Youtube videos as reference and we investigate the impact of the topic model granularity. The obtained results demonstrate the interest of such model to improve the robustness of the tagging system.
Keywords
multimedia systems; social networking (online); speech recognition; video retrieval; video signal processing; LDA based method; Youtube video; automatic speech recognition system; automatic tagging; keyword extraction; latent Dirichlet allocation; recognition error; spoken content extraction; tagging system; video transcription; weighted term; weighted word list; Acoustics; Robustness; Semantics; Speech; Speech recognition; Tagging; Videos; audio categorization; keyword extraction; speech recognition; structuring multimedia collection;
fLanguage
English
Publisher
ieee
Conference_Titel
Image Analysis for Multimedia Interactive Services (WIAMIS), 2013 14th International Workshop on
Conference_Location
Paris
ISSN
2158-5873
Type
conf
DOI
10.1109/WIAMIS.2013.6616126
Filename
6616126
Link To Document