Regional Information Center for Science and Technology - Multimodal city-verification on flickr videos using acoustic and textual features

DocumentCode :

3153192

Title :

Multimodal city-verification on flickr videos using acoustic and textual features

Author :

Lei, Howard ; Choi, Jaeyoung ; Friedland, Gerald

Author_Institution :

Int. Comput. Sci. Inst., Berkeley, CA, USA

fYear :

2012

fDate :

25-30 March 2012

Firstpage :

2273

Lastpage :

2276

Abstract :

We have performed city-verification of videos based on the videos' audio and metadata, using videos from the MediaEval Placing Task's video set, which contains consumer-produced videos “from-the-wild”. 18 cities were used as targets, for which acoustic and language models were trained, and against which test videos were scored. We have obtained the first known results for the city verification task, with an EER minimum of 21.8%, suggesting that ~80% of test videos, when tested against a correct target city, were identified as belonging to that city. This result is well above chance, even as the videos contained very few city-specific audio and metadata features. We have also demonstrated the complementarity of audio and metadata for this task.

Keywords :

acoustic signal processing; meta data; video signal processing; Flickr video; MediaEval placing task video set; acoustic feature; acoustic model training; from-the-wild video; language model training; multimodal city verification; textual feature; video audio; video metadata; Adaptation models; Cities and towns; Humans; Mel frequency cepstral coefficient; Testing; Training; Videos; City verification; N-gram language models; acoustic models; multimodal processing;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on

Conference_Location :

Kyoto

ISSN :

1520-6149

Print_ISBN :

978-1-4673-0045-2

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2012.6288367

Filename :

6288367

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3153192