DocumentCode :
729747
Title :
A visual analysis on recognizability and discriminability of onomatopoeia words with DCNN features
Author :
Shimoda, Wataru ; Yanai, Keiji
Author_Institution :
Dept. of Inf., Univ. of Electro-Commun., Tokyo, Japan
fYear :
2015
fDate :
June 29 2015-July 3 2015
Firstpage :
1
Lastpage :
6
Abstract :
In this paper, we examine the relation between onomatopoeia and images using a large number of Web images. The objective of this paper is to examine if the images corresponding to Japanese onomatopoeia words which express the feeling of visual appearance can be recognized by the state-of-the-art visual recognition methods. In our work, first, we collect the images corresponding to onomatopoeia words using an Web image search engine, and then we filter out noise images to obtain clean dataset with automatic image re-ranking method. Next, we analyze the recognizability of various kinds of onomatopoeia images using improved Fisher vector (IFV) and deep convolutional neural network (DCNN) features. In addition, we collect images corresponding to the pairs of nouns and onomatopoeia words, and we examine if the images associated with the same nouns and the different onomatopoeia words are visually discriminable or not. By the experiments, it has been shown that the DCNN features extracted from the layer 7 of Overfeat´s network pre-trained with the ILSVRC 2013 data have prominent ability to represent onomatopoeia images, and most of the onomatopoeia words have visual characteristics which can be recognized.
Keywords :
feature extraction; image denoising; image filtering; image recognition; image representation; natural language processing; neural nets; support vector machines; DCNN feature extraction; IFV; ILSVRC 2013 data; Japanese onomatopoeia words; Overfeat network; Web image search engine; deep convolutional neural network; discriminability; image noise filtering; image reranking method; improved Fisher vector; linear SVM; noun pair; onomatopoeia image representation; recognizability; visual analysis; visual appearance; visual characteristics; visual recognition method; Compounds; Feature extraction; Image recognition; Neural networks; Noise; Support vector machines; Visualization; DCNN features; Web images; onomatopoeia;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo (ICME), 2015 IEEE International Conference on
Conference_Location :
Turin
Type :
conf
DOI :
10.1109/ICME.2015.7177453
Filename :
7177453
Link To Document :
بازگشت