مرکز منطقه ای اطلاع رساني علوم و فناوري - Automatic audio tag classification via semi-supervised canonical density estimation

DocumentCode :

2163102

Title :

Automatic audio tag classification via semi-supervised canonical density estimation

Author :

Takagi, Jun ; Ohishi, Yasunori ; Kimura, Akisato ; Sugiyama, Masashi ; Yamada, Makoto ; Kameoka, Hirokazu

Author_Institution :

Grad. Sch. of Inf. Sci. & Eng., Tokyo Inst. of Technol., Tokyo, Japan

fYear :

2011

fDate :

22-27 May 2011

Firstpage :

2232

Lastpage :

2235

Abstract :

We propose a novel semi-supervised method for building a statistical model that represents the relationship between sounds and text labels ("tags"). The proposed method, named semi-supervised canonical density estimation, makes use of unlabeled sound data in two ways: 1) a low-dimensional latent space representing topics of sounds is extracted by a semi-supervised variant of canonical correlation analysis, and 2) topic models are learned by multi-class extension of semi-supervised kernel density estimation in the topic space. Real-world audio tagging experiments indicate that our pro posed method improves the accuracy even when only a small number of labeled sounds are available.

Keywords :

audio signal processing; correlation methods; statistical analysis; automatic audio tag classification; canonical correlation analysis; low-dimensional latent space; semisupervised canonical density estimation; semisupervised kernel density estimation; statistical model; topic space; Correlation; Data models; Estimation; Feature extraction; Kernel; Principal component analysis; Semantics; Audio tag classification; canonical correlation analysis; kernel density estimation; semi-supervised learning; topic model;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on

Conference_Location :

Prague

ISSN :

1520-6149

Print_ISBN :

978-1-4577-0538-0

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2011.5946925

Filename :

5946925

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2163102