DocumentCode
3286079
Title
A nested infinite Gaussian mixture model for identifying known and unknown audio events
Author
Sasaki, Yutaka ; Yoshii, Kazutomo ; Kagami, Satoshi
Author_Institution
Nat. Inst. of Adv. Ind. Sci. & Technol. (AIST), Tsukuba, Japan
fYear
2013
fDate
3-5 July 2013
Firstpage
1
Lastpage
4
Abstract
This paper presents a novel statistical method that can classify given audio events into known classes or recognize them as an unknown class. We propose a nested infinite Gaussian mixture model (iGMM) to represent varied audio events in real environment. One of the main problems of conventional classification methods is that we need to specify a fixed number of classes in advance. Therefore, all audio events are forced to be classified into known classes. To solve the problem, the proposed method formulates a infinite Gaussian mixture model (iGMM) in which the number of classes are allowed to increase without bound. Another problem is that the complexity of each audio event is different. Then, the nested iGMM using nonparametric Bayesian approach is applied to adjust the needed dimension of each audio model. Experimental results show the effectiveness for these two problems to represent the given audio events.
Keywords
Bayes methods; Gaussian distribution; audio signal processing; statistical analysis; classification methods; iGMM; nested infinite Gaussian mixture model; nonparametric Bayesian approach; statistical method; unknown audio events; Accuracy; Acoustics; Bayes methods; Complexity theory; Human voice; Training; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Image Analysis for Multimedia Interactive Services (WIAMIS), 2013 14th International Workshop on
Conference_Location
Paris
ISSN
2158-5873
Type
conf
DOI
10.1109/WIAMIS.2013.6616152
Filename
6616152
Link To Document