• DocumentCode
    3286079
  • Title

    A nested infinite Gaussian mixture model for identifying known and unknown audio events

  • Author

    Sasaki, Yutaka ; Yoshii, Kazutomo ; Kagami, Satoshi

  • Author_Institution
    Nat. Inst. of Adv. Ind. Sci. & Technol. (AIST), Tsukuba, Japan
  • fYear
    2013
  • fDate
    3-5 July 2013
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    This paper presents a novel statistical method that can classify given audio events into known classes or recognize them as an unknown class. We propose a nested infinite Gaussian mixture model (iGMM) to represent varied audio events in real environment. One of the main problems of conventional classification methods is that we need to specify a fixed number of classes in advance. Therefore, all audio events are forced to be classified into known classes. To solve the problem, the proposed method formulates a infinite Gaussian mixture model (iGMM) in which the number of classes are allowed to increase without bound. Another problem is that the complexity of each audio event is different. Then, the nested iGMM using nonparametric Bayesian approach is applied to adjust the needed dimension of each audio model. Experimental results show the effectiveness for these two problems to represent the given audio events.
  • Keywords
    Bayes methods; Gaussian distribution; audio signal processing; statistical analysis; classification methods; iGMM; nested infinite Gaussian mixture model; nonparametric Bayesian approach; statistical method; unknown audio events; Accuracy; Acoustics; Bayes methods; Complexity theory; Human voice; Training; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image Analysis for Multimedia Interactive Services (WIAMIS), 2013 14th International Workshop on
  • Conference_Location
    Paris
  • ISSN
    2158-5873
  • Type

    conf

  • DOI
    10.1109/WIAMIS.2013.6616152
  • Filename
    6616152