Title :
Acoustic scene analysis based on latent acoustic topic and event allocation
Author :
Imoto, Keisuke ; Ohishi, Yasutake ; Uematsu, Hisashi ; Ohmuro, Hitoshi
Abstract :
We propose a model for analyzing acoustic scenes by using long-term (more than several seconds) acoustic signals based on a probabilistic generative model of an acoustic feature sequence associated with acoustic scenes (e.g. “cooking”) and acoustic events (e.g. “cutting with a knife,” “heating a skillet” or “running water”) called latent acoustic topic and event allocation (LATEA) model. The proposed model allows the analysis of a wide variety of sounds and the capture of abstract acoustic scenes by representing acoustic events and scenes as latent variables, and can also describe the acoustic similarity and variance between acoustic events by representing acoustic features as a mixture of Gaussian components. Experiments with real-life sounds indicated that the proposed model exhibited lower perplexity than conventional models; it improved the stability of acoustic scene estimation. The experimental results also suggested that the proposed model can better describe the acoustic similarity and variance between acoustic events than conventional models.
Keywords :
Gaussian processes; acoustic signal detection; Gaussian components; LATEA model; acoustic feature sequence; acoustic features; acoustic scene analysis; acoustic scene estimation; acoustic similarity; cooking; cutting; event allocation; heating; latent acoustic topic and event allocation (LATEA) model; latent variables; long-term acoustic signals; probabilistic generative model; real-life sounds; running water; skillet; Acoustics; Analytical models; Estimation; Hidden Markov models; Indexes; Probabilistic logic; Resource management; Acoustic event detection (AED); acoustic scene analysis; probabilistic generative model;
Conference_Titel :
Machine Learning for Signal Processing (MLSP), 2013 IEEE International Workshop on
Conference_Location :
Southampton
DOI :
10.1109/MLSP.2013.6661957