Title :
A tandem connectionist model using combination of multi-scale spectro-temporal features for acoustic event detection
Author :
Espi, Miquel ; Fujimoto, Masakiyo ; Saito, Daisuke ; Ono, Nobutaka ; Sagayama, Shigeki
Author_Institution :
Grad. Sch. of Information Sci. & Technol., Univ. of Tokyo, Tokyo, Japan
Abstract :
Acoustic event detection systems that support heterogeneous sets of events must characterize events with different acoustic properties (transient, stationary, both, etc.), and these properties can vary even within a single acoustic event. Moreover, managing large feature vectors whose features characterize different properties of the signal is difficult. This paper introduces the use of spectro-temporal fluctuation features in a tandem connectionist approach, modified to generate posterior features separately for each fluctuation scale and then combine the streams before feeding them to a classic GMM-HMM model. The experiments explore scale-wise and event-wise performance, as well as different stream combination methods, and show that the proposed method outperforms the GMM-HMM baseline as well as recent proposals in the acoustic event detection tasks of the CHIL 2007 evaluation campaign.
Keywords :
hidden Markov models; speech recognition; CHIL 2007 evaluation; acoustic event detection; acoustic properties; classic GMM-HMM model; large feature vectors; multiscale spectro-temporal feature; tandem connectionist model; Acoustics; Event detection; Feature extraction; Hidden Markov models; Speech; Speech recognition; Transient analysis; acoustic event detection; multi-stream combination; spectro-temporal fluctuation features; tandem connectionist;
Conference_Title :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
ISSN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288868