DocumentCode
3162947
Title
A tandem connectionist model using combination of multi-scale spectro-temporal features for acoustic event detection
Author
Espi, Miquel ; Fujimoto, Masakiyo ; Saito, Daisuke ; Ono, Nobutaka ; Sagayama, Shigeki
Author_Institution
Grad. Sch. of Informations Sci. & Technol., Univ. of Tokyo, Tokyo, Japan
fYear
2012
fDate
25-30 March 2012
Firstpage
4293
Lastpage
4296
Abstract
Acoustic event detection systems supporting heterogeneous sets of events face the problem of having to characterize them when they have different acoustic properties (transient, stationary, both, etc.), observing this fact even within the acoustic event itself. Moreover, managing large feature vectors with features characterizing different properties of the signal is always difficult. This paper introduces the usage of spectro-temporal fluctuation features in a tandem connectionist approach, modified to generate posterior features separately for each fluctuation scale and then combine the streams to be fed to a classic GMM-HMM model. The experiments explore scale and event wise performance, as well as different stream combination methods, and show that the proposed method outperforms the GMM-HMM baseline as well as recent proposals in the CHIL 2007 evaluation campaign´s related acoustic event detection tasks.
Keywords
hidden Markov models; speech recognition; CHIL 2007 evaluation; acoustic event detection; acoustic properties; classic GMM-HMM model; large feature vectors; multiscale spectro-temporal feature; tandem connectionist model; Acoustics; Event detection; Feature extraction; Hidden Markov models; Speech; Speech recognition; Transient analysis; acoustic event detection; multi-stream combination; spectro-temporal fluctuation features; tandem connectionist;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location
Kyoto
ISSN
1520-6149
Print_ISBN
978-1-4673-0045-2
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2012.6288868
Filename
6288868
Link To Document