DocumentCode
2151151
Title
Audio signal classification with temporal envelopes
Author
Altaf, M. Umair Bin ; Juang, Biing-Hwang
Author_Institution
Sch. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
fYear
2011
fDate
22-27 May 2011
Firstpage
469
Lastpage
472
Abstract
The conventional approach to audio processing, based on the short-time power spectrum model, is not adequate when it comes to general audio signals. We propose an approach, justified by studies from psycho-acoustics and neuroimaging, which uses the magnitude and frequency envelope of the audio signal in the from of AM-FM modulations to build an ARMA model which is then fed to a GMM to classify into various audio classes. We show that it makes explicit certain aspects of the signal which are overlooked when processing is limited to the spectral domain.
Keywords
Gaussian processes; amplitude modulation; audio signal processing; autoregressive moving average processes; frequency modulation; signal classification; AM-FM modulation; ARMA model; GMM; Gaussian mixture model; audio signal classification; audio signal processing; frequency envelope; magnitude envelope; neuroimaging; psychoacoustics; short-time power spectrum model; temporal envelopes; Autoregressive processes; Brain modeling; Frequency modulation; Mel frequency cepstral coefficient; Psychoacoustic models; Speech; AM-FM Signal Models; Audio Classification; Audio Signal Processing; Temporal Features;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5946442
Filename
5946442
Link To Document