• DocumentCode
    2151151
  • Title

    Audio signal classification with temporal envelopes

  • Author

    Altaf, M. Umair Bin ; Juang, Biing-Hwang

  • Author_Institution
    Sch. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    469
  • Lastpage
    472
  • Abstract
    The conventional approach to audio processing, based on the short-time power spectrum model, is not adequate when it comes to general audio signals. We propose an approach, justified by studies from psycho-acoustics and neuroimaging, which uses the magnitude and frequency envelope of the audio signal in the from of AM-FM modulations to build an ARMA model which is then fed to a GMM to classify into various audio classes. We show that it makes explicit certain aspects of the signal which are overlooked when processing is limited to the spectral domain.
  • Keywords
    Gaussian processes; amplitude modulation; audio signal processing; autoregressive moving average processes; frequency modulation; signal classification; AM-FM modulation; ARMA model; GMM; Gaussian mixture model; audio signal classification; audio signal processing; frequency envelope; magnitude envelope; neuroimaging; psychoacoustics; short-time power spectrum model; temporal envelopes; Autoregressive processes; Brain modeling; Frequency modulation; Mel frequency cepstral coefficient; Psychoacoustic models; Speech; AM-FM Signal Models; Audio Classification; Audio Signal Processing; Temporal Features;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5946442
  • Filename
    5946442