• DocumentCode
    23297
  • Title

    A Comparison of Spectro-Temporal Representations of Audio Signals

  • Author

    van Hengel, P.W.J. ; Krijnders, J.D.

  • Author_Institution
    Cognitive Syst. Group, INCAS (Innovation Center for Adv. Sensors & Sensor Syst.), Assen, Netherlands
  • Volume
    22
  • Issue
    2
  • fYear
    2014
  • fDate
    Feb. 2014
  • Firstpage
    303
  • Lastpage
    313
  • Abstract
    This article compares methods for the conversion of timeseries into a spectro-temporal representation. These methods are designed based on a resemblance with the auditory processing of sound in the mammalian inner ear, or on mathematical principles related to, for example, Fourier analysis. This study provides a comparison between several of these methods. Two tests were devised for this comparison: one based on susceptibility to noise and one on the expression of spectro-temporal detail. These two aspects were considered of importance for real world applications. While some methods produced good results on one of the two tests, others produced good results on both. Overall the transmission line model using an impedance function suggested by Zweig (“Finding the impedance of the organ of Corti,” J. Acoust. Soc. Amer., vol. 89, no. 3, pp. 1229-1254, 1991) provided the best results, though not significantly. Also a larger computational load may hinder application in some domains. The gammatone filterbank and straightforward spectrogram provide good alternatives with less computational load. The introduction of nonlinearity was shown to deteriorate performance on both tests, in both the filterbank and in the transmission line model.
  • Keywords
    acoustic signal processing; audio signal processing; channel bank filters; signal representation; spectral analysis; Fourier analysis; audio signal; auditory processing; gammatone filterbank; impedance function; mammalian inner ear; noise susceptibility; spectro-temporal representation; spectrogram; time-series conversion method; transmission line model; Computational modeling; Mathematical model; Signal to noise ratio; Spectrogram; Speech; Time-frequency analysis; Spectral analysis; acoustic signal detection; acoustic signal processing; signal mapping; spectrogram;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    2329-9290
  • Type

    jour

  • DOI
    10.1109/TASL.2013.2283105
  • Filename
    6607173