• DocumentCode
    302973
  • Title

    Application of loudness/pitch/timbre decomposition operators to auditory scene analysis

  • Author

    Abe, Mototsugu ; Ando, Shigeru

  • Author_Institution
    Dept. of Math. Eng. & Inf. Phys., Tokyo Univ., Japan
  • Volume
    5
  • fYear
    1996
  • fDate
    7-10 May 1996
  • Firstpage
    2646
  • Abstract
    Abe and Ando (see Proc. IEEE ICASSP95, p.1368-71, 1995) proposed nonlinear operators which decompose a changing energy of sound in the wavelet domain into three orthogonal components: i.e., loudness and pitch as coherent changes, and timbre as an incoherent change. They showed that they could detect the discontinuity of a single sound stream with excellent temporal resolution and sensitivity. In this paper, they extend the coherency principle so that it can describe and pursue the individual coherency of non-overlapping sound streams in the wavelet domain. It is realized by Parzen´s non-parametric estimates and Kalman filtering of the loudness change rate and the pitch shift rate. Using this method, they show some experiments for the extraction of the most salient stream from multiple sound streams
  • Keywords
    Kalman filters; acoustic signal processing; coherence; filtering theory; hearing; loudness; nonparametric statistics; signal resolution; time-frequency analysis; wavelet transforms; Kalman filtering; auditory scene analysts; coherency principle; coherent changes; experiments; incoherent change; loudness change rate; loudness decomposition operator; multiple sound streams; nonlinear operators; nonoverlapping sound streams; nonparametric estimates; orthogonal components; pitch decomposition operator; pitch shift rate; sensitivity; sound stream discontinuity detection; temporal resolution; timbre decomposition operator; time-frequency gradient space; wavelet domain; Acoustical engineering; Filtering; Image analysis; Kalman filters; Physics; Power engineering and energy; Timbre; Time frequency analysis; Wavelet analysis; Wavelet domain;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-3192-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1996.548008
  • Filename
    548008