• DocumentCode
    1863569
  • Title

    A fusion study in speech / music classification

  • Author

    Pinquier, Julien ; Rouas, Jean-Luc ; André-Obrecht, Régine

  • Author_Institution
    Inst. de Recherche en Informatique de Toulouse, CNRS, Toulouse, France
  • Volume
    1
  • fYear
    2003
  • fDate
    6-9 July 2003
  • Abstract
    In this paper, we present and merge two speech / music classification approaches of that we have developed. The first one is a differentiated modeling approach based on a spectral analysis, which is implemented with GMM. The other one is based on three original features: entropy modulation, stationary segment duration and number of segments. They are merged with the classical 4 Hertz modulation energy. Our classification system is a fusion of the two approaches. It is divided in two classifications (speech/non-speech and music/non-music) and provides 94 % of accuracy for speech detection and 90 % for music detection, with one second of input signal. Beside the spectral information and GMM, classically used in speech / music discrimination, simple parameters bring complementary and efficient information.
  • Keywords
    music; signal classification; spectral analysis; speech processing; entropy modulation; modulation energy; music classification; music detection; spectral analysis; speech classification; speech detection; stationary segment duration; Acoustic signal detection; Cepstral analysis; Data mining; Entropy; Indexing; Multiple signal classification; Music; Signal processing algorithms; Spectral analysis; Speech processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on
  • Print_ISBN
    0-7803-7965-9
  • Type

    conf

  • DOI
    10.1109/ICME.2003.1220941
  • Filename
    1220941