• DocumentCode
    2139443
  • Title

    Feature extraction for speech and music discrimination

  • Author

    Zhou, Huiyu ; Sadka, A. ; Jiang, Richard M.

  • Author_Institution
    Brunel Univ., London
  • fYear
    2008
  • fDate
    18-20 June 2008
  • Firstpage
    170
  • Lastpage
    173
  • Abstract
    Driven by the demand of information retrieval, video editing and human-computer interface, in this paper we propose a novel spectral feature for music and speech discrimination. This scheme attempts to simulate a biological model using the averaged cepstrum, where human perception tends to pick up the areas of large cepstral changes. The cepstrum data that is away from the mean value will be exponentially reduced in magnitude. We conduct experiments of music/speech discrimination by comparing the performance of the proposed feature with that of previously proposed features in classification. The dynamic time warping based classification verifies that the proposed feature has the best quality of music/speech classification in the test database.
  • Keywords
    audio signal processing; cepstral analysis; feature extraction; information retrieval; music; signal classification; speech processing; averaged cepstrum; dynamic time warping based classification; feature extraction; human perception; human-computer interface; information retrieval; music discrimination; speech discrimination; video editing; Biological system modeling; Cepstral analysis; Cepstrum; Feature extraction; Mel frequency cepstral coefficient; Multiple signal classification; Music information retrieval; Spatial databases; Speech; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Content-Based Multimedia Indexing, 2008. CBMI 2008. International Workshop on
  • Conference_Location
    London
  • Print_ISBN
    978-1-4244-2043-8
  • Electronic_ISBN
    978-1-4244-2044-5
  • Type

    conf

  • DOI
    10.1109/CBMI.2008.4564943
  • Filename
    4564943