• DocumentCode
    178235
  • Title

    Speech/music discrimination in a large database of radio broadcasts from the wild

  • Author

    Wieser, Erhard ; Husinsky, Matthias ; Seidl, Martina

  • Author_Institution
    Inst. for Creative Media Technol., Univ. of Appl. Sci. St. Polten, St. Polten, Austria
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    2134
  • Lastpage
    2138
  • Abstract
    This paper describes the development, implementation and evaluation of a speech/music detector. We aim at audio from different sources with different qualities - i.e. audio from ”the wild”. We examine existing approaches for audio classification and select a recent feature. We modify the feature and evaluate the classification accuracy on a random test set of more than 60 hours of audio material against a standard speech/music detection approach. With our approach, we reach a classification accuracy of 96,6%. We provide a performant open source implementation of our detector.
  • Keywords
    audio signal processing; music; radio networks; speech processing; audio classification; audio material; large database; radio broadcasts; speech-music detector; speech-music discrimination; Accuracy; Conferences; Feature extraction; Materials; Speech; Speech processing; Support vector machines; Audio classification; radio broadcast; speech/music discrimination;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6853976
  • Filename
    6853976