• DocumentCode
    705223
  • Title

    A scanning window scheme based on SVM training error rate for unsupervised audio segmentation

  • Author

    Sadjadi, Seyed Omid ; Hansen, John H. L.

  • Author_Institution
    Dept. of Electr. Eng., Univ. of Texas at Dallas, Richardson, TX, USA
  • fYear
    2010
  • fDate
    23-27 Aug. 2010
  • Firstpage
    1262
  • Lastpage
    1266
  • Abstract
    Audio segmentation has applications in a variety of contexts, such as automatic broadcast news transcription, audio information retrieval, and as a pre-processing step in automatic speech recognition (ASR). The Support vector machine (SVM), as a binary classifier, is commonly used for supervised audio signal segmentation and classification. In this study, inspired by the idea of scanning window, we present and evaluate an unsupervised audio segmentation approach based on the SVM training error rate. The approach is unsupervised in the sense that it does not require prior knowledge of audio classes. Experimental results indicate that the segmentation technique outperforms traditional Bayesian information criterion (BIC), generalized likelihood ratio (GLR), and Gaussian mixture models (GMM) methods, particularly in detecting audio landmarks of short duration.
  • Keywords
    audio signal processing; support vector machines; SVM training error rate; audio information retrieval; audio landmarks; automatic broadcast news transcription; automatic speech recognition; binary classifier; scanning window scheme; support vector machine; unsupervised audio segmentation; Error analysis; Hidden Markov models; Kernel; Speech; Support vector machines; Training; Trajectory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2010 18th European
  • Conference_Location
    Aalborg
  • ISSN
    2219-5491
  • Type

    conf

  • Filename
    7096496