Title :
A scanning window scheme based on SVM training error rate for unsupervised audio segmentation
Author :
Sadjadi, Seyed Omid ; Hansen, John H. L.
Author_Institution :
Dept. of Electr. Eng., Univ. of Texas at Dallas, Richardson, TX, USA
Abstract :
Audio segmentation has applications in a variety of contexts, such as automatic broadcast news transcription, audio information retrieval, and as a pre-processing step in automatic speech recognition (ASR). The Support vector machine (SVM), as a binary classifier, is commonly used for supervised audio signal segmentation and classification. In this study, inspired by the idea of scanning window, we present and evaluate an unsupervised audio segmentation approach based on the SVM training error rate. The approach is unsupervised in the sense that it does not require prior knowledge of audio classes. Experimental results indicate that the segmentation technique outperforms traditional Bayesian information criterion (BIC), generalized likelihood ratio (GLR), and Gaussian mixture models (GMM) methods, particularly in detecting audio landmarks of short duration.
Keywords :
audio signal processing; support vector machines; SVM training error rate; audio information retrieval; audio landmarks; automatic broadcast news transcription; automatic speech recognition; binary classifier; scanning window scheme; support vector machine; unsupervised audio segmentation; Error analysis; Hidden Markov models; Kernel; Speech; Support vector machines; Training; Trajectory;
Conference_Titel :
Signal Processing Conference, 2010 18th European
Conference_Location :
Aalborg