DocumentCode
705223
Title
A scanning window scheme based on SVM training error rate for unsupervised audio segmentation
Author
Sadjadi, Seyed Omid ; Hansen, John H. L.
Author_Institution
Dept. of Electr. Eng., Univ. of Texas at Dallas, Richardson, TX, USA
fYear
2010
fDate
23-27 Aug. 2010
Firstpage
1262
Lastpage
1266
Abstract
Audio segmentation has applications in a variety of contexts, such as automatic broadcast news transcription, audio information retrieval, and as a pre-processing step in automatic speech recognition (ASR). The Support vector machine (SVM), as a binary classifier, is commonly used for supervised audio signal segmentation and classification. In this study, inspired by the idea of scanning window, we present and evaluate an unsupervised audio segmentation approach based on the SVM training error rate. The approach is unsupervised in the sense that it does not require prior knowledge of audio classes. Experimental results indicate that the segmentation technique outperforms traditional Bayesian information criterion (BIC), generalized likelihood ratio (GLR), and Gaussian mixture models (GMM) methods, particularly in detecting audio landmarks of short duration.
Keywords
audio signal processing; support vector machines; SVM training error rate; audio information retrieval; audio landmarks; automatic broadcast news transcription; automatic speech recognition; binary classifier; scanning window scheme; support vector machine; unsupervised audio segmentation; Error analysis; Hidden Markov models; Kernel; Speech; Support vector machines; Training; Trajectory;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2010 18th European
Conference_Location
Aalborg
ISSN
2219-5491
Type
conf
Filename
7096496
Link To Document