DocumentCode
3342256
Title
A dynamic programming approach to audio segmentation and speech/music discrimination
Author
Goodwin, Michael M. ; Laroche, Jean
Author_Institution
Creative Adv. Technol. Center, Scotts Valley, CA, USA
Volume
4
fYear
2004
fDate
17-21 May 2004
Abstract
We consider the problem of segmenting an audio signal into characteristic regions based on feature-set similarities. In the proposed approach, a feature-space representation of the signal is generated; sequences of these feature-space samples are then aggregated into clusters corresponding to distinct signal regions. The algorithm consists of using linear discriminant analysis (LDA) to condition the feature space and dynamic programming (DP) to identify data clusters. We consider the design of the dynamic program cost functions; we are able to derive effective cost functions without relying on significant prior information about the structure of the expected data clusters. We demonstrate the application of the LDA-DP segmentation algorithm to speech/music discrimination. Experimental results are given and discussed.
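The abstract names the DP step but not its cost functions, so the following is only a minimal sketch of how a dynamic program can group a feature sequence into contiguous regions. Everything specific here is an assumption and not taken from the paper: the function name `dp_segment`, the use of each frame's own feature vector as a candidate state, the squared Euclidean local cost, and the constant `switch_penalty` transition cost. The features are assumed to be already conditioned (for example by LDA), which the sketch does not perform.

```python
import numpy as np

def dp_segment(features, switch_penalty=4.0):
    """Viterbi-style DP over candidate states (illustrative, not the paper's costs).

    features: array-like of shape (n_frames, n_dims).
    Assumptions: each frame's feature vector is a candidate state, the local
    cost is squared Euclidean distance, and any change of state between
    consecutive frames costs a constant `switch_penalty`.
    Returns (path, boundaries): the chosen state index per frame and the
    frame indices at which the state changes.
    """
    x = np.asarray(features, dtype=float)
    n = len(x)
    # local[t, s]: cost of explaining frame t with candidate state s
    local = ((x[:, None, :] - x[None, :, :]) ** 2).sum(axis=2)
    cost = local[0].copy()
    back = np.zeros((n, n), dtype=int)
    states = np.arange(n)
    for t in range(1, n):
        # With a constant penalty, the best predecessor of state s is either
        # s itself (stay) or the globally cheapest state (jump).
        best_prev = int(np.argmin(cost))
        jump = cost[best_prev] + switch_penalty
        use_jump = jump < cost
        back[t] = np.where(use_jump, best_prev, states)
        cost = np.where(use_jump, jump, cost) + local[t]
    # Backtrack from the cheapest final state
    path = np.empty(n, dtype=int)
    path[-1] = int(np.argmin(cost))
    for t in range(n - 1, 0, -1):
        path[t - 1] = back[t, path[t]]
    boundaries = [t for t in range(1, n) if path[t] != path[t - 1]]
    return path, boundaries

# Toy input: five frames near the origin followed by five frames near (5, 5)
frames = [[0, 0], [0.1, 0], [0, 0.1], [0.1, 0.1], [0, 0],
          [5, 5], [5.1, 5], [5, 5.1], [5.1, 5.1], [5, 5]]
path, boundaries = dp_segment(frames)
```

In this sketch a larger `switch_penalty` favors fewer, longer regions, while a smaller one lets the path follow the features more closely. How the authors actually balance these costs is described in the paper itself, not in this record.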
Keywords
audio signal processing; dynamic programming; music; speech; speech processing; audio segmentation; audio signal segmentation; data clusters; dynamic program cost functions; feature-space representation; linear discriminant analysis; signal representation; speech/music discrimination; Clustering algorithms; Cost function; Covariance matrix; Fingerprint recognition; Multiple signal classification; Robustness; Signal generators
fLanguage
English
Publisher
IEEE
Conference_Title
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1326825
Filename
1326825