Title : 
Audio segmentation by feature-space clustering using linear discriminant analysis and dynamic programming
         
        
            Author : 
Goodwin, Michael M. ; Laroche, Jean
         
        
            Author_Institution : 
Creative Adv. Technol. Center, Scotts Valley, CA, USA
         
        
        
        
        
        
            Abstract : 
We consider the problem of segmenting an audio signal into characteristic regions based on feature-set similarities. In the proposed method, a feature-space representation of the signal is generated; then, sequences of feature-space samples are aggregated into clusters corresponding to distinct signal regions. The clustering of feature sets is improved via linear discriminant analysis (LDA); dynamic programming (DP) is used to derive optimal cluster boundaries. The method avoids the heuristics employed in various feature-space segmentation schemes and is able to derive an optimal segmentation once the LDA and DP cost metrics have been chosen. We demonstrate that the method outperforms typical feature-space approaches described in the literature. We focus on an illustrative example of the basic segmentation task; however, by judicious design of the feature set, the training set, and the dynamic program, the method can be tailored for various applications such as speech/music discrimination, segmentation of audio streams for smart transport, or song structure analysis for thumbnailing.
         
        
            Keywords : 
audio signal processing; dynamic programming; pattern classification; pattern clustering; signal representation; LDA; audio segmentation; characteristic regions; classification; cluster boundaries; dynamic programming; feature-set clustering; feature-set similarities; feature-space clustering; feature-space samples; linear discriminant analysis; signal representation; smart transport; song structure analysis; speech/music discrimination; thumbnails; training set; Algorithm design and analysis; Clustering algorithms; Cost function; Dynamic programming; Linear discriminant analysis; Robustness; Signal analysis; Signal processing; Speech analysis; Streaming media;
         
        
        
        
            Conference_Titel : 
Applications of Signal Processing to Audio and Acoustics, 2003 IEEE Workshop on.
         
        
            Print_ISBN : 
0-7803-7850-4
         
        
        
            DOI : 
10.1109/ASPAA.2003.1285837