Title : 
Construction and evaluation of a robust multifeature speech/music discriminator
         
        
            Author : 
Scheirer, Eric ; Slaney, Malcoh
         
        
            Author_Institution : 
Interval Res. Corp., Palo Alto, CA, USA
         
        
        
        
        
        
            Abstract : 
We report on the construction of a real-time computer system capable of distinguishing speech signals from music signals over a wide range of digital audio input. We have examined 13 features intended to measure conceptually distinct properties of speech and/or music signals, and combined them in several multidimensional classification frameworks. We provide extensive data on system performance and the cross-validated training/test setup used to evaluate the system. For the datasets currently in use, the best classifier classifies with 5.8% error on a frame-by-frame basis, and 1.4% error when integrating long (2.4 second) segments of sound
         
        
            Keywords : 
audio signals; digital communication; feature extraction; music; real-time systems; speech processing; cross validated training/test setup; digital audio input; features; multidimensional classification; music signals; real-time computer system; robust multifeature speech/music discriminator; sound segments; speech signals; system performance; Automatic speech recognition; Band pass filters; Energy measurement; Milling machines; Multimedia systems; Multiple signal classification; Real time systems; Robustness; Speech analysis; System performance;
         
        
        
        
            Conference_Titel : 
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
         
        
            Conference_Location : 
Munich
         
        
        
            Print_ISBN : 
0-8186-7919-0
         
        
        
            DOI : 
10.1109/ICASSP.1997.596192