Title : 
Multi-scale modulation filtering in automatic detection of emotions in telephone speech
         
        
            Author : 
Pohjalainen, Jouni ; Alku, Paavo
         
        
            Author_Institution : 
Dept. of Signal Process. & Acoust., Aalto Univ., Espoo, Finland
         
        
        
        
        
        
            Abstract : 
This study investigates emotion detection from noise-corrupted telephone speech. A generic modulation filtering approach for audio pattern recognition is proposed that utilizes inherent long-term properties of acoustic features in different classes. When applied to binary classification along the activation and valence dimensions, filtering the baseline short-time timbral features in both the training and detection phase leads to significant improvement especially in noise robustness. Automatic selection of training data based on the filter´s prediction residual further improves the results.
         
        
            Keywords : 
emotion recognition; filtering theory; signal classification; speech recognition; acoustic features; activation dimensions; audio pattern recognition; automatic emotion detection; baseline short-time timbral features; binary classification; generic modulation filtering; multiscale modulation filtering; noise robustness; noise-corrupted telephone speech; valence dimensions; Acoustics; Feature extraction; Modulation; Noise; Speech; Speech processing; Speech recognition; computational paralinguistics; emotion detection; speech analysis;
         
        
        
        
            Conference_Titel : 
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
         
        
            Conference_Location : 
Florence
         
        
        
            DOI : 
10.1109/ICASSP.2014.6853743