Title : 
ROBUST detection of infant crying in adverse environments using weighted segmental two-dimensional linear frequency cepstral coefficients
         
        
            Author : 
Myung Jong Kim ; Younggwan Kim ; Seungki Hong ; Hoirin Kim
         
        
            Author_Institution : 
Dept. of Electr. Eng., Korea Adv. Inst. of Sci. & Technol. (KAIST), Daejeon, South Korea
         
        
        
        
        
        
            Abstract : 
This paper addresses the problem of automatically detecting infant crying sounds. Infant crying sounds show the distinct and regular time-frequency patterns that include a clear harmonic structure and a unique melody. Therefore, extracting appropriate features to properly represent these characteristics is important in achieving a good performance. In this paper, we propose weighted segment-based two-dimensional linear-frequency cepstral coefficients to characterize the time-frequency patterns within a long-range segment of the target signal. A Gaussian mixture model is adopted to statistically represent the crying and non-crying sounds, and test sounds are classified by using a likelihood ratio test. Evaluation of the proposed feature extraction method on a database of several hundred crying and non-crying sound clips yields an average equal error rate of 4.42% in various noisy environments, showing over 20% relative improvements compared to conventional feature extraction methods.
         
        
            Keywords : 
Gaussian processes; cepstral analysis; feature extraction; signal classification; signal detection; speech processing; Gaussian mixture model; adverse environments; automatic infant crying sound detection; crying sound representation; distinct time-frequency patterns; feature extraction method; likelihood ratio test; long-range target signal segmentation; noncrying sound representation; regular time-frequency patterns; robust infant crying detection; test sound classification; weighted segmental 2D linear frequency cepstral coefficients; Abstracts; Cepstral analysis; Feature extraction; Indexes; Multimedia communication; Robustness; Transforms; Gaussian mixture model; Infant crying detection; likelihood ratio test; weighted segmental two-dimensional linear-frequency cepstral coefficients;
         
        
        
        
            Conference_Titel : 
Multimedia and Expo Workshops (ICMEW), 2013 IEEE International Conference on
         
        
            Conference_Location : 
San Jose, CA
         
        
        
            DOI : 
10.1109/ICMEW.2013.6618321