DocumentCode :
3318254
Title :
ROBUST detection of infant crying in adverse environments using weighted segmental two-dimensional linear frequency cepstral coefficients
Author :
Myung Jong Kim ; Younggwan Kim ; Seungki Hong ; Hoirin Kim
Author_Institution :
Dept. of Electr. Eng., Korea Adv. Inst. of Sci. & Technol. (KAIST), Daejeon, South Korea
fYear :
2013
fDate :
15-19 July 2013
Firstpage :
1
Lastpage :
4
Abstract :
This paper addresses the problem of automatically detecting infant crying sounds. Infant crying sounds show the distinct and regular time-frequency patterns that include a clear harmonic structure and a unique melody. Therefore, extracting appropriate features to properly represent these characteristics is important in achieving a good performance. In this paper, we propose weighted segment-based two-dimensional linear-frequency cepstral coefficients to characterize the time-frequency patterns within a long-range segment of the target signal. A Gaussian mixture model is adopted to statistically represent the crying and non-crying sounds, and test sounds are classified by using a likelihood ratio test. Evaluation of the proposed feature extraction method on a database of several hundred crying and non-crying sound clips yields an average equal error rate of 4.42% in various noisy environments, showing over 20% relative improvements compared to conventional feature extraction methods.
Keywords :
Gaussian processes; cepstral analysis; feature extraction; signal classification; signal detection; speech processing; Gaussian mixture model; adverse environments; automatic infant crying sound detection; crying sound representation; distinct time-frequency patterns; feature extraction method; likelihood ratio test; long-range target signal segmentation; noncrying sound representation; regular time-frequency patterns; robust infant crying detection; test sound classification; weighted segmental 2D linear frequency cepstral coefficients; Abstracts; Cepstral analysis; Feature extraction; Indexes; Multimedia communication; Robustness; Transforms; Gaussian mixture model; Infant crying detection; likelihood ratio test; weighted segmental two-dimensional linear-frequency cepstral coefficients;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo Workshops (ICMEW), 2013 IEEE International Conference on
Conference_Location :
San Jose, CA
Type :
conf
DOI :
10.1109/ICMEW.2013.6618321
Filename :
6618321
Link To Document :
بازگشت