DocumentCode
606409
Title
Classification of infant cries using source, system and supra-segmental features
Author
Singh, A.K. ; Mukhopadhyay, Jayanta ; Rao, K. Sreenivasa
Author_Institution
Sch. of Inf. Technol., Indian Inst. of Technol. Kharagpur, Kharagpur, India
fYear
2013
fDate
28-30 March 2013
Firstpage
58
Lastpage
63
Abstract
In this paper, source, system and supra-segmental features are explored for recognition of infant cry. Different types of infant cries considered in this work are hunger, pain and wet-diaper. In this work, mel-frequency cepstral coefficients (MFCC), residual MFCC (RMFCC), implicit LP residual features, features from modulation spectrum and time domain envelope features are used for representing the infant cry specific information from the acoustic signal. Gaussian Mixture Models (GMM) are used for classifying the above mentioned cries from the features proposed in this work. GMM models are developed separately by using the proposed features. Infant cry database collected under telemedicine project (eNPCS) at IIT-KGP has been used for carrying out this study. The recognition performance of the developed GMM models is observed to be varying significantly based on the features. Results have indicated that, the proposed features have complementary evidences in view of discriminating the infant cries. For enhancing the recognition performance, GMM models developed using various features are combined using score level fusion. The recognition performance using combination of evidences is found to be superior over individual systems.
Keywords
Gaussian processes; acoustic signal processing; cepstral analysis; feature extraction; medical signal processing; modulation spectra; signal classification; Gaussian Mixture Models; IIT-KGP; Infant cry database; RMFCC; acoustic signal; eNPCS; hunger; implicit LP residual features; infant cry classification; infant cry recognition; infant cry specific information; mel-frequency cepstral coefficients; modulation spectrum; pain; recognition performance; residual MFCC; score level fusion; source feature; suprasegmental feature; system feature; telemedicine project; time domain envelope features; wet-diaper; Covariance matrices; Feature extraction; Mel frequency cepstral coefficient; Modulation; Pain; Time-domain analysis; Vectors; Gaussian Mixture Model; Infant cry recognition; Modulation Spectrum; Spectral features; Time domain envelope;
fLanguage
English
Publisher
ieee
Conference_Titel
Medical Informatics and Telemedicine (ICMIT), 2013 Indian Conference on
Conference_Location
Kharagpur
Print_ISBN
978-1-4673-5840-8
Type
conf
DOI
10.1109/IndianCMIT.2013.6529409
Filename
6529409
Link To Document