• DocumentCode
    606409
  • Title

    Classification of infant cries using source, system and supra-segmental features

  • Author

    Singh, A.K. ; Mukhopadhyay, Jayanta ; Rao, K. Sreenivasa

  • Author_Institution
    Sch. of Inf. Technol., Indian Inst. of Technol. Kharagpur, Kharagpur, India
  • fYear
    2013
  • fDate
    28-30 March 2013
  • Firstpage
    58
  • Lastpage
    63
  • Abstract
    In this paper, source, system and supra-segmental features are explored for recognition of infant cry. Different types of infant cries considered in this work are hunger, pain and wet-diaper. In this work, mel-frequency cepstral coefficients (MFCC), residual MFCC (RMFCC), implicit LP residual features, features from modulation spectrum and time domain envelope features are used for representing the infant cry specific information from the acoustic signal. Gaussian Mixture Models (GMM) are used for classifying the above mentioned cries from the features proposed in this work. GMM models are developed separately by using the proposed features. Infant cry database collected under telemedicine project (eNPCS) at IIT-KGP has been used for carrying out this study. The recognition performance of the developed GMM models is observed to be varying significantly based on the features. Results have indicated that, the proposed features have complementary evidences in view of discriminating the infant cries. For enhancing the recognition performance, GMM models developed using various features are combined using score level fusion. The recognition performance using combination of evidences is found to be superior over individual systems.
  • Keywords
    Gaussian processes; acoustic signal processing; cepstral analysis; feature extraction; medical signal processing; modulation spectra; signal classification; Gaussian Mixture Models; IIT-KGP; Infant cry database; RMFCC; acoustic signal; eNPCS; hunger; implicit LP residual features; infant cry classification; infant cry recognition; infant cry specific information; mel-frequency cepstral coefficients; modulation spectrum; pain; recognition performance; residual MFCC; score level fusion; source feature; suprasegmental feature; system feature; telemedicine project; time domain envelope features; wet-diaper; Covariance matrices; Feature extraction; Mel frequency cepstral coefficient; Modulation; Pain; Time-domain analysis; Vectors; Gaussian Mixture Model; Infant cry recognition; Modulation Spectrum; Spectral features; Time domain envelope;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Medical Informatics and Telemedicine (ICMIT), 2013 Indian Conference on
  • Conference_Location
    Kharagpur
  • Print_ISBN
    978-1-4673-5840-8
  • Type

    conf

  • DOI
    10.1109/IndianCMIT.2013.6529409
  • Filename
    6529409