• DocumentCode
    590884
  • Title

    Emotion classification of infant cries with consideration for local and global features

  • Author

    Honda, Kazuhiro ; Kitahara, Kuninori ; Matsunaga, Shinichiro ; Yamashita, Masaru ; Shinohara, K.

  • Author_Institution
    Nagasaki Univ., Nagasaki, Japan
  • fYear
    2012
  • fDate
    3-6 Dec. 2012
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    In this paper, we propose an approach to the classification of emotion clusters in infant cries with consideration for frame-wise/local acoustic features and global prosodic features. Our proposed approach has two main characteristics as follows. The emotion cluster detection procedure is based on the most likely segment sequence, which delivers the emotion cluster as a classification result. This is obtained based on a maximum likelihood approach using the frame-wise likelihood and the global prosodic likelihood. We exploit the duration ratios of resonant cry segments and silent segments as prosodic features, while the duration ratios are calculated using the derived segment sequence. The second characteristic is the use of pitch information, in addition to conventional power and spectral information, during the modeling of frame-wise acoustic features with hidden Markov models. The classification performance (74.7%) of our proposed approach with added pitch information was better than (71.5%) the classification method using only power and spectral features. The proposed method based on a maximum likelihood approach using both frame-wise and global features also achieved better performance (75.5%).
  • Keywords
    acoustic signal processing; hidden Markov models; maximum likelihood detection; emotion cluster detection; emotion clusters classification; frame-wise acoustic features; frame-wise likelihood; frame-wise/local acoustic features; global features; global prosodic features; global prosodic likelihood; hidden Markov models; infant cries emotion classification; local features; maximum likelihood; pitch information; resonant cry segments; silent segments; spectral features; spectral information; Acoustics; Feature extraction; Hidden Markov models; Maximum likelihood detection; Pediatrics; Resonant frequency; Sleep;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific
  • Conference_Location
    Hollywood, CA
  • Print_ISBN
    978-1-4673-4863-8
  • Type

    conf

  • Filename
    6412031