DocumentCode :
590884
Title :
Emotion classification of infant cries with consideration for local and global features
Author :
Honda, Kazuhiro ; Kitahara, Kuninori ; Matsunaga, Shinichiro ; Yamashita, Masaru ; Shinohara, K.
Author_Institution :
Nagasaki Univ., Nagasaki, Japan
fYear :
2012
fDate :
3-6 Dec. 2012
Firstpage :
1
Lastpage :
4
Abstract :
In this paper, we propose an approach to the classification of emotion clusters in infant cries with consideration for frame-wise/local acoustic features and global prosodic features. Our proposed approach has two main characteristics as follows. The emotion cluster detection procedure is based on the most likely segment sequence, which delivers the emotion cluster as a classification result. This is obtained based on a maximum likelihood approach using the frame-wise likelihood and the global prosodic likelihood. We exploit the duration ratios of resonant cry segments and silent segments as prosodic features, while the duration ratios are calculated using the derived segment sequence. The second characteristic is the use of pitch information, in addition to conventional power and spectral information, during the modeling of frame-wise acoustic features with hidden Markov models. The classification performance (74.7%) of our proposed approach with added pitch information was better than (71.5%) the classification method using only power and spectral features. The proposed method based on a maximum likelihood approach using both frame-wise and global features also achieved better performance (75.5%).
Keywords :
acoustic signal processing; hidden Markov models; maximum likelihood detection; emotion cluster detection; emotion clusters classification; frame-wise acoustic features; frame-wise likelihood; frame-wise/local acoustic features; global features; global prosodic features; global prosodic likelihood; hidden Markov models; infant cries emotion classification; local features; maximum likelihood; pitch information; resonant cry segments; silent segments; spectral features; spectral information; Acoustics; Feature extraction; Hidden Markov models; Maximum likelihood detection; Pediatrics; Resonant frequency; Sleep;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific
Conference_Location :
Hollywood, CA
Print_ISBN :
978-1-4673-4863-8
Type :
conf
Filename :
6412031
Link To Document :
بازگشت