Title :
Emotional Speech Clustering Based Robust Speaker Recognition System
Author :
Li, Dongdong ; Yang, Yingchun
Author_Institution :
Coll. of Inf. Sci. & Eng., East China Univ. of Sci. & Technol., Shanghai, China
Abstract :
Speech with various emotions aggravates the performance of speaker recognition system. The existing speaker modeling disregards the match of the emotional state between training and testing speech, and the systems suffer the lapsus of the emotion recognition as to practical application. We propose an alternative approach that exploits the prosodic difference to cluster affective speech, and then builds corresponding models with the clustered speech for a given speaker. The aim is to match the test utterances with one of the clustered speaker models and utilize the limited affective speech effectively. The method is evaluated with the Mandarin Affective Speech Corpus. Experimental results show that the proposed approach achieves a relative improvement of at least 19% over the traditional speaker recognition task. We also show that such approach are more robust to communication the emotional affects than the other speaker recognition systems.
Keywords :
emotion recognition; image matching; learning (artificial intelligence); pattern clustering; speaker recognition; clustered speaker model; emotional speech clustering; robust speaker recognition system; Computer science; Educational institutions; Emotion recognition; Humans; Information science; Robustness; Speaker recognition; Speech analysis; Speech recognition; System testing;
Conference_Titel :
Image and Signal Processing, 2009. CISP '09. 2nd International Congress on
Conference_Location :
Tianjin
Print_ISBN :
978-1-4244-4129-7
Electronic_ISBN :
978-1-4244-4131-0
DOI :
10.1109/CISP.2009.5304327