DocumentCode :
2452314
Title :
Features for phoneme independent speaker identification
Author :
Wang, Jianglin ; Ji, An ; Johnson, Michael T.
Author_Institution :
Dept. of Electr. & Comput. Eng., Marquette Univ., Milwaukee, WI, USA
fYear :
2012
fDate :
16-18 July 2012
Firstpage :
1141
Lastpage :
1145
Abstract :
This paper describes a unique cross-phoneme speaker identification experiment, using deliberately mismatched phoneme sets for training and testing. The underlying goal is to identify features that represent broad individually unique characteristics rather than those that represent phonetic differences, as are more typical of modern speaker identification and verification systems. A wide range of features are proposed and evaluated within this context using a Gaussian Mixture Model framework. The results show that log-area ratio has better phonetic independence than MFCCs, that residual phase carries substantial speaker information, and identifies several other features that also have usefulness for speaker identification independent of phonetic content.
Keywords :
Gaussian processes; speaker recognition; Gaussian mixture model framework; cross phoneme speaker identification; phoneme independent speaker identification; phonetic content; phonetic differences; verification systems; Accuracy; Harmonic analysis; Jitter; Speaker recognition; Speech; Testing; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Audio, Language and Image Processing (ICALIP), 2012 International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4673-0173-2
Type :
conf
DOI :
10.1109/ICALIP.2012.6376788
Filename :
6376788
Link To Document :
بازگشت