DocumentCode :
231548
Title :
Emarati speaker identification
Author :
Shahin, Ismail ; Ba-Hutair, Mohammed Nasser
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Sharjah, Sharjah, United Arab Emirates
fYear :
2014
fDate :
19-23 Oct. 2014
Firstpage :
488
Lastpage :
493
Abstract :
In this work we focus on Emarati speaker identification systems in neutral talking environments based on each of Vector Quantization (VQ), Gaussian Mixture Models (GMMs), and Hidden Markov Models (HMMs) as classifiers. These systems have been tested on our collected Emarati speech database which is composed of 25 male and 25 female Emarati speakers using Mel-Frequency Cepstral Coefficients (MFCCs). Our results yield an average text-dependent Emarati speaker identification performance of 100.00%, %99.81, and 99.69% based on VQ, GMMs, and HMMs, respectively. For text-independent systems, the average Emarati speaker identification performance based on VQ, GMMs, and HMMs is 94.48%, 86.55%, and 74.83%, respectively. The achieved results based on VQ are close to those obtained in subjective assessment by human listeners.
Keywords :
Gaussian processes; hidden Markov models; speaker recognition; visual databases; Emarati speech database; GMM; Gaussian mixture models; HMM; Hidden Markov Models; MFCCs; Mel-frequency cepstral coefficients; VQ; neutral talking environments; text-dependent Emarati speaker identification performance; vector quantization; Databases; Feature extraction; Hidden Markov models; Speaker recognition; Speech; Training; Vectors; Emarati speech database; Gaussian mixture models; hidden Markov models; speaker identification; vector quantization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing (ICSP), 2014 12th International Conference on
Conference_Location :
Hangzhou
ISSN :
2164-5221
Print_ISBN :
978-1-4799-2188-1
Type :
conf
DOI :
10.1109/ICOSP.2014.7015053
Filename :
7015053
Link To Document :
بازگشت