DocumentCode :
3738750
Title :
Effect of plosives on isolated speaker recognition system performance
Author :
Zekeriya ?ent?rk;?zg?l Salor
Author_Institution :
Department of Electronics Engineering, Turkish Military Academy, Ankara, Turkey
fYear :
2015
Firstpage :
1263
Lastpage :
1265
Abstract :
In this paper, the effect of keyword choice including and excluding plosive sounds on isolated speaker recognition system is investigated. In order to perform this study, a Turkish word database has been created consisting of 48 words including plosives and 7 words without plosives. Records are acquired at a sampling frequency of 16 kHz in a professional recording studio, with sound insulation. The records have been acquired during three or four sessions, achieved at different times of the day, for each participant to reflect the sound variability of the human vocal tract on the database. A speaker recognition system employing Mel-Frequency Cepstrum Coefficients (MFCC) for feature extraction and Dynamic Time Warping (DTW) for time equalization has been developed. After the system training stage, average speaker recognition performances for the keywords in the test set including plosives and excluding plosives has been found to be % 98.24 and % 91.76, respectively.
Keywords :
"Speaker recognition","Feature extraction","Filter banks","Mel frequency cepstral coefficient","Hidden Markov models","Databases","Heuristic algorithms"
Publisher :
ieee
Conference_Titel :
Electrical and Electronics Engineering (ELECO), 2015 9th International Conference on
Type :
conf
DOI :
10.1109/ELECO.2015.7394575
Filename :
7394575
Link To Document :
بازگشت