DocumentCode
700308
Title
Boosting speaker identification performance using a frame level based algorithm
Author
Djemili, Rafik ; Amara Korba, M.C. ; Bourouba, Hocine ; O´Shaughnessy, Douglas
Author_Institution
Electr. Eng. Dept., Univ. du 20 Aout 1955, Skikda, Algeria
fYear
2015
fDate
17-19 Feb. 2015
Firstpage
1
Lastpage
6
Abstract
In this paper, we propose an algorithm to improve the performance of speaker identification systems. A baseline speaker identification system uses a scoring of a test utterance against all speakers´ models; this could be termed as an evaluation at the observation level. In the proposed approach, and prior to the standard evaluation phase, an algorithm based on a frame level evaluation is applied. The speaker identification study is conducted using IVIE corpus and a randomly selected 120 speakers from TIMIT. Mel-frequency cepstral coefficients (MFCC) and Gaussian mixture model (GMM) are the main components in state of the art speaker identification systems and will be adopted in this work. Experimental results based on several systems with different training and testing conditions, showed that our proposed algorithm yielded to relative reduction in error rates of 24.4 and 37.3% over the baseline systems respectively for IVIE and TIMIT. The final performances reached measured by identification error rates are 3.4% and 5.2% for IVIE and TIMIT corpuses.
Keywords
Gaussian processes; cepstral analysis; mixture models; speaker recognition; Gaussian mixture model; IVIE corpus; MFCC; Mel-frequency cepstral coefficients; frame level based algorithm; speaker identification performance; Boosting; Gaussian mixture model; Identification Error Rate; Mel-frequency cepstral coefficient; Speaker identification;
fLanguage
English
Publisher
ieee
Conference_Titel
Communications, Signal Processing, and their Applications (ICCSPA), 2015 International Conference on
Conference_Location
Sharjah
Type
conf
DOI
10.1109/ICCSPA.2015.7081273
Filename
7081273
Link To Document