Boosting speaker identification performance using a frame level based algorithm

Author

Djemili, Rafik ; Amara Korba, M.C. ; Bourouba, Hocine ; O´Shaughnessy, Douglas

Author_Institution

Electr. Eng. Dept., Univ. du 20 Aout 1955, Skikda, Algeria

fYear

2015

fDate

17-19 Feb. 2015

Firstpage

1

Lastpage

6

Abstract

In this paper, we propose an algorithm to improve the performance of speaker identification systems. A baseline speaker identification system uses a scoring of a test utterance against all speakers´ models; this could be termed as an evaluation at the observation level. In the proposed approach, and prior to the standard evaluation phase, an algorithm based on a frame level evaluation is applied. The speaker identification study is conducted using IVIE corpus and a randomly selected 120 speakers from TIMIT. Mel-frequency cepstral coefficients (MFCC) and Gaussian mixture model (GMM) are the main components in state of the art speaker identification systems and will be adopted in this work. Experimental results based on several systems with different training and testing conditions, showed that our proposed algorithm yielded to relative reduction in error rates of 24.4 and 37.3% over the baseline systems respectively for IVIE and TIMIT. The final performances reached measured by identification error rates are 3.4% and 5.2% for IVIE and TIMIT corpuses.

Keywords

Gaussian processes; cepstral analysis; mixture models; speaker recognition; Gaussian mixture model; IVIE corpus; MFCC; Mel-frequency cepstral coefficients; frame level based algorithm; speaker identification performance; Boosting; Gaussian mixture model; Identification Error Rate; Mel-frequency cepstral coefficient; Speaker identification;

fLanguage

English

Publisher

ieee

Conference_Titel

Communications, Signal Processing, and their Applications (ICCSPA), 2015 International Conference on

Conference_Location

Sharjah

Type

conf

DOI

10.1109/ICCSPA.2015.7081273

Filename

7081273