DocumentCode :
3414549
Title :
On the Use of Distributed DCT in Speaker Identification
Author :
Sahidullah, Md ; Saha, Goutam
Author_Institution :
Dept. of Electron. & Electr. Commun. Eng., Indian Inst. of Technol. Kharagpur, Kharagpur, India
fYear :
2009
fDate :
18-20 Dec. 2009
Firstpage :
1
Lastpage :
4
Abstract :
Feature extraction is one of the most significant stage in development of a speaker identification (SI) system. Most of the SI systems use mel-frequency cepstral coefficient (MFCC) as a parameter for representing the speech signal into compact form. MFCC are extracted through spectral weighting by a bank of overlapping triangular filters followed by a de-correlation process. Conventionally, discrete cosine transform (DCT-II) is used for de-correlation. In this paper, we propose the usage of a better de-correlation algorithm for MFCC. In traditional method DCT was applied coarsely to all the filterbank energies. In the proposed technique we have incorporated the DCT in a distributed manner. The experimental results on two publicly available database, each consisting more than 130 speakers show that the proposed method improves the performance over baseline MFCC based SI system for various number of filters in the filterbank.
Keywords :
discrete cosine transforms; feature extraction; filtering theory; speaker recognition; SI systems; de-correlation process; distributed discrete cosine transform; feature extraction; mel frequency cepstral coefficient; overlapping triangular filters; speaker identification; speech signal; Cepstral analysis; Discrete cosine transforms; Feature extraction; Filter bank; Loudspeakers; Mel frequency cepstral coefficient; Robustness; Signal processing; Speaker recognition; Speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
India Conference (INDICON), 2009 Annual IEEE
Conference_Location :
Gujarat
Print_ISBN :
978-1-4244-4858-6
Electronic_ISBN :
978-1-4244-4859-3
Type :
conf
DOI :
10.1109/INDCON.2009.5409408
Filename :
5409408
Link To Document :
بازگشت