DocumentCode :
2323411
Title :
Fractional Fourier transform based auditory feature for language identification
Author :
Zhang, Wei-Qiang ; He, Liang ; Hou, Tao ; Liu, Jia
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing
fYear :
2008
fDate :
Nov. 30 2008-Dec. 3 2008
Firstpage :
209
Lastpage :
212
Abstract :
In this paper, a novel auditory feature based on fractional Fourier transform (FRFT), namely, fractional auditory cepstrum coefficient (FACC), is presented for language identification (LID). Different from the widely used Mel-frequency cepstrum coefficient (MFCC), the proposed feature utilizes the human auditory model and performs Gammatone filtering for the short-time fractional spectrum of the speech. Experimental results on NIST 2003 Language Recognition Evaluation (LRE03) show that the FACC feature decreases the equal error rate (EER) of 10.5% relatively when compared with the MFCC feature.
Keywords :
Fourier transforms; filtering theory; speech processing; Gammatone filtering; Mel-frequency cepstrum coefficient; NIST 2003 Language Recognition Evaluation; equal error rate; fractional Fourier transform; fractional auditory cepstrum coefficient; human auditory model; language identification; short-time fractional spectrum; Band pass filters; Bandwidth; Cepstrum; Filter bank; Fourier transforms; Frequency domain analysis; Mel frequency cepstral coefficient; Natural languages; Signal processing; Speech processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Circuits and Systems, 2008. APCCAS 2008. IEEE Asia Pacific Conference on
Conference_Location :
Macao
Print_ISBN :
978-1-4244-2341-5
Electronic_ISBN :
978-1-4244-2342-2
Type :
conf
DOI :
10.1109/APCCAS.2008.4745997
Filename :
4745997
Link To Document :
بازگشت