DocumentCode :
2423068
Title :
Auditory features with vocal track length normalization for language identification
Author :
Zhang, Weiqiang ; Liu, Jia ; He, Liang
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing
fYear :
2008
fDate :
7-9 July 2008
Firstpage :
66
Lastpage :
70
Abstract :
This paper reports on a novel feature, auditory cepstrum coefficient (ACC) with vocal tract length normalization (VTLN), for language identification (LID). The ACC feature is based on the auditory characteristics of human ear and the VTLN technology compensates the speaker variability. The detailed implementation of ACC feature with VTLN in frequency domain is given. Experimental results show that the proposed auditory feature outperforms its widely used Mel-frequency cepstrum coefficient (MFCC) counterpart and is more effective when combined with VTLN.
Keywords :
cepstral analysis; feature extraction; frequency-domain analysis; speech recognition; Mel-frequency cepstrum coefficient; auditory cepstrum coefficient feature extraction; frequency domain analysis; language identification; speaker variability compensation; speech recognition; vocal track length normalization; Band pass filters; Bandwidth; Biomembranes; Cepstrum; Ear; Frequency domain analysis; Helium; Humans; Mel frequency cepstral coefficient; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Audio, Language and Image Processing, 2008. ICALIP 2008. International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-1723-0
Electronic_ISBN :
978-1-4244-1724-7
Type :
conf
DOI :
10.1109/ICALIP.2008.4590021
Filename :
4590021
Link To Document :
بازگشت