Title :
Auditory features with vocal track length normalization for language identification
Author :
Zhang, Weiqiang ; Liu, Jia ; He, Liang
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing
Abstract :
This paper reports on a novel feature, auditory cepstrum coefficient (ACC) with vocal tract length normalization (VTLN), for language identification (LID). The ACC feature is based on the auditory characteristics of human ear and the VTLN technology compensates the speaker variability. The detailed implementation of ACC feature with VTLN in frequency domain is given. Experimental results show that the proposed auditory feature outperforms its widely used Mel-frequency cepstrum coefficient (MFCC) counterpart and is more effective when combined with VTLN.
Keywords :
cepstral analysis; feature extraction; frequency-domain analysis; speech recognition; Mel-frequency cepstrum coefficient; auditory cepstrum coefficient feature extraction; frequency domain analysis; language identification; speaker variability compensation; speech recognition; vocal track length normalization; Band pass filters; Bandwidth; Biomembranes; Cepstrum; Ear; Frequency domain analysis; Helium; Humans; Mel frequency cepstral coefficient; Speech recognition;
Conference_Titel :
Audio, Language and Image Processing, 2008. ICALIP 2008. International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-1723-0
Electronic_ISBN :
978-1-4244-1724-7
DOI :
10.1109/ICALIP.2008.4590021