DocumentCode :
3472242
Title :
Neural net approach for speaker sensitive measure analysis
Author :
Qixiu, Hu ; Yue, Pan
Author_Institution :
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
fYear :
1997
fDate :
9-12 Sep 1997
Firstpage :
365
Lastpage :
368
Abstract :
Speech signals can be described by filters and excitation. This paper presents an analysis of the speaker information from this view point. We use the self organizing map of Kohonen (SOM) to explore the effectiveness of these two parts of a parameter in representing individual features of speakers. LSPs can be viewed as all-pole filters. Then, the excitation sequences of the filters are studied for discriminative characteristics of the speaker. In the experiment, three kinds of parameters-direct MFCC, LSP, and residual MFCC are used to build a feature map. Finally SOM shows that direct MFCC and LSP produce very similar feature maps for the same speaker in their general feature space. A correlation criterion gives further verification. While the SOMs of excitation signals are less discriminative between each other, they are less speaker sensitive
Keywords :
feature extraction; filtering theory; prediction theory; self-organising feature maps; speaker recognition; transfer functions; LSP; all-pole filters; correlation criterion; direct MFCC; discriminative characteristics; excitation; excitation sequences; excitation signals; residual MFCC; self organizing map; speaker sensitive measure analysis; speech signals; Cepstral analysis; Computer science; Feature extraction; Filters; Mel frequency cepstral coefficient; Neural networks; Organizing; Speaker recognition; Speech analysis; Speech coding;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Emerging Technologies and Factory Automation Proceedings, 1997. ETFA '97., 1997 6th International Conference on
Conference_Location :
Los Angeles, CA
Print_ISBN :
0-7803-4192-9
Type :
conf
DOI :
10.1109/ETFA.1997.616297
Filename :
616297
Link To Document :
بازگشت