Title :
Application of peripheral auditory model to speaker identification
Author :
Abuku, Masahiro ; Azetsu, Tadahiro ; Uchino, Eiji ; Suetake, Noriaki
Author_Institution :
Grad. Sch. of Sci. & Eng., Yamaguchi Univ., Yamaguchi, Japan
Abstract :
This paper discusses an approach for speaker identification using the multi-dimensional pulse signals generated from a model of a peripheral auditory system. The model of the peripheral auditory system employed here consists of a basilar membrane, hair cells, and auditory nerves. The input to this model is a speech signal divided into frames, and the outputs from which are the multi-dimensional pulse signals for each framed signal. The feature vectors based on the post-stimulus time histogram (PSTH) of the pulse signals are used for the speaker identification. Also, in order to improve the accuracy of the speaker identification, the feature vector conversion, using the mean and the diagonal matrix of standard deviations, is performed. The experiments were conducted for each Japanese vowel spoken by 12 speakers (9 males and 3 females), and the speaker identification accuracy is evaluated by 5 hold leave 2 out cross-validation for each vowel. The effectiveness of the proposed method has been verified by comparing with the conventional LPC analysis.
Keywords :
speaker recognition; statistical analysis; Japanese vowel; auditory nerves; basilar membrane; feature vector conversion; hair cells; multidimensional pulse signals; peripheral auditory system; post-stimulus time histogram; speaker identification; standard deviations; Encoding; Production facilities; Feature vector conversion; Multi-dimensional pulse signals; Peripheral auditory system; Post-stimulus time histogram; Speaker identification;
Conference_Titel :
Nature and Biologically Inspired Computing (NaBIC), 2010 Second World Congress on
Conference_Location :
Fukuoka
Print_ISBN :
978-1-4244-7377-9
DOI :
10.1109/NABIC.2010.5716338