DocumentCode :
2361836
Title :
Neural network models for extracting complementary speaker-specific information from residual phase
Author :
Kodukula, Sri Rama Murty ; Prasanna, S. R Mahadeva ; Yegnanarayana, B.
Author_Institution :
Dept. of Comput. Sci. & Eng., Indian Inst. of Technol., Chennai, India
fYear :
2005
fDate :
4-7 Jan. 2005
Firstpage :
421
Lastpage :
425
Abstract :
In this paper using neural network models we demonstrate the presence of complementary speaker-specific information in the residual phase as compared to the conventional spectral features. The spectral features mainly represent the speaker-specific vocal tract system features. The proposed LP residual phase represents the speaker-specific excitation source information. Speaker recognition studies are conducted using NIST 2003 speaker recognition evaluation database. The speaker recognition system using only spectral features gives an equal error rate (EER) of 15.5% and using only LP residual phase information gives an EER of 22.0%. However, combining the evidences from LP residual phase and spectral features increases the performance to an EER of 13.5%. This result clearly demonstrates the complementary nature of speaker-specific information present in the LP residual phase.
Keywords :
information retrieval; neural nets; prediction theory; speaker recognition; speech processing; LP residual phase; complementary speaker-specific information extraction; linear prediction analysis; neural network models; speaker recognition system; speaker-specific vocal tract system features; Cepstral analysis; Computer science; Data mining; Laboratories; Linear predictive coding; Neural networks; Signal analysis; Speaker recognition; Speech analysis; Time varying systems;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Sensing and Information Processing, 2005. Proceedings of 2005 International Conference on
Print_ISBN :
0-7803-8840-2
Type :
conf
DOI :
10.1109/ICISIP.2005.1529489
Filename :
1529489
Link To Document :
بازگشت