DocumentCode
2834622
Title
Neural network models for combining evidence from spectral and suprasegmental features for text-dependent speaker verification
Author
Prasanna, S. R Mahadeva ; Zachariah, Jinu Mariam ; Yegnanarayana, B.
Author_Institution
Dept. of Comput. Sci. & Eng., Indian Inst. of Technol. Madras, Chennai, India
fYear
2004
fDate
2004
Firstpage
359
Lastpage
363
Abstract
This paper proposes a method using neural network models for combining evidence from spectral and suprasegmental features for text-dependent speaker verification. Spectral features are extracted using the Dynamic Time Warping (DTW) technique. While extracting the spectral features, the DTW algorithm is used only to obtain a matching score and the information present in the warping path is ignored. In this work a method is discussed to extract suprasegmental features such as pitch and duration using the information in the warping path. Although the suprasegmental features may not yield good performance, combining the evidence from suprasegmental and spectral features improves the performance of the speaker verification system significantly.
Keywords
feature extraction; neural nets; speaker recognition; dynamic time warping technique; neural network model; pitch extraction; spectral feature extraction; suprasegmental feature extraction; text dependent speaker verification; Biometrics; Computer science; Data mining; Feature extraction; Laboratories; Neural networks; Speaker recognition; Speech analysis; Speech processing; Telephony;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Sensing and Information Processing, 2004. Proceedings of International Conference on
Print_ISBN
0-7803-8243-9
Type
conf
DOI
10.1109/ICISIP.2004.1287683
Filename
1287683
Link To Document