DocumentCode :
2426492
Title :
Different aspects of source information for limited data speaker verification
Author :
Das, Rohan Kumar ; Pati, Debadatta ; Mahadeva Prasanna, S.R.
Author_Institution :
Dept. of Electron. & Electr. Eng., Indian Inst. of Technol. Guwahati, Guwahati, India
fYear :
2015
fDate :
Feb. 27 2015-March 1 2015
Firstpage :
1
Lastpage :
6
Abstract :
Limited data speaker verification has shown its significance in practical system oriented applications. The paper shows the importance of different aspects of voice source feature for limited test data scenario. A baseline speaker verification system using conventional mel frequency cepstral co-efficients (MFCC) feature is developed and performance under limited test data condition (≤10 s) is evaluated. A parallel system based on source feature mel power difference of spectrum in subband (M-PDSS) is developed in the i-vector based speaker verification framework. Both the systems were fused at the score level for the cases of short segments of test speech, which demonstrated the importance of source feature with reduction in test data duration. A comparative study of the M-PDSS feature is then made with our earlier work using discrete cosine transform of the integrated linear prediction residual (DCTILPR) feature and then fusion of two source features M-PDSS and DCTILPR along with MFCC features is carried out. An absolute improvement of 5.19% is obtained for 2 s of test data which conveys the significance of multiple source information under limited data speaker verification as it carries different aspects of source information.
Keywords :
cepstral analysis; speaker recognition; DCTILPR feature; M-PDSS feature; MFCC feature; baseline speaker verification system; conventional mel frequency cepstral co-efficients feature; discrete cosine transform of integrated linear prediction residual feature; limited data speaker verification; multiple source information; practical system oriented applications; test data condition; voice source feature; Decision support systems; Dynamic range; Feature extraction; Handheld computers; Market research; Mel frequency cepstral coefficient; NIST; DCTILPR; M-PDSS; MFCC; short utterances; source features; speaker verification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications (NCC), 2015 Twenty First National Conference on
Conference_Location :
Mumbai
Type :
conf
DOI :
10.1109/NCC.2015.7084846
Filename :
7084846
Link To Document :
بازگشت