DocumentCode :
1516988
Title :
Significance of Vowel-Like Regions for Speaker Verification Under Degraded Conditions
Author :
Prasanna, S.R.M. ; Pradhan, G.
Author_Institution :
Dept. of Electron. & Electr. Eng., Indian Inst. of Technol. Guwahati, Guwahati, India
Volume :
19
Issue :
8
fYear :
2011
Firstpage :
2552
Lastpage :
2565
Abstract :
Vowel-like regions (VLRs) in speech include vowels, semi-vowels, and diphthong sound units. A VLR can be identified using a vowel-like region onset point (VLROP) event. By the nature of its production, a VLR has impulse-like excitation, and therefore information about the vocal tract system may be better manifested in it. A VLR is also a relatively high signal-to-noise ratio (SNR) region. Speaker information extracted from such a region may therefore be more speaker-discriminative and relatively less affected by degradations such as noise, reverberation, and sensor mismatch. As a result, better speaker modeling and more reliable testing may be possible. In this paper, VLRs are detected using knowledge of VLROPs during both training and testing. Features from the VLRs are then used for training and testing the speaker models. A significant improvement in performance is reported for speaker verification under degraded conditions.
Keywords :
reverberation; speaker recognition; VLR; degradation like noise; diphthong sound unit; impulse-like excitation; sensor mismatch; signal-to-noise ratio; speaker information; speaker model testing; speaker model training; speaker verification; vocal tract system; vowel-like region onset point; Degradation; Noise; Testing; Training; Degraded condition; speaker verification (SV); vowel-like region (VLR)
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
IEEE
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2011.2155061
Filename :
5767548