DocumentCode :
2329044
Title :
Robust prosodic features for speaker identification
Author :
Carey, Michael J. ; Parris, Eluned S. ; Lloyd-Thomas, Harvey ; Bennett, Stephen
Author_Institution :
Ensigma Ltd., Gwent, UK
Volume :
3
fYear :
1996
fDate :
3-6 Oct 1996
Firstpage :
1800
Abstract :
The paper describes the use of prosodic features for speaker identification. Features based on the pitch and energy contours of speech are described and the relative importance of each feature for speaker identification is investigated. The mean and variance of the pitch period in voiced sections of speech are shown to be particularly useful at discriminating between speakers. Fusing these features with a hidden Markov model speaker identification system gave a marked improvement in figure of merit; over 30% gain was achieved on the six NIST 1995 evaluation tests presented. Handset variability is known to have an adverse effect on performance when traditional spectral features are used, e.g. cepstra. Results are presented showing that the prosodic features are more robust to handset variability
Keywords :
cepstral analysis; hidden Markov models; speaker recognition; NIST 1995 evaluation tests; cepstra; energy contours; handset variability; hidden Markov model speaker identification system; pitch contours; pitch period; robust prosodic features; speaker identification; spectral features; voiced sections; Digital signal processors; Hidden Markov models; NIST; Robustness; Signal processing algorithms; Speaker recognition; Speech coding; System testing; Telephone sets; Workstations;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
Type :
conf
DOI :
10.1109/ICSLP.1996.607979
Filename :
607979
Link To Document :
بازگشت