Title :
Text-prompted speaker verification experiments with phoneme specific MLPs
Author :
Delacrétaz, Dijana Petrovska ; Hennebert, Jean
Author_Institution :
Swiss Fed. Inst. of Technol., Switzerland
Abstract :
The aims of the study described in this paper are (1) to assess the relative speaker discriminant properties of phonemes and (2) to investigate the importance of the temporal frame-to-frame information for speaker modelling in the framework of a text-prompted speaker verification system using hidden Markov models (HMMs) and multilayer perceptrons (MLPs). It is shown that, with similar experimental conditions, nasals, fricatives and vowels convey more speaker specific information than plosives and liquids. Regarding the influence of the frame-to-frame temporal information, significant improvements are reported from the inclusion of several acoustic frames at the input of the MLPs. The results tend also to show that each phoneme has its optimal MLP context size giving the best equal error rate (EER)
Keywords :
acoustic signal processing; error statistics; hidden Markov models; multilayer perceptrons; speaker recognition; speech processing; HMM; acoustic frames; equal error rate; fricatives; hidden Markov models; liquids; multilayer perceptrons; nasals; optimal MLP context size; phoneme specific MLP; plosives; speaker discriminant properties; speaker modelling; temporal frame-to-frame information; text-prompted speaker verification experiments; vowels; Circuits and systems; Error analysis; Hidden Markov models; Liquids; Loudspeakers; Security; Speech coding; Speech recognition; Text recognition; Vocabulary;
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7803-4428-6
DOI :
10.1109/ICASSP.1998.675380