Title :
A multi-modal virtual environment with text-independent real-time speaker identification
Author :
Dagtas, Serhan ; Sarimollaoglu, Mustafa ; Iqbal, Kamran
Author_Institution :
Dept. of Inf. Sci., Arkansas Univ., Little Rock, AR, USA
Abstract :
We present a speaker identification method to support a multimodal virtual environment. Most virtual environments require real-time and text-independent recognition of human speakers in order to manage the dialogue between the virtual characters and human users in the environment. Being widely used in pattern recognition tasks, neural networks have also been applied in speaker recognition. In this project, we developed a realtime text-independent speaker identification system based on probabilistic neural network (PNN). PNNs supply flexibility and straightforward design which make the system easily operable along with the successful classification results. We were able to correctly identify 96% of the speakers, using 0.8 seconds of test samples from each speaker. In addition to the description of the system and the experimental results, the effects of the feature vectors and codebook sizes on the performance are provided.
Keywords :
neural nets; real-time systems; speaker recognition; virtual reality; multi-modal virtual environment; pattern recognition tasks; probabilistic neural network; text-independent real-time speaker identification; Computational efficiency; Computer architecture; Error analysis; Humans; Neural networks; Speaker recognition; Speech recognition; System testing; Transaction databases; Virtual environment; Multi-modal virtual environments; Probabilistic Neural Networks; Speaker Identification;
Conference_Titel :
Multimedia Software Engineering, 2004. Proceedings. IEEE Sixth International Symposium on
Print_ISBN :
0-7695-2217-3
DOI :
10.1109/MMSE.2004.14