DocumentCode
2265667
Title
A multi-modal virtual environment with text-independent real-time speaker identification
Author
Dagtas, Serhan ; Sarimollaoglu, Mustafa ; Iqbal, Kamran
Author_Institution
Dept. of Inf. Sci., Arkansas Univ., Little Rock, AR, USA
fYear
2004
fDate
13-15 Dec. 2004
Firstpage
557
Lastpage
560
Abstract
We present a speaker identification method to support a multimodal virtual environment. Most virtual environments require real-time, text-independent recognition of human speakers in order to manage the dialogue between the virtual characters and human users in the environment. Neural networks, widely used in pattern recognition tasks, have also been applied to speaker recognition. In this project, we developed a real-time text-independent speaker identification system based on a probabilistic neural network (PNN). PNNs offer flexibility and a straightforward design, which make the system easy to operate while delivering successful classification results. We were able to correctly identify 96% of the speakers using 0.8 seconds of test samples from each speaker. In addition to a description of the system and the experimental results, we report the effects of the feature vectors and codebook sizes on performance.
Keywords
neural nets; real-time systems; speaker recognition; virtual reality; multi-modal virtual environment; pattern recognition tasks; probabilistic neural network; text-independent real-time speaker identification; Computational efficiency; Computer architecture; Error analysis; Humans; Neural networks; Speaker recognition; Speech recognition; System testing; Transaction databases; Virtual environment; Multi-modal virtual environments; Probabilistic Neural Networks; Speaker Identification
fLanguage
English
Publisher
IEEE
Conference_Titel
Multimedia Software Engineering, 2004. Proceedings. IEEE Sixth International Symposium on
Print_ISBN
0-7695-2217-3
Type
conf
DOI
10.1109/MMSE.2004.14
Filename
1376707