A multi-modal virtual environment with text-independent real-time speaker identification

Author

Dagtas, Serhan ; Sarimollaoglu, Mustafa ; Iqbal, Kamran

Author_Institution

Dept. of Inf. Sci., Arkansas Univ., Little Rock, AR, USA

fYear

2004

fDate

13-15 Dec. 2004

Firstpage

557

Lastpage

560

Abstract

We present a speaker identification method to support a multimodal virtual environment. Most virtual environments require real-time and text-independent recognition of human speakers in order to manage the dialogue between the virtual characters and human users in the environment. Being widely used in pattern recognition tasks, neural networks have also been applied in speaker recognition. In this project, we developed a realtime text-independent speaker identification system based on probabilistic neural network (PNN). PNNs supply flexibility and straightforward design which make the system easily operable along with the successful classification results. We were able to correctly identify 96% of the speakers, using 0.8 seconds of test samples from each speaker. In addition to the description of the system and the experimental results, the effects of the feature vectors and codebook sizes on the performance are provided.

Keywords

neural nets; real-time systems; speaker recognition; virtual reality; multi-modal virtual environment; pattern recognition tasks; probabilistic neural network; text-independent real-time speaker identification; Computational efficiency; Computer architecture; Error analysis; Humans; Neural networks; Speaker recognition; Speech recognition; System testing; Transaction databases; Virtual environment; Multi-modal virtual environments; Probabilistic Neural Networks; Speaker Identification;

fLanguage

English

Publisher

ieee

Conference_Titel

Multimedia Software Engineering, 2004. Proceedings. IEEE Sixth International Symposium on

Print_ISBN

0-7695-2217-3

Type

conf

DOI

10.1109/MMSE.2004.14

Filename

1376707