• DocumentCode
    2265667
  • Title

    A multi-modal virtual environment with text-independent real-time speaker identification

  • Author

    Dagtas, Serhan ; Sarimollaoglu, Mustafa ; Iqbal, Kamran

  • Author_Institution
    Dept. of Inf. Sci., Arkansas Univ., Little Rock, AR, USA
  • fYear
    2004
  • fDate
    13-15 Dec. 2004
  • Firstpage
    557
  • Lastpage
    560
  • Abstract
    We present a speaker identification method to support a multimodal virtual environment. Most virtual environments require real-time and text-independent recognition of human speakers in order to manage the dialogue between the virtual characters and human users in the environment. Being widely used in pattern recognition tasks, neural networks have also been applied in speaker recognition. In this project, we developed a realtime text-independent speaker identification system based on probabilistic neural network (PNN). PNNs supply flexibility and straightforward design which make the system easily operable along with the successful classification results. We were able to correctly identify 96% of the speakers, using 0.8 seconds of test samples from each speaker. In addition to the description of the system and the experimental results, the effects of the feature vectors and codebook sizes on the performance are provided.
  • Keywords
    neural nets; real-time systems; speaker recognition; virtual reality; multi-modal virtual environment; pattern recognition tasks; probabilistic neural network; text-independent real-time speaker identification; Computational efficiency; Computer architecture; Error analysis; Humans; Neural networks; Speaker recognition; Speech recognition; System testing; Transaction databases; Virtual environment; Multi-modal virtual environments; Probabilistic Neural Networks; Speaker Identification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia Software Engineering, 2004. Proceedings. IEEE Sixth International Symposium on
  • Print_ISBN
    0-7695-2217-3
  • Type

    conf

  • DOI
    10.1109/MMSE.2004.14
  • Filename
    1376707