• DocumentCode
    438876
  • Title

    An adaptive model of person identification combining speech and image information

  • Author

    Zhang, David ; Ghobakhlou, Akbar ; Kasabov, Nikola

  • Author_Institution
    Knowledge Eng. & Discovery Res. Inst., Auckland Univ. of Technol., New Zealand
  • Volume
    1
  • fYear
    2004
  • fDate
    6-9 Dec. 2004
  • Firstpage
    413
  • Abstract
    The paper introduces a combination of adaptive neural network systems and statistical method for integrating speech and face image information for person identification. The method allows for the development of models of persons and their on-going adjustment based on new speech and face images. The method is illustrated with a modeling and classification of different persons, when speech and face images are presented in an incremental way. In this model, there are two subnetworks, one for face image and one for speaker recognition. A higher-level layer is applied to make a final decision. In the speaker recognition subnetwork, a text-dependant model is built using evolving connectionist systems (ECOS) [N. Kasabov, 2002]. In the face image recognition subnetwork, composite profile technique is applied for face image feature extraction and zero instruction set computing (ZISC) [ZISC Manual, 2000] technology is used to build the neural network. In the higher-level conceptual subsystem, final recognition decision is made using statistical method. The experiments show that ECOS and ZISC are appropriate techniques for the creation of evolving models for the task of speaker and face recognition individually. It is also shown that the integration of the speech and image information using statistical method improves the person identification rate.
  • Keywords
    adaptive systems; face recognition; feature extraction; neural nets; speaker recognition; statistical analysis; adaptive neural network systems; composite profile technique; evolving connectionist systems; face image feature extraction; face image information; face image recognition subnetwork; higher-level conceptual subsystem; person identification; speaker recognition subnetwork; speech information; statistical method; zero instruction set computing; Adaptive systems; Computer aided instruction; Computer networks; Feature extraction; Image recognition; Manuals; Neural networks; Speaker recognition; Speech; Statistical analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Control, Automation, Robotics and Vision Conference, 2004. ICARCV 2004 8th
  • Print_ISBN
    0-7803-8653-1
  • Type

    conf

  • DOI
    10.1109/ICARCV.2004.1468861
  • Filename
    1468861