Regional Information Center for Science and Technology - Audiovisual-based adaptive speaker identification

DocumentCode :

1873861

Title :

Audiovisual-based adaptive speaker identification

Author :

Li, Ying ; Narayanan, Shrikanth ; Kuo, C. C Jay

Author_Institution :

Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA, USA

Volume :

fYear :

2003

fDate :

6-9 July 2003

Abstract :

This paper presents an adaptive speaker identification system that recognizes speakers in feature films by exploiting both audio and visual cues. Specifically, the audio source is first analyzed to identify speakers using a likelihood-based approach. Meanwhile, the visual source is parsed to recognize talking faces using face detection/recognition and mouth tracking techniques. These two information sources are then integrated under a probabilistic framework for improved system performance. Moreover, to account for variations in speakers' voices over time, we update their acoustic models on the fly by adapting to their newly contributed speech data. An average identification accuracy of 80% has been achieved on two test movies, which suggests that the proposed audiovisual-based adaptive speaker identification approach is promising.
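
The pipeline summarized in the abstract (likelihood-based audio scoring, probabilistic fusion with visual evidence, and on-the-fly model adaptation) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: each speaker is modeled by a single one-dimensional Gaussian over a scalar feature, the two cues are fused with a simple product rule, adaptation is a running-mean update, and all names (`audio_posteriors`, `fuse`, `adapt_mean`, the speakers, the numbers) are hypothetical.

```python
import math


def gaussian_log_likelihood(samples, mean, var):
    """Sum of log N(x | mean, var) over all samples in a speech segment."""
    return sum(
        -0.5 * (math.log(2 * math.pi * var) + (x - mean) ** 2 / var)
        for x in samples
    )


def audio_posteriors(samples, models):
    """Turn per-speaker likelihoods into posteriors (uniform prior assumed)."""
    log_liks = {
        name: gaussian_log_likelihood(samples, m["mean"], m["var"])
        for name, m in models.items()
    }
    top = max(log_liks.values())  # subtract the max for numerical stability
    weights = {name: math.exp(ll - top) for name, ll in log_liks.items()}
    total = sum(weights.values())
    return {name: w / total for name, w in weights.items()}


def fuse(audio_post, visual_post):
    """Product-rule fusion, assuming the two cues are independent (assumption)."""
    joint = {name: audio_post[name] * visual_post[name] for name in audio_post}
    total = sum(joint.values())
    return {name: p / total for name, p in joint.items()}


def adapt_mean(model, samples):
    """Running-mean update of a speaker model with newly attributed speech."""
    n_old, n_new = model["count"], len(samples)
    model["mean"] = (model["mean"] * n_old + sum(samples)) / (n_old + n_new)
    model["count"] = n_old + n_new


# Hypothetical speakers, features, and face-recognition scores.
models = {
    "alice": {"mean": 0.0, "var": 1.0, "count": 10},
    "bob": {"mean": 3.0, "var": 1.0, "count": 10},
}
segment = [2.5, 3.1, 2.9]
visual = {"alice": 0.3, "bob": 0.7}

fused = fuse(audio_posteriors(segment, models), visual)
best = max(fused, key=fused.get)
adapt_mean(models[best], segment)  # adapt only the identified speaker
```

In this toy setup the segment is attributed to the speaker whose fused score is highest, and only that speaker's model is updated with the new data. A real system would use richer acoustic models and features than the scalar Gaussian assumed here.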

Keywords :

adaptive signal processing; audio signal processing; audio-visual systems; face recognition; speaker recognition; video signal processing; adaptive speaker identification; audio source; audiovisual-based; face detection/recognition; likelihood-based approach; mouth tracking techniques; probabilistic framework; visual source; Adaptive systems; Databases; Face detection; Face recognition; Loudspeakers; Motion pictures; Mouth; Speech; System performance; Training data;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on

Print_ISBN :

0-7803-7965-9

Type :

conf

DOI :

10.1109/ICME.2003.1221374

Filename :

1221374

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1873861