DocumentCode :
1732897
Title :
Design of audio-visual TV broadcast news transcription system prototype
Author :
Chaloupka, Josef
Author_Institution :
Lab. of Comput. Speech Process., Tech. Univ. of Liberec, Liberec, Czech Republic
fYear :
2011
Firstpage :
209
Lastpage :
212
Abstract :
This contribution focuses on the design of our automatic audio-visual TV broadcast news transcription system, where we would like to extend our Czech transcription system to use information from the visual signal of TV news video recordings. The subsystems for visual signal segmentation, for visual speaker identification and for visual voice activity detection are described here. These subsystems should help to develop our automatic audiovisual transcription system.
Keywords :
audio-visual systems; image segmentation; speaker recognition; television broadcasting; video signal processing; Czech transcription system; TV news video recordings; automatic audio-visual TV broadcast news transcription system; visual signal segmentation; visual speaker identification; visual voice activity detection; Discrete cosine transforms; Hidden Markov models; Humans; Image color analysis; Image segmentation; Speech recognition; Visualization; audio-visual TV broadcast news transcription; visual signal segmentation; visual speaker idntification; visual voice activity detector;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
ELMAR, 2011 Proceedings
Conference_Location :
Zadar
ISSN :
1334-2630
Print_ISBN :
978-1-61284-949-2
Electronic_ISBN :
1334-2630
Type :
conf
Filename :
6044293
Link To Document :
بازگشت