A main speaker decision for a distributed telepresence system

Author

Hyun Woo Kim ; Mi Suk Lee ; Do Young Kim

Author_Institution

Spoken Language Process. Res. Sect., Electron. & Telecommun. Res. Inst., Daejeon, South Korea

fYear

2013

fDate

14-16 Oct. 2013

Firstpage

862

Lastpage

864

Abstract

In this paper, we propose a method to detect a main speaker and automatically change into one´s high definition (HD) video for a distributed telepresence system, so that the users feel immersive and convenient. In contrast to centralized systems, user equipment (UE) performs the main speaker decision (MSD) with a time synchronization using network time protocol (NTP). The MSD method includes a voice activity detection (VAD) and post-corrections to remove unwanted voice detections and share the same main speaker. We emphasize an audio signal of the main speaker to become more immersive. The proposed approach is applied to the telepresence system developed by ETRI and shows good performances^.

Keywords

audio signals; high definition video; protocols; telecontrol; virtual reality; ETRI; HD video; MSD method; NTP; UE; VAD; audio signal; centralized system; distributed telepresence system; high definition video; main speaker decision; network time protocol; time synchronization; unwanted voice detection removal; user equipment; voice activity detection; HD video changing; Main speaker decision; immersive teleconference; telpresence;

fLanguage

English

Publisher

ieee

Conference_Titel

ICT Convergence (ICTC), 2013 International Conference on

Conference_Location

Jeju

Type

conf

DOI

10.1109/ICTC.2013.6675502

Filename

6675502