DocumentCode
2372135
Title
A main speaker decision for a distributed telepresence system
Author
Hyun Woo Kim ; Mi Suk Lee ; Do Young Kim
Author_Institution
Spoken Language Process. Res. Sect., Electron. & Telecommun. Res. Inst., Daejeon, South Korea
fYear
2013
fDate
14-16 Oct. 2013
Firstpage
862
Lastpage
864
Abstract
In this paper, we propose a method to detect a main speaker and automatically change into one´s high definition (HD) video for a distributed telepresence system, so that the users feel immersive and convenient. In contrast to centralized systems, user equipment (UE) performs the main speaker decision (MSD) with a time synchronization using network time protocol (NTP). The MSD method includes a voice activity detection (VAD) and post-corrections to remove unwanted voice detections and share the same main speaker. We emphasize an audio signal of the main speaker to become more immersive. The proposed approach is applied to the telepresence system developed by ETRI and shows good performances.
Keywords
audio signals; high definition video; protocols; telecontrol; virtual reality; ETRI; HD video; MSD method; NTP; UE; VAD; audio signal; centralized system; distributed telepresence system; high definition video; main speaker decision; network time protocol; time synchronization; unwanted voice detection removal; user equipment; voice activity detection; HD video changing; Main speaker decision; immersive teleconference; telpresence;
fLanguage
English
Publisher
ieee
Conference_Titel
ICT Convergence (ICTC), 2013 International Conference on
Conference_Location
Jeju
Type
conf
DOI
10.1109/ICTC.2013.6675502
Filename
6675502
Link To Document