• DocumentCode
    2372135
  • Title

    A main speaker decision for a distributed telepresence system

  • Author

    Hyun Woo Kim ; Mi Suk Lee ; Do Young Kim

  • Author_Institution
    Spoken Language Process. Res. Sect., Electron. & Telecommun. Res. Inst., Daejeon, South Korea
  • fYear
    2013
  • fDate
    14-16 Oct. 2013
  • Firstpage
    862
  • Lastpage
    864
  • Abstract
    In this paper, we propose a method that detects the main speaker and automatically switches to that speaker's high-definition (HD) video in a distributed telepresence system, so that users have a more immersive and convenient experience. In contrast to centralized systems, the user equipment (UE) itself performs the main speaker decision (MSD), with time synchronization based on the Network Time Protocol (NTP). The MSD method comprises voice activity detection (VAD) and post-corrections that remove unwanted voice detections and ensure that the same main speaker is shared. We also emphasize the main speaker's audio signal to make the experience more immersive. The proposed approach is applied to the telepresence system developed by ETRI and shows good performance.
  • Keywords
    audio signals; high definition video; protocols; telecontrol; virtual reality; ETRI; HD video; MSD method; NTP; UE; VAD; audio signal; centralized system; distributed telepresence system; high definition video; main speaker decision; network time protocol; time synchronization; unwanted voice detection removal; user equipment; voice activity detection; HD video changing; Main speaker decision; immersive teleconference; telepresence;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    ICT Convergence (ICTC), 2013 International Conference on
  • Conference_Location
    Jeju
  • Type

    conf

  • DOI
    10.1109/ICTC.2013.6675502
  • Filename
    6675502