• DocumentCode
    989741
  • Title
    Lecture Video Enhancement and Editing by Integrating Posture, Gesture, and Text
  • Author
    Wang, Feng ; Ngo, Chong-Wah ; Pong, Ting-Chuen
  • Author_Institution
    Dept. of Comput. Sci., Hong Kong Univ. of Sci. & Technol.
  • Volume
    9
  • Issue
    2
  • fYear
    2007
  • Firstpage
    397
  • Lastpage
    409
  • Abstract
    This paper describes a novel framework for automatic lecture video editing based on gesture, posture, and video text recognition. In content analysis, the trajectory of hand movement is tracked and the intentional gestures are automatically extracted for recognition. In addition, head pose is estimated by overcoming the difficulties caused by the complex lighting conditions in classrooms. The aim of recognition is to characterize the flow of lecturing with a series of regional focuses depicted by human postures and gestures. The regions of interest (ROIs) in videos are semantically structured with text recognition and the aid of external documents. By tracing the flow of lecturing, a finite state machine (FSM), which incorporates the gestures, postures, ROIs, and general editing rules and constraints, is proposed to edit videos with novel views. The FSM is designed to generate appropriate simulated camera motion and cutting effects that suit the pace of a presenter's gestures and postures. To remedy the undesirable visual effects due to poor lighting conditions, we also propose approaches to automatically enhance the visibility and readability of slides and whiteboard images in the edited videos.
  • Keywords
    computer aided instruction; finite state machines; gesture recognition; image enhancement; pose estimation; text analysis; video signal processing; automatic lecture video editing; content analysis; finite state machine; gesture recognition; hand movement trajectory; head pose estimation; posture recognition; video enhancement; video text recognition; Gesture; lecture video editing; posture and video text recognition
  • fLanguage
    English
  • Journal_Title
    Multimedia, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1520-9210
  • Type
    jour
  • DOI
    10.1109/TMM.2006.886292
  • Filename
    4067011