• DocumentCode
    166104
  • Title

    A hybrid approach for Discourse Segment Detection in the automatic subtitle generation of computer science lecture videos

  • Author

    Sridhar, Rajeswari ; Aravind, S. ; Muneerulhudhakalvathi, Hamid ; Senthur, M. Sibi

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Anna Univ., Chennai, India
  • fYear
    2014
  • fDate
    24-27 Sept. 2014
  • Firstpage
    284
  • Lastpage
    287
  • Abstract
    The aim of this paper is to develop an automatic subtitle generation system for computer science lecture videos. CMU Sphinx Speech API is used to accomplish speech recognition. The main challenge of this work, is to align the translated text with the video. Discourse Segment Detection (DSD) is the process of analyzing and identifying discourse boundaries in human speech. Discourse Segment Detection (DSD) is carried out that classifies word boundaries and groups words until a discourse break occurs. The approach that has been devised in this paper for DSD to identify word boundary is a hybrid approach combining acoustic and linguistic features from the speech. This helps to segment the text obtained from Speech Engine, group words that can be written to the subtitles file without violating the subtitle standards. The devised approach has shown an improved performance than the existing approach as the error has reduced from 30% to 18 %.
  • Keywords
    computer aided instruction; computer science education; speech recognition; speech synthesis; video signal processing; CMU Sphinx speech API; DSD; automatic subtitle generation; computer science lecture videos; hybrid discourse segment detection approach; speech engine; speech recognition; translated text; Acoustics; Automatic speech recognition; Computer science; Pragmatics; Speech; Videos; Discourse Segment Detection; Subtitles;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advances in Computing, Communications and Informatics (ICACCI, 2014 International Conference on
  • Conference_Location
    New Delhi
  • Print_ISBN
    978-1-4799-3078-4
  • Type

    conf

  • DOI
    10.1109/ICACCI.2014.6968422
  • Filename
    6968422