• DocumentCode
    1841080
  • Title
    A Robust Endpoint Detection Algorithm for Video Caption Generation
  • Author
    Li, Qi ; Ma, Huadong ; Feng, Shuo
  • Author_Institution
    Beijing Key Lab. of Intell. Telecommun. Software & Multimedia, Beijing Univ. of Posts & Telecommun., Beijing
  • fYear
    2008
  • fDate
    18-21 Nov. 2008
  • Firstpage
    942
  • Lastpage
    946
  • Abstract
    This paper proposes a robust endpoint detection algorithm for continuous speech in noisy environments, which can be used in automatic video caption generation systems. The algorithm integrates the widely used energy, zero-crossing, and entropy features into a new feature, the EZE-feature, which retains the advantages of each individual feature while compensating for its drawbacks. Moreover, we propose a robust endpoint detection method in which the EZE-feature adjusts its environment parameters to adapt to the strength of the background noise. The proposed algorithm has been used in an automatic video caption generation system, where it performs very well.
  • Keywords
    entropy; speech recognition; video signal processing; EZE-feature; automatic video caption generation; continuous speech recognition system; robust endpoint detection algorithm; zero crossing feature; Acoustic noise; Background noise; Detection algorithms; Entropy; Noise generators; Noise robustness; Speaker recognition; Speech; Telecommunication computing; Working environment noise; Endpoint detection; Speech analysis; video caption generation
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Young Computer Scientists, 2008. ICYCS 2008. The 9th International Conference for
  • Conference_Location
    Hunan
  • Print_ISBN
    978-0-7695-3398-8
  • Electronic_ISBN
    978-0-7695-3398-8
  • Type
    conf
  • DOI
    10.1109/ICYCS.2008.71
  • Filename
    4709101
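
The abstract names three per-frame features (energy, zero crossing, entropy) and a threshold that adapts to background noise, but it does not say how the EZE-feature combines them. The sketch below is therefore only an illustration, not the authors' method: it computes the three features separately and applies a noise-adaptive threshold to energy alone. All function names, the frame sizes, the histogram-based entropy, the `noise_frames` count, and the factor `k` are assumptions made for this example.

```python
import math


def frames(signal, size, hop):
    """Split a sample sequence into fixed-size frames taken every `hop` samples."""
    return [signal[i:i + size] for i in range(0, len(signal) - size + 1, hop)]


def short_time_energy(frame):
    """Sum of squared samples in the frame."""
    return sum(x * x for x in frame)


def zero_crossings(frame):
    """Number of sign changes between consecutive samples."""
    return sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))


def amplitude_entropy(frame, bins=16):
    """Shannon entropy (bits) of a histogram of sample amplitudes.

    Assumption: the paper may define entropy differently (e.g. spectrally).
    """
    lo, hi = min(frame), max(frame)
    if hi == lo:
        return 0.0
    counts = [0] * bins
    for x in frame:
        idx = min(int((x - lo) / (hi - lo) * bins), bins - 1)
        counts[idx] += 1
    n = len(frame)
    return -sum(c / n * math.log2(c / n) for c in counts if c)


def frame_features(signal, size=256, hop=128):
    """Return one (energy, zero_crossings, entropy) tuple per frame."""
    return [
        (short_time_energy(f), zero_crossings(f), amplitude_entropy(f))
        for f in frames(signal, size, hop)
    ]


def speech_flags(signal, size=256, hop=128, noise_frames=4, k=4.0):
    """Flag frames whose energy exceeds k times the estimated noise energy.

    Hypothetical adaptation rule: the leading `noise_frames` frames are
    assumed to contain background noise only.
    """
    energies = [e for e, _, _ in frame_features(signal, size, hop)]
    n = min(noise_frames, len(energies))
    if n == 0:
        return []
    threshold = k * sum(energies[:n]) / n
    return [e > threshold for e in energies]
```

For a signal that starts with background noise, `speech_flags(signal)` returns one boolean per frame; the first and last `True` entries give rough start and end points of the speech segment under these assumptions.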