DocumentCode :
2915336
Title :
Synchronization and combination techniques for audio-video based handwritten mathematical content recognition in classroom videos
Author :
Vemulapalli, Smita ; Hayes, Monson
Author_Institution :
Center for Signal & Image Process., Georgia Inst. of Technol., Atlanta, GA, USA
fYear :
2011
fDate :
22-24 Nov. 2011
Firstpage :
941
Lastpage :
946
Abstract :
Recognizing handwritten mathematical content is a challenging problem, and more so when such content appears in classroom videos. However, because the handwritten text and the accompanying audio in such videos refer to the same content, combining a video-based and an audio-based recognizer has the potential to significantly improve content recognition accuracy. In this paper, we focus on improving character recognition accuracy in such videos using a combination of video-based and audio-based recognizers, and propose: (1) synchronization techniques for establishing a correspondence between the handwritten and the spoken content, and (2) combination techniques for merging the outputs of the video-based and audio-based recognizers. The current implementation of the system uses a modified open-source text recognizer and a commercially available phonetic word-spotter. For evaluation, we use videos recorded in a classroom-like environment; our experiments demonstrate a significant improvement in character recognition accuracy (≈ 24% relative increase over the baseline video-based recognizer) using our techniques.
Keywords :
audio signal processing; computer aided instruction; handwriting recognition; video signal processing; audio based recognizer; audio video based handwritten mathematical content recognition; classroom videos; combination techniques; handwritten text; open source text recognizer; spoken content; synchronization techniques; video based recognizer; Accuracy; Character recognition; Handwriting recognition; Speech recognition; Synchronization; Video recording; Videos; audio-video classifier combination; handwriting recognition; speech recognition;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Intelligent Systems Design and Applications (ISDA), 2011 11th International Conference on
Conference_Location :
Cordoba
ISSN :
2164-7143
Print_ISBN :
978-1-4577-1676-8
Type :
conf
DOI :
10.1109/ISDA.2011.6121779
Filename :
6121779