Title :
Grammar-assisted audio-video equation recognition
Author :
Vemulapalli, Smita ; Hayes, M.H.
Author_Institution :
Sch. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
Abstract :
In this paper, we consider the problem of recognizing handwritten mathematical content from classroom videos. Since the handwritten text and the accompanying audio refer to the same mathematical characters and symbols, a combination of video and audio based recognizers has the potential to significantly increase the recognition accuracy compared to that of the individual recognizers. We propose a novel multi-step technique for combining the output of the video and the audio based recognizers. Initial recognition results from a video based recognizer and a speech recognizer, operating independently on the handwritten and the spoken content from a classroom video, are combined with a base mathematical speech grammar to arrive at a constrained speech grammar that is specific to the content being recognized. The constrained speech grammar is then used by the speech recognizer to generate the final character recognition results. A subsequent layout analysis step, which makes use of audio cues and an X-Y cut based method, is used to arrive at the final recognized content. Experiments conducted using videos recorded in a classroom-like environment demonstrate the significant improvement in recognition accuracy that can be achieved using our technique.
Keywords :
grammars; handwriting recognition; handwritten character recognition; speech recognition; video signal processing; audio based recognizer; classroom videos; grammar-assisted audio-video equation recognition; handwritten mathematical content; handwritten text; layout analysis step; mathematical speech grammar; speech recognizer; video based recognizer; Algebra; Equations; Integrated circuits; Audio-Video Combination; Handwriting Recognition; Speech Recognition;
Conference_Title :
Digital Signal Processing (DSP), 2013 18th International Conference on
Conference_Location :
Fira
DOI :
10.1109/ICDSP.2013.6622671