Title :
Text Alignment from Bimodal Mathematical Expression Sources
Author :
Medjkoune, Sofiane ; Mouchere, Harold ; Viard-Gaudin, Christian ; Petitrenaud, Simon
Author_Institution :
IRCCyN, Univ. of Nantes, Nantes, France
Abstract :
In this paper we propose a new approach to merging mathematical expression recognition results from the handwriting and speech modalities. A bimodal description of mathematical expressions makes it possible to exploit the complementarity of the two signals and to disambiguate situations where a single modality would not be clear enough. To combine the signals from both modalities, we represent them in a common space as textual descriptions. First, from the handwriting signal, we generate the N-best mathematical expressions, each of which is then translated into several possible strings. From the audio signal, an automatic speech recognition system provides a transcript, which is also available as a string. A string comparison algorithm is then applied to select the best mathematical expressions. The bimodal system is evaluated on real bimodal data from the HAMEX dataset, and the results are compared with those of a single-modality (handwriting-only) system.
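The selection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the string comparison algorithm, so a word-level Levenshtein distance is assumed here, and the function names (`word_edit_distance`, `rerank`), the candidate format, and the sample verbalizations are all hypothetical. Each handwriting candidate is assumed to come with at least one textual reading.

```python
from typing import List, Sequence, Tuple


def word_edit_distance(a: Sequence[str], b: Sequence[str]) -> int:
    """Levenshtein distance between two word sequences (assumed metric)."""
    prev = list(range(len(b) + 1))
    for i, word_a in enumerate(a, start=1):
        curr = [i]
        for j, word_b in enumerate(b, start=1):
            cost = 0 if word_a == word_b else 1
            curr.append(min(prev[j] + 1,          # delete word_a
                            curr[j - 1] + 1,      # insert word_b
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1]


def rerank(candidates: List[Tuple[str, List[str]]],
           transcript: str) -> List[Tuple[int, str]]:
    """Order handwriting candidates by closeness to the speech transcript.

    `candidates` is a hypothetical N-best list of
    (expression, possible textual readings) pairs, best handwriting score first.
    """
    reference = transcript.lower().split()
    scored = []
    for expression, readings in candidates:
        # Keep the reading that aligns best with the transcript.
        best = min(word_edit_distance(r.lower().split(), reference)
                   for r in readings)
        scored.append((best, expression))
    # Stable sort: ties keep the original handwriting ranking.
    scored.sort(key=lambda pair: pair[0])
    return scored


# Hypothetical example: handwriting alone ranks the subscript reading first.
n_best = [
    ("x_2 + 1", ["x sub two plus one", "x two plus one"]),
    ("x^2 + 1", ["x squared plus one", "x to the power two plus one"]),
]
ranking = rerank(n_best, "x squared plus one")
```

In this made-up case the transcript moves the superscript reading to the top of the ranking, which is the kind of disambiguation the abstract attributes to the second modality.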
Keywords :
handwritten character recognition; speech recognition; string matching; text analysis; HAMEX dataset; MER; Nbest mathematical expressions; audio signal; automatic speech recognition system; bimodal mathematical expression sources; handwriting based mathematical expression recognition; string comparison algorithm; text alignment; Handwriting recognition;
Conference_Title :
Frontiers in Handwriting Recognition (ICFHR), 2014 14th International Conference on
Conference_Location :
Heraklion
Print_ISBN :
978-1-4799-4335-7
DOI :
10.1109/ICFHR.2014.42