Title : 
Computer Assisted Transcription of Text Images: Results on the GERMANA Corpus and Analysis of Improvements Needed for Practical Use
         
        
            Author : 
Romero, Verónica ; Toselli, Alejandro H. ; Vidal, Enrique
         
        
            Author_Institution : 
Inst. Tecnol. de Inf., Univ. Politec. de Valencia, Valencia, Spain
         
        
        
        
        
        
            Abstract : 
We present a study of the application of Computer Assisted Transcription of Text Images (CATTI) to a task that is much closer to real applications than the tasks previously studied. The new task consists of transcribing a new, publicly available historic handwritten document called GERMANA. A detailed analysis of the main factors influencing system performance is presented, and some strategies to circumvent them are proposed.
         
        
            Keywords : 
document image processing; handwritten character recognition; text analysis; GERMANA corpus; computer assisted transcription; handwritten document; text images; Context; Erbium; Feature extraction; Hidden Markov models; Humans; Training; Vocabulary; Handwritten text image recognition; interactive predictive framework
         
        
        
        
            Conference_Title : 
Pattern Recognition (ICPR), 2010 20th International Conference on
         
        
            Conference_Location : 
Istanbul
         
        
        
            Print_ISBN : 
978-1-4244-7542-1
         
        
        
            DOI : 
10.1109/ICPR.2010.497