DocumentCode
153306
Title
Flexible Noisy Text Correction
Author
Sariev, Andrey ; Nenchev, Vladislav ; Gerdjikov, Stefan ; Mitankin, Petar ; Ganchev, Hristo ; Mihov, Stoyan ; Tinchev, Tinko
fYear
2014
fDate
7-10 April 2014
Firstpage
31
Lastpage
35
Abstract
We present a new general and language independent approach to the noisy text correction problem developed and implemented in the framework of the CULTURA project. We briefly describe the core candidate generator, REBELS, the complete system concept, its efficient implementation based on functional automata and its immediate applications. The quality of the whole system is empirically established in different experimental settings where language and noise sources are varied.
Keywords
automata theory; error correction; language translation; learning (artificial intelligence); text analysis; text editing; CULTURA project; REBELS; complete system concept; core candidate generator; flexible noisy text correction; functional automata; language independent approach; Automata; Computational modeling; Nickel; Noise; Noise measurement; Optical character recognition software; Standards; OCR correction; finite state automata; historical texts normalisation; noisy-text correction; statistical methods;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis Systems (DAS), 2014 11th IAPR International Workshop on
Conference_Location
Tours
Print_ISBN
978-1-4799-3243-6
Type
conf
DOI
10.1109/DAS.2014.12
Filename
6830964
Link To Document