• DocumentCode
    153306
  • Title

    Flexible Noisy Text Correction

  • Author

    Sariev, Andrey ; Nenchev, Vladislav ; Gerdjikov, Stefan ; Mitankin, Petar ; Ganchev, Hristo ; Mihov, Stoyan ; Tinchev, Tinko

  • fYear
    2014
  • fDate
    7-10 April 2014
  • Firstpage
    31
  • Lastpage
    35
  • Abstract
    We present a new general and language independent approach to the noisy text correction problem developed and implemented in the framework of the CULTURA project. We briefly describe the core candidate generator, REBELS, the complete system concept, its efficient implementation based on functional automata and its immediate applications. The quality of the whole system is empirically established in different experimental settings where language and noise sources are varied.
  • Keywords
    automata theory; error correction; language translation; learning (artificial intelligence); text analysis; text editing; CULTURA project; REBELS; complete system concept; core candidate generator; flexible noisy text correction; functional automata; language independent approach; Automata; Computational modeling; Nickel; Noise; Noise measurement; Optical character recognition software; Standards; OCR correction; finite state automata; historical texts normalisation; noisy-text correction; statistical methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis Systems (DAS), 2014 11th IAPR International Workshop on
  • Conference_Location
    Tours
  • Print_ISBN
    978-1-4799-3243-6
  • Type

    conf

  • DOI
    10.1109/DAS.2014.12
  • Filename
    6830964