• DocumentCode
    2015482
  • Title
    PRAAD: Preprocessing and Analysis Tool for Arabic Ancient Documents
  • Author
    Boussellaa, Wafa ; Zahour, Abderrazak ; Taconet, Bruno ; Alimi, Adel ; Benabdelhafid, Abdellatif
  • Author_Institution
    Univ. of Sfax, Sfax
  • Volume
    2
  • fYear
    2007
  • fDate
    23-26 Sept. 2007
  • Firstpage
    1058
  • Lastpage
    1062
  • Abstract
    This paper presents PRAAD, a new system for the preprocessing and analysis of Arabic historical documents. It comprises two main parts: preprocessing and analysis of ancient documents. After digitization, color or greyscale images of ancient documents are degraded by strong background artefacts such as optical blur and noise from scanning, show-through and bleed-through effects, and spots. To preserve and exploit these cultural heritage documents, we aim to create an efficient tool that performs restoration and binarisation and analyses the document layout. The tool was developed by adapting our expertise in image processing of Arabic ancient documents, whether printed or manuscript. The functions of the PRAAD system are tested on a set of Arabic ancient documents from the National Library and the National Archives of Tunisia.
  • Keywords
    document image processing; image colour analysis; image restoration; natural languages; Arabic ancient documents; Arabic historical documents; National Archives of Tunisia; PRAAD; color ancient documents images; document layout binarisation; document layout restoration; document preservation; greyscale ancient documents images; Background noise; Colored noise; Cultural differences; Image restoration; Libraries; Optical distortion; Optical noise; System testing; Text analysis
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
  • Conference_Location
    Parana
  • ISSN
    1520-5363
  • Print_ISBN
    978-0-7695-2822-9
  • Type
    conf
  • DOI
    10.1109/ICDAR.2007.4377077
  • Filename
    4377077