• DocumentCode
    1581002
  • Title

    Arabic hand-written text-line extraction

  • Author

    Zahour, A. ; Taconet, B. ; Mercy, P. ; Ramdane, S.

  • Author_Institution
    Equipe GED, Le Havre Univ., France
  • fYear
    2001
  • fDate
    6/23/1905 12:00:00 AM
  • Firstpage
    281
  • Lastpage
    285
  • Abstract
    This paper describes a text-line extraction based method. The typical segmentation for a printed binary document is based on the horizontal projection analysis and then the regrouping of the connected components. These techniques can´t be used for handwritten unconstrained text because data frequently contain undulations and shifts in the baseline, baseline-skew variability and inter-line distance variability. So, we think that the border line for a handwritten unconstrained documents should be a collection of horizontal line segments. From this point of view, we use a partial contour following based method to detect the separating lines. In the current version of our algorithm, we proceed to text slant detection, text line number evaluation by using partial projection. Then we carry out a partial contour following of every line; first in the direction of the writing, then in the opposite direction. After the treatment, the adjacent lines are separated. In the experimental session, we describe the application of the algorithm used for the extraction of text line. Database images contains about one hundred handwritten Arabic texts written by different writers. Results about diacritical points affectation are also reported
  • Keywords
    feature extraction; handwritten character recognition; visual databases; Arabic handwritten text-line extraction; baseline-skew variability; database images; handwritten unconstrained documents; horizontal line segments; horizontal projection analysis; inter-line distance variability; partial contour following based method; printed binary document segmentation; text line number evaluation; text slant detection; Autocorrelation; Data mining; Histograms; Image analysis; Image segmentation; Performance analysis; Strips; Writing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2001. Proceedings. Sixth International Conference on
  • Conference_Location
    Seattle, WA
  • Print_ISBN
    0-7695-1263-1
  • Type

    conf

  • DOI
    10.1109/ICDAR.2001.953799
  • Filename
    953799