• DocumentCode
    2143948
  • Title

    Bags of Strokes Based Approach for Classification and Indexing of Drop Caps

  • Author

    Nguyen, Thi Thuong Huyen ; Coustaty, Mickaël ; Ogier, Jean-Marc

  • Author_Institution
    L3i Lab., La Rochelle, France
  • fYear
    2011
  • fDate
    18-21 Sept. 2011
  • Firstpage
    349
  • Lastpage
    353
  • Abstract
    This paper proposes an approach to process drop cap images - images of decorated letter that begin chapters of old documents that are preserved in libraries, museums - in the domain of characterization, classification and indexing of old documents. The originality of our proposal is based on the fact that we do not try to extract the letter of drop caps but to classify the drop caps according to period, author and style. The drop caps are characterized by using relevant visual features such as length, thickness, orientation, complexity and change of direction on their primitive elements: strokes. The purpose of this approach is to efficiently extract information embedded in the drop caps for the classification and the indexing of old documents. These new visual features based on bags of strokes are more easily calculable and generally applicable than texture or shape features. Experiments based on characterization, classification and indexing phases demonstrate the performance of our propositions and the advances that they represent in terms of content-based drop caps retrieval.
  • Keywords
    content-based retrieval; document handling; image classification; complexity features; content-based drop caps retrieval; direction change; drop cap image processing; drop caps classification; drop caps indexing; length features; old documents; orientation features; stroke bag approach; thickness features; visual features; Feature extraction; Image edge detection; Indexing; Shape; Skeleton; Vectors; Visualization; characterization; classification and indexing; codewords; drop caps; feature detection; old documents;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2011 International Conference on
  • Conference_Location
    Beijing
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4577-1350-7
  • Electronic_ISBN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2011.78
  • Filename
    6065333