• DocumentCode
    2149093
  • Title

    Recognizing Text Elements for SVG Comic Compression and Its Novel Applications

  • Author

    Su, Chung-Yuan ; Chang, Ray-I ; Liu, Jen-Chang

  • Author_Institution
    Dept. of Eng. Sci., Nat. Taiwan Univ., Taipei, Taiwan
  • fYear
    2011
  • fDate
    18-21 Sept. 2011
  • Firstpage
    1329
  • Lastpage
    1333
  • Abstract
    SVG (scalable vector graphics) has become the standard format for 2D graphics in HTML5. Although some image-to-SVG conversion systems had been proposed, the sizes of files they produced are still large. In [1], we proposed a new system to convert raster comic images into vector SVG files. The compression ratio is better than the previous methods. However, these methods do not process text in raster images. In this paper, we improve our system to recognize text elements in the comic and use these text elements to provide better compression and novel applications. The proposed method uses SCW (sliding concentric windows) and SVM (support vector machine) to identify text regions. Then, OCR (optical character recognition) is applied to recognize text elements in those regions. Instead of encoding the text regions as vectors, the text elements are embedded in the SVG file along with their coordinate values. Experimental results show that we can reduce the file sizes to about 52% of the original SVG files. Using these text elements, we can translate comics into other languages to provide multilingual services easily. Text/content-based image search can be supported efficiently. It can also provide a novel application system for story teller.
  • Keywords
    computer graphics; data compression; document image processing; support vector machines; text analysis; 2D graphics format; HTML5; SCW; SVG comic compression; SVM; comic images; ratio compression; scalable vector graphics; sliding concentric windows; support vector machine; text elements; text elements recognition; text process; Image coding; Image color analysis; Optical character recognition software; Rendering (computer graphics); Text recognition; Vectors; SCW segmentation; SVG; text detection; vector compression;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2011 International Conference on
  • Conference_Location
    Beijing
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4577-1350-7
  • Electronic_ISBN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2011.267
  • Filename
    6065526