• DocumentCode
    2010891
  • Title

    A New Method for Arbitrarily-Oriented Text Detection in Video

  • Author

    Sharma, Nabin ; Shivakumara, Palaiahnakote ; Pal, Umapada ; Blumenstein, Michael ; Tan, Chew Lim

  • Author_Institution
    Griffith Univ., Brisbane, QLD, Australia
  • fYear
    2012
  • fDate
    27-29 March 2012
  • Firstpage
    74
  • Lastpage
    78
  • Abstract
    Text detection in video frames plays a vital role in enhancing the performance of information extraction systems because the text in video frames helps in indexing and retrieving video efficiently and accurately. This paper presents a new method for arbitrarily-oriented text detection in video, based on dominant text pixel selection, text representatives and region growing. The method uses gradient pixel direction and magnitude corresponding to Sobel edge pixels of the input frame to obtain dominant text pixels. Edge components in the Sobel edge map corresponding to dominant text pixels are then extracted and we call them text representatives. We eliminate broken segments of each text representatives to get candidate text representatives. Then the perimeter of candidate text representatives grows along the text direction in the Sobel edge map to group the neighboring text components which we call word patches. The word patches are used for finding the direction of text lines and then the word patches are expanded in the same direction in the Sobel edge map to group the neighboring word patches and to restore missing text information. This results in extraction of arbitrarily-oriented text from the video frame. To evaluate the method, we considered arbitrarily-oriented data, non-horizontal data, horizontal data, Hua´s data and ICDAR-2003 competition data (Camera images). The experimental results show that the proposed method outperforms the existing method in terms of recall and f-measure.
  • Keywords
    edge detection; feature extraction; text detection; video retrieval; Camera images; Hua data; ICDAR-2003 competition data; Sobel edge map extraction; Sobel edge pixels; arbitrarily-oriented data; arbitrarily-oriented text detection; broken segments; dominant text pixel selection; edge components; f-measure; gradient pixel direction; information extraction system; missing text information restoration; neighboring text component grouping; neighboring word patch grouping; nonhorizontal data; region growing; text line direction; text representatives; video frames; video indexing; video retrieval; word patches; Cameras; Classification algorithms; Educational institutions; Feature extraction; Image edge detection; Image resolution; Pattern recognition; Angular region growing; Arbitrarily-oriented text detection; Dominant text pixels; Gradient direction; Video text frame; Video text representative;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis Systems (DAS), 2012 10th IAPR International Workshop on
  • Conference_Location
    Gold Cost, QLD
  • Print_ISBN
    978-1-4673-0868-7
  • Type

    conf

  • DOI
    10.1109/DAS.2012.6
  • Filename
    6195338