Title :
Text Segmentation from Complex Background Using Sparse Representations
Author :
Pan, W.M. ; Bui, T.D. ; Suen, C.Y.
Author_Institution :
Concordia Univ., Montreal
Abstract :
This paper presents a novel method for segmenting text from a complex background. The idea is inspired by recent developments in finding sparse signal representations over a family of over-complete atoms, called a dictionary. We assume that the image under investigation is composed of two components: the foreground text and the complex background. We further assume that the background can be modeled as a piecewise smooth function. We then choose two dictionaries: the first gives a sparse representation of one component and a non-sparse representation of the other, while the second does the opposite. By seeking the sparse representation in each dictionary, we decompose the image into its two components. Text segmentation is then achieved by applying simple thresholding to the text component. Preliminary experiments show promising results.
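The two-dictionary decomposition described in the abstract can be illustrated with a small sketch. The abstract does not name the dictionaries or the solver, so everything below is an assumption made for illustration: a 1-D toy signal stands in for the image, an orthonormal DCT dictionary stands in for the smooth background, the identity (spike) dictionary stands in for the text, and the solver is a simple alternating soft-thresholding loop with a decreasing threshold. The function names `dct_matrix`, `soft_threshold`, and `decompose` are hypothetical and do not come from the paper.

```python
import numpy as np

def dct_matrix(n):
    """Orthonormal DCT-II matrix; each row is one cosine atom (assumed background dictionary)."""
    k = np.arange(n)[:, None]
    t = np.arange(n)[None, :]
    d = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * t + 1) * k / (2 * n))
    d[0, :] /= np.sqrt(2.0)
    return d

def soft_threshold(x, lam):
    """Shrink every entry of x toward zero by lam."""
    return np.sign(x) * np.maximum(np.abs(x) - lam, 0.0)

def decompose(signal, n_iter=100):
    """Split signal into a part sparse in the DCT and a part sparse in the sample domain."""
    n = signal.size
    D = dct_matrix(n)
    background = np.zeros(n)
    text = np.zeros(n)
    lam_max = max(np.abs(D @ signal).max(), np.abs(signal).max())
    # Lower the threshold gradually so the strongest atoms are assigned first.
    for lam in np.linspace(lam_max, 0.0, n_iter):
        # Background update: keep only large DCT coefficients of the residual.
        background = D.T @ soft_threshold(D @ (signal - text), lam)
        # Text update: keep only large samples of the residual (identity dictionary).
        text = soft_threshold(signal - background, lam)
    return background, text

# Toy data (assumed): two low-frequency cosines plus four spikes playing the role of text.
n = 128
D = dct_matrix(n)
true_background = 4.0 * D[2] + 3.0 * D[5]
spike_positions = [20, 21, 64, 100]
true_text = np.zeros(n)
true_text[spike_positions] = 3.0
signal = true_background + true_text

background, text = decompose(signal)

# Simple thresholding of the text component gives the segmentation mask.
mask = text > 0.5 * text.max()
print(np.flatnonzero(mask))
```

The design choice worth noting is that each dictionary is cheap to apply and represents only one component sparsely: cosines cannot represent an isolated spike with few coefficients, and spikes cannot represent a slow oscillation with few samples. A real image would need 2-D dictionaries suited to the actual background and text, which this sketch does not attempt.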
Keywords :
image representation; image segmentation; smoothing methods; text analysis; foreground text; piecewise smooth function; sparse representations; text segmentation; Computer science; Dictionaries; Filtering algorithms; Image analysis; Image resolution; Image segmentation; Machine intelligence; Pattern recognition; Signal representations; Software engineering;
Conference_Title :
Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
Conference_Location :
Parana
Print_ISBN :
978-0-7695-2822-9
DOI :
10.1109/ICDAR.2007.4378742