DocumentCode :
2886359
Title :
Figure Retrieval in Biomedical Literature
Author :
Deepak, K.S. ; Rai, Harikrishna G. N. ; Radhakrishna, P.
Author_Institution :
Infosys Labs., Infosys Ltd., Bangalore, India
fYear :
2012
fDate :
10-10 Dec. 2012
Firstpage :
25
Lastpage :
32
Abstract :
Automatic classification of figures present in healthcare documents is known to be useful for biomedical document mining. The context of a document is directly reflected in the figures present within them. Embedded text within these figures along with image features have been used for figure retrieval. We demonstrate that image features based on structural properties of figures alone is sufficient for the figure retrieval task. An algorithm for describing structural properties of the embedded images, Fourier Edge Orientation Autocorrelogram, which utilizes spatial distribution of detected edges, is presented. We have shown that Fourier Edge Orientation Autocorrelogram performs better than its predecessor, when most of the edge information is retained. The algorithm is validated on publicly available figures from healthcare literature. Apart from invariance to scale, rotation and non-uniform illumination, the proposed feature descriptor is also shown to be relatively robust to noisy edges. Since there is no standard dataset available for figure classification, comparison of the proposed feature descriptor with four well known binary shape descriptors is demonstrated. The retrieval performance shows an overall improvement over other known methods in figure retrieval task.
Keywords :
Fourier analysis; bioinformatics; correlation methods; data mining; document handling; edge detection; feature extraction; health care; image retrieval; text analysis; Fourier edge orientation autocorrelogram; automatic figure classification; binary shape descriptors; biomedical document mining; biomedical literature; edge detection; edge information; embedded text; feature descriptor; figure classification; figure retrieval; figure structural properties; healthcare documents; healthcare literature; image features; noisy edges; nonuniform illumination; publicly available figures; retrieval performance; rotation invariance; scale invariance; spatial distribution; structural properties; Biomedical imaging; Image edge detection; Lighting; Medical services; Noise measurement; Robustness; Shape; algorithms; feature selection; intelligent bioinformatics and biomedical systems; intelligent information and multimedia systems; preprocessing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshops (ICDMW), 2012 IEEE 12th International Conference on
Conference_Location :
Brussels
Print_ISBN :
978-1-4673-5164-5
Type :
conf
DOI :
10.1109/ICDMW.2012.91
Filename :
6406419
Link To Document :
بازگشت