DocumentCode :
1766432
Title :
Towards Large-Scale Histopathological Image Analysis: Hashing-Based Image Retrieval
Author :
Xiaofan Zhang ; Wei Liu ; Dundar, Murat ; Badve, Sunil ; Shaoting Zhang
Author_Institution :
Dept. of Comput. Sci., Univ. of North Carolina at Charlotte, Charlotte, NC, USA
Volume :
34
Issue :
2
fYear :
2015
fDate :
Feb. 2015
Firstpage :
496
Lastpage :
506
Abstract :
Automatic analysis of histopathological images has been widely utilized leveraging computational image-processing methods and modern machine learning techniques. Both computer-aided diagnosis (CAD) and content-based image-retrieval (CBIR) systems have been successfully developed for diagnosis, disease detection, and decision support in this area. Recently, with the ever-increasing amount of annotated medical data, large-scale and data-driven methods have emerged to offer a promise of bridging the semantic gap between images and diagnostic information. In this paper, we focus on developing scalable image-retrieval techniques to cope intelligently with massive histopathological images. Specifically, we present a supervised kernel hashing technique which leverages a small amount of supervised information in learning to compress a 10 thinspace000-dimensional image feature vector into only tens of binary bits with the informative signatures preserved. These binary codes are then indexed into a hash table that enables real-time retrieval of images in a large database. Critically, the supervised information is employed to bridge the semantic gap between low-level image features and high-level diagnostic information. We build a scalable image-retrieval framework based on the supervised hashing technique and validate its performance on several thousand histopathological images acquired from breast microscopic tissues. Extensive evaluations are carried out in terms of image classification (i.e., benign versus actionable categorization) and retrieval tests. Our framework achieves about 88.1% classification accuracy as well as promising time efficiency. For example, the framework can execute around 800 queries in only 0.01 s, comparing favorably with other commonly used dimensionality reduction and feature selection methods.
Keywords :
cancer; decision support systems; diseases; feature selection; image classification; image coding; image retrieval; learning (artificial intelligence); mammography; medical image processing; tumours; actionable categorization; annotated medical data; automatic analysis; binary bits; binary codes; breast microscopic tissues; classification accuracy; computer-aided diagnosis; content-based image-retrieval systems; data-driven methods; database; decision support; diagnostic information; dimensional image feature vector; dimensionality reduction; disease detection; ever-increasing amount; feature selection methods; hashing-based image retrieval; high-level diagnostic information; image classification; informative signature preservation; large-scale histopathological image analysis; large-scale methods; leveraging computational image-processing methods; low-level image features; mammography; massive histopathological images; modern machine learning techniques; retrieval tests; scalable image-retrieval techniques; semantic gap; supervised kernel hashing technique; Breast cancer; Feature extraction; Image retrieval; Kernel; Optimization; Semantics; Vectors; Breast lesion; hashing; high dimension; histopathological image analysis; large-scale image retrieval; supervised learning;
fLanguage :
English
Journal_Title :
Medical Imaging, IEEE Transactions on
Publisher :
ieee
ISSN :
0278-0062
Type :
jour
DOI :
10.1109/TMI.2014.2361481
Filename :
6919336
Link To Document :
بازگشت