DocumentCode :
1953426
Title :
A compression method for inverted index and its FPGA-based decompression solution
Author :
Yan, Jing ; Xu, Ningyi ; Xia, Zenglin ; Luo, Rong ; Hsu, Feng-Hsiung
Author_Institution :
Microsoft Res. Asia, Beijing, China
fYear :
2010
fDate :
8-10 Dec. 2010
Firstpage :
261
Lastpage :
264
Abstract :
Reconfigurable computing based on FPGAs is a promising way to accelerate web search engine applications. Because these applications are data-intensive, data compression has become much more important. This paper proposes a data compression method for inverted indices that combines a bit-level compression method (Huffman coding) with a coarse-grained compression method to balance compression ratio against decompression speed. Because an inverted index is compressed only once, compression speed is not the primary metric for a compression method. The proposed method achieves a compression ratio of up to 21.61% on inverted indices from a commercial search engine, which is better than the results of other existing compression methods. We also develop an efficient FPGA-based hardware decompression module that can provide up to 996 MBps of input bandwidth for the accelerator system.
Keywords :
Huffman codes; Internet; data compression; field programmable gate arrays; information retrieval; search engines; FPGA-based decompression solution; FPGA-based hardware decompression module; Web search engines; bit-level compression method-Huffman coding; coarse-grained compression method; data compression method; inverted index; reconfigurable computing; Bandwidth; Hardware; Huffman coding; Indexes; Query processing; Search engines;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Field-Programmable Technology (FPT), 2010 International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-8980-0
Type :
conf
DOI :
10.1109/FPT.2010.5681488
Filename :
5681488