DocumentCode :
2425531
Title :
Indexing for linear model-based information retrieval
Author :
Chang, Yuan-Chi ; Li, Chung-Sheng
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Volume :
1
fYear :
2000
fDate :
2000
Firstpage :
359
Abstract :
This paper describes the Onion technique, a special indexing structure for linear optimization queries. Linear optimization queries ask for top-N records subject to the maximisation or minimization of a linearly weighted sum of record attribute values. Such a query appears in many applications employing linear models and is an effective way of summarising representative cases, such as the top-50 ranked colleges. Onion indexing is based on a geometric property of a convex hull, which guarantees that the optimal value can always be found at one or more of its vertices. Onion indexing makes use of this property to construct convex hulls in layers with outer layers enclosing inner layers geometrically. A data record is indexed by its layer number or equivalently its depth in the layered convex hull. Queries with linear weightings issued at run time are evaluated from the outmost layer inwards. We show experimentally that Onion indexing achieves orders of magnitude speedup against sequential linear scan when N is small compared to the cardinality of the set. The Onion technique also enables progressive retrieval, which processes and returns ranked results in a progressive manner. Furthermore, the proposed indexing can be extended into a hierarchical organization of data to accommodate both global and local queries
Keywords :
indexing; information retrieval; optimisation; Onion indexing; convex hull; data record; geometric property; global queries; linear model-based information retrieval; linear optimization queries; linear weightings; linearly weighted record attribute value sum; local queries; maximisation; minimization; progressive retrieval; sequential linear scan; top-N records; vertices; Cities and towns; Databases; Diseases; Educational institutions; Indexing; Information retrieval; Nearest neighbor searches; Neural networks; Q factor; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2000. ICME 2000. 2000 IEEE International Conference on
Conference_Location :
New York, NY
Print_ISBN :
0-7803-6536-4
Type :
conf
DOI :
10.1109/ICME.2000.869615
Filename :
869615
Link To Document :
بازگشت