DocumentCode :
1412636
Title :
Fundamental limits for information retrieval
Author :
Coffey, John T. ; Klimesh, Matthew
Author_Institution :
Texas Instrum., Santa Rosa, CA, USA
Volume :
46
Issue :
7
fYear :
2000
fDate :
11/1/2000 12:00:00 AM
Firstpage :
2281
Lastpage :
2298
Abstract :
The fundamental limits of performance for a general model of information retrieval from databases are studied. In the scenarios considered, a large quantity of information is to be stored on some physical storage device. Requests for information are modeled as a randomly generated sequence with a known distribution. The requests are assumed to be “context-dependent,” i.e., to vary according to the sequence of previous requests. The state of the physical storage device is also assumed to depend on the history of previous requests. In general, the logical structure of the information to be stored does not match the physical structure of the storage device, and consequently there are nontrivial limits on the minimum achievable average access times, where the average is over the possible sequences of user requests. The paper applies basic information-theoretic methods to establish these limits and demonstrates constructive procedures that approach them, for a wide class of systems. Allowing redundancy greatly lowers the achievable access times, even when the amount added is an arbitrarily small fraction of the total amount of information in the database. The achievable limits both with and without redundancy are computed; in the case where redundancy is allowed the limits essentially coincide with lower limits for more general storage systems
Keywords :
database theory; information retrieval; information theory; redundancy; context-dependent requests; databases; distribution; general model; general storage systems; information retrieval; information-theoretic methods; logical structure; minimum achievable average access times; performance limits; previous requests history; randomly generated sequence; redundancy; requests; storage device; Communication systems; Conferences; History; Image databases; Image storage; Information retrieval; Information science; Information theory; Instruments; Software libraries;
fLanguage :
English
Journal_Title :
Information Theory, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9448
Type :
jour
DOI :
10.1109/18.887844
Filename :
887844
Link To Document :
بازگشت