• DocumentCode
    935256
  • Title

    A selectivity model for fragmented relations: applied in information retrieval

  • Author

    Blok, Henk Ernst ; Choenni, Sunil ; Blanken, Henk M. ; Apers, Peter M G

  • Author_Institution
    Dept. of Comput. Sci., Twente Univ., Enschede, Netherlands
  • Volume
    16
  • Issue
    5
  • fYear
    2004
  • fDate
    5/1/2004 12:00:00 AM
  • Firstpage
    635
  • Lastpage
    639
  • Abstract
    New application domains cause today´s database sizes to grow rapidly, posing great demands on technology. Data fragmentation facilitates techniques (like distribution, parallelization. and main-memory computing) meeting these demands. Also, fragmentation might help to improve efficient processing of query types such as top N. Database design and query optimization require a good notion of the costs resulting from a certain fragmentation. Our mathematically derived selectivity model facilitates this. Once its two parameters have been computed based on the fragmentation, after each (though usually infrequent) update, our model can forget the data distribution, resulting in fast and quite good selectivity estimation. We show experimental verification for Zipfian distributed IR databases.
  • Keywords
    distributed databases; information retrieval; storage management; Zipfian distributed IR databases; data distribution; data fragmentation; database design; fragmented relations; information retrieval; main-memory computing; mathematically derived selectivity model; query optimization; query types; selectivity model; Computer applications; Concurrent computing; Context modeling; Cost function; Distributed computing; Distributed databases; Information retrieval; Mathematical model; Predictive models; Query processing;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2004.1277824
  • Filename
    1277824