See this document in CiteSeerX!

Three-Level Caching for Efficient Query Processing in Large Web Search Engines (2005)  (Make Corrections)  
Xiaohui Long, Torsten Suel



  Home/Search   Context   Related

 
View or download:
www2005.org/cdrom/docs/p257.pdf
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  www2005.org/cdrom/contents (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Large web search engines have to answer thousands of queries per second with interactive response times. Due to the sizes of the data sets involved, often in the range of multiple terabytes, a single query may require the processing of hundreds of megabytes or more of index data. To keep up with this immense workload, large search engines employ clusters of hundreds or thousands of machines, and a number of techniques such as caching, index compression, and index and query pruning are used to... (Update)

Active bibliography (related documents):   More   All
3.4:   Three-Level Caching for Efficient Query Processing in Large Web .. - Long, Suel (2005)   (Correct)
0.9:   Optimized Query Execution in Large Search Engines with Global.. - Long, Suel (2003)   (Correct)
0.9:   ODISSEA: A Peer-to-Peer Architecture for Scalable.. - Suel, Mathur, Wu, .. (2003)   (Correct)

Similar documents based on text:   More   All
0.5:   Interactive Wrapper Generation with Minimal User Effort - Irmak, Suel (2003)   (Correct)
0.4:   Hierarchical Substring Caching for Efficient Content.. - Irmak, Suel (2005)   (Correct)
0.3:   Server-Friendly Delta Compression for Efficient Web Access - Savant, Suel (2003)   (Correct)

BibTeX entry:   (Update)

@misc{ long-threelevel,
  author = "Xiaohui Long and Torsten Suel",
  title = "Three-Level Caching for Efficient Query Processing in Large Web Search
    Engines",
  url = "citeseer.ist.psu.edu/article/long05threelevel.html" }
Citations (may not include all citations):
4212   Computers and Intractability: A Guide to the Theory of NP Co.. (context) - Garey, Johnson - 1979
641   The anatomy of a large-scale hypertextual web search engine - Brin, Page - 1998
372   Modern Information Retrieval - Baeza-Yates, Ribeiro-Neto - 1999
298   Cost-aware WWW proxy caching algorithms - Cao, Irani - 1997
280   Managing Gigabytes: Compressing and Indexing Documents and I.. - Witten, Moffat et al. - 1999
149   Combining fuzzy information from multiple systems - Fagin - 1996
82   Topic-sensitive pagerank - Haveliwala - 2002
75   Lessons from giant scale services - Brewer - 2001
70   Optimal aggregation algorithms for middleware - Fagin, Lotem et al. - 2001
67   the resemblance and containment of documents - Broder - 1997
46   Optimizing queries over multimedia repositories - Chaudhuri, Gravano - 1996
38   The intelligent surfer: Probabilistic combination of link an.. - Richardson, Domingos - 2002
30   Design and implementation of a high-performance distributed .. - Shkapenyuk, Suel - 2002
25   line file caching - Young - 1998
25   Building a distributed full-text index for the web - Melnik, Raghavan et al. - 2000
23   Filtered document retrieval with frequency-sorted indexes - Persin, Zobel et al. - 1996
20   ODISSEA: A peer-to-peer architecture for scalable web search.. - Suel, Mathur et al. - 2003
18   Performance of inverted indices in distributed text document.. - Tomasic, Garcia-Molina - 1993
18   Predictive caching and prefetching of query results in searc.. (context) - Lempel, Moran - 2003
14   Search engines and web dynamics - Risvik, Michelsen - 2002
13   Efficient peer-to-peer searches using result-caching - Bhattacharjee, Chawathe et al. - 2003
12   Compression of inverted indexes for fast query evaluation - Scholer, Williams et al. - 2002
12   Compressed inverted files with reduced decoding overheads (context) - Anh, Moffat - 1998
11   Distributed query processing using partitioned inverted file.. (context) - Badue, Baeza-Yates et al. - 2002
11   Locality in search engine queries and its implications for c.. - Xie, O'Hallaron - 2002
11   Optimizing result prefetching in web search engines with seg.. - Lempel, Moran - 2002
11   On caching search engine query results - Markatos - 2000
10   Static index pruning for information retrieval systems - Fagin, Carmel et al. - 2001
10   Vector-space ranking with effective early termination (context) - Anh, Kretser et al. - 2001
9   ACM Transactions on Internet Technologies (context) - Arasu, Cho et al. - 2001
8   Efficient passage ranking for document databases - Kaszkiel, Zobel et al. - 1999
7   Adaptive set intersections (context) - Demaine, Lopez-Ortiz et al. - 2000
7   the feasibility of peer-to-peer web indexing (context) - Li, Loo et al. - 2003
6   Rank-preserving two-level caching for scalable search engine.. (context) - Saraiva, de Moura et al. - 2001
5   Outperforming LRU with an adaptive replacement cache - Megiddo, Modha - 2004
5   Optimized query execution in large search engines with globa.. - Long, Suel - 2003
4   Efficient phrase querying with an auxiliary index - Bahle, Williams et al. - 2002
3   Interaction of query evaluation and buffer management for in.. (context) - Jonsson, Franklin et al. - 1998
2   Multi-tier architecture for web search engines (context) - Risvik, Aasheim et al. - 2003

Documents on the same site (http://www.www2005.org/cdrom/contents.htm):   More
Sampling Search-Engine Results - Anagnostopoulos, Broder, Carmel (2005)   (Correct)
Incremental Maintenance for Materialized XPath/XSLT Views - Onizuka, Chan, Michigami, .. (2005)   (Correct)
PageRank as a Function of the Damping Factor - Boldi, Santini, Vigna (2005)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC