See this document in CiteSeerX!

Optimized Query Execution in Large Search Engines with Global Page Ordering (2003)  (Make Corrections)  (5 citations)
Xiaohui Long, Torsten Suel



  Home/Search   Context   Related

 
View or download:
poly.edu/~suel/papers/order.pdf
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  poly.edu/~suel/ (more)
Homepages:  X.Long  

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Large web search engines have to answer thousands of queries per second with interactive response times. A major factor in the cost of executing a query is given by the lengths of the inverted lists for the query terms, which increase with the size of the document collection and are often in the range of many megabytes. To address this issue, IR and database researchers have proposed pruning techniques that compute or approximate term-based ranking functions without scanning over the full... (Update)

Cited by:   More
An Efficient and Versatile Query Engine for TopX Search - Theobald, Schenkel, Weikum (2005)   (Correct)
Three-Level Caching for Efficient Query Processing in Large Web .. - Long, Suel (2005)   (Correct)
Efficiency-Quality Tradeoffs for Vector Score Aggregation - Singitham, Mahabhashyam.. (2004)   (Correct)

Active bibliography (related documents):   More   All
1.2:   ODISSEA: A Peer-to-Peer Architecture for Scalable.. - Suel, Mathur, Wu, .. (2003)   (Correct)
0.4:   Efficient Query Evaluation on Large Textual Collections in a.. - Zhang, Suel (2005)   (Correct)
0.3:   Optimal Aggregation Algorithms for Middleware - Fagin, Lotem, Naor (2001)   (Correct)

Similar documents based on text:   More   All
0.5:   I/O-Efficient Techniques for Computing Pagerank - Chen, Gan, Suel   (Correct)
0.5:   Design and Implementation of a High-Performance Distributed.. - Shkapenyuk, Suel (2001)   (Correct)
0.4:   Compressing the Graph Structure of the Web - Suel, Yuan (2001)   (Correct)

Related documents from co-citation:   More   All
4:   Optimal aggregation algorithms for middleware - Fagin, Lotem et al. - 2001
4:   Managing gigabytes: compressing and indexing documents and images - Witten, Moffat et al. - 1994
3:   Filtered document retrieval with frequency-sorted indexes - Persin, Zobel et al. - 1996

BibTeX entry:   (Update)

X. Long and T. Suel. Optimized Query Execution in Large Search Engines with Global Page Ordering. In VLDB'03, 2003. http://citeseer.ist.psu.edu/long03optimized.html   More

@misc{ long03optimized,
  author = "X. Long and T. Suel",
  title = "Optimized Query Execution in Large Search Engines with Global Page Ordering",
  text = "X. Long and T. Suel. Optimized Query Execution in Large Search Engines
    with Global Page Ordering. In VLDB'03, 2003.",
  year = "2003",
  url = "citeseer.ist.psu.edu/long03optimized.html" }
Citations (may not include all citations):
641   The anatomy of a large-scale hypertextual web search engine - Brin, Page - 1998
576   Authoritative sources in a hyperlinked environment - Kleinberg - 1998
372   Modern Information Retrieval - Baeza-Yates, Ribeiro-Neto - 1999
344   The pagerank citation ranking: Bringing order to the web - Page, Brin et al. - 1999
280   Managing Gigabytes: Compressing and Indexing Documents and I.. - Witten, Moffat et al. - 1999
149   Combining fuzzy information from multiple systems - Fagin - 1996
82   Topic-sensitive pagerank - Haveliwala - 2002
75   Lessons from giant scale services - Brewer - 2001
70   Optimal aggregation algorithms for middleware - Fagin, Lotem et al. - 2001
67   the resemblance and containment of documents - Broder - 1997
46   Optimizing queries over multimedia repositories - Chaudhuri, Gravano - 1996
44   Optimization of inverted vector searches (context) - Buckley, Lewit - 1985
38   The intelligent surfer: Probabilistic combination of link an.. - Richardson, Domingos - 2002
37   Breadth-first search crawling yields high-quality pages - Najork, Wiener - 2001
36   Retrieving records from a gigabyte of text on a minicomputer.. (context) - Harman, Candela - 1990
35   queries over web-accessible databases - Bruno, Gravano et al. - 2002
34   Optimizing multifeature queries in image databases (context) - Guntzer, Balke et al. - 2000
32   The Stochastic Approach for Link-Structure Analysis (context) - Lempel, Moran - 2000
31   Finding authorities and hubs from link structures on the Wor.. - Borodin, Roberts et al. - 2001
30   Design and implementation of a high-performance distributed .. - Shkapenyuk, Suel - 2002
27   Automatic resource list compilation by analyzing hyperlink s.. - Chakrabarti, Dom et al. - 1998
26   Query processing issues in image (context) - Nepal, Ramakrishna - 1999
25   Building a distributed full-text index for the web - Melnik, Raghavan et al. - 2000
23   Filtered document retrieval with frequency-sorted indexes - Persin, Zobel et al. - 1996
20   ODISSEA: A peer-to-peer architecture for scalable web search.. - Suel, Mathur et al. - 2003
18   Performance of inverted indices in distributed text document.. - Tomasic, Garcia-Molina - 1993
17   Evaluating the performance of distributed architectures for .. - Cahoon, McKinley et al. - 2000
15   Information Processing and Management (context) - Turtle, Flood et al. - 1995
14   Search engines and web dynamics - Risvik, Michelsen - 2002
14   Implementations of partial document ranking using inverted f.. - Wong, Lee - 1993
13   Using fagin's algorithm for merging ranked results in multim.. - Wimmers, Haas et al. - 1999
13   Inverted file partitioning schemes in multiple disk systems - Jeong, Omiecinski - 1995
12   Compressed inverted files with reduced decoding overheads (context) - Anh, Moffat - 1998
11   Distributed query processing using partitioned inverted file.. (context) - Badue, Baeza-Yates et al. - 2002
11   Optimizing result prefetching in web search engines with seg.. - Lempel, Moran - 2002
10   Static index pruning for information retrieval systems - Fagin, Carmel et al. - 2001
10   Vector-space ranking with effective early termination (context) - Anh, Kretser et al. - 2001
9   Combining fuzzy information: an overview - Fagin - 2002
9   ACM Transactions on Internet Technologies (context) - Arasu, Cho et al. - 2001
7   Towards efficient multi-feature queries in heterogeneous env.. - Guntzer, Balke et al. - 2001
5   Scalable Distributed Architectures For Information Retrieval - Lu - 1999



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://cis.poly.edu/~suel/):   More
zdelta: An Efficient Delta Compression Tool - Trendafilov, Memon, Suel (2002)   (Correct)
On The Scalability Of An Image Transcoding Proxy Server - Savant, Memon, Suel (2003)   (Correct)
Interactive Wrapper Generation with Minimal User Effort - Irmak, Suel (2003)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC