See this document in CiteSeerX!

Query-Based Sampling of Text Databases (1999)  (Make Corrections)  (34 citations)
Jamie Callan, Margaret Connell
Information Systems



  Home/Search   Context   Related

 
View or download:
cmu.edu/~callan/Papers/tois01a.ps.gz
umass.edu/pubfiles/ir180.pdf
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  cmu.edu/~callan/Papers/ (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: The proliferation of searchable texts... This paper presents query-based sampling, a new technique for acquiring accurate resource descriptions. Query-based sampling does not require the cooperation of resource providers nor does it require that resource providers use a particular search engine or representation technique. An extensive set of experimental results demonstrates that accurate resource descriptions are created, that computation and communication costs are reasonable, and that the... (Update)

Similar documents based on text:   More   All
0.5:   Distributed Information Retrieval With Skewed Database Size.. - Si, Lu, Callan   (Correct)
0.4:   Ist Rd Project - Shared-Cost Rtd Project   (Correct)
0.3:   A Semisupervised Learning Method to Merge Search Engine Results - Si, Callan (2003)   (Correct)

BibTeX entry:   (Update)

Callan, J. and Connell, M. (1999). Query-based sampling of text databases. Technical Report IR-180, Center for Intelligent Information Retrieval, Department of Computer Science, University of Massachusetts. http://citeseer.ist.psu.edu/callan99querybased.html   More

@article{ callan01querybased,
    author = "James P. Callan and Margaret E. Connell",
    title = "Query-based sampling of text databases",
    journal = "Information Systems",
    volume = "19",
    number = "2",
    pages = "97--130",
    year = "2001",
    url = "citeseer.ist.psu.edu/callan99querybased.html" }
Citations (may not include all citations):
411   Freenet: A distributed anonymous information storage and ret.. - Clarke, Sandberg et al. - 2000
193   Searching distributed collections with inference networks - Callan, Lu et al. - 1995
110   Generalizing GLOSS to vector-space databases and broker hier.. - Gravano, Garc - 1995
109   The automatic creation of literature abstracts (context) - Luhn - 1958
102   Inference Networks for Document Retrieval - Turtle - 1990
101   Evaluation of an inference network-based retrieval model (context) - Turtle, Croft - 1991
92   STARTS Stanford proposal for Internet meta-searching - Gravano, Chang et al. - 1997
73   The art of scientic computing (context) - Press, Flannery et al. - 1992
61   TREC and TIPSTER experiments with INQUERY - Callan, Croft et al. - 1995
59   A decision-theoretic approach to database selection in netwo.. - Fuhr - 1999
56   Cluster-based language models for distributed retrieval - Xu, Croft - 1999
53   Comparing the performance of database selection algorithms - French, Powell et al. - 1999
53   HyPursuit: A hierarchical network search engine that exploit.. (context) - Weiss, Velez et al. - 1996
49   Learning collection fusion strategies (context) - Voorhees, Gupta et al. - 1995
49   Automatic discovery of language models for text databases - Callan, Connell et al. - 1999

[Article contains additional citations not shown here]



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://www.cs.cmu.edu/~callan/Papers/):   More
INQUERY Does Battle with TREC-6 - Allan, Callan, Croft, Ballesteros.. (1998)   (Correct)
Comparing the Performance of Database Selection.. - French, Powell, Callan, .. (1999)   (Correct)
Recent Experiments with INQUERY - Allan (1995)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC