(Enter summary)
Abstract: Consider the task of exploring the Web in order to find pages of a particular kind or on a particular topic. This task arises in the construction of domain-specific search engines. A selective, directed web spider can be much more efficient than a spider that gathers new pages indiscriminantly. This paper argues that the creation of efficient web spiders is best framed and solved by reinforcement learning, a branch of machine learning that concerns itself with optimal sequential decision... (Update)
Context of citations to this paper: More
.... first search [12] pagerank [4,8,15] focused crawling [17,10] shark search [11] adaptive agent [2,3,9] reinforcement learning [6], artificial life agent[13] arbitrary predicate [1] WTMS [18,19] etc. The main purpose of these algorithms is to gather the most relevant...
Cited by: More
Learnable Crawling: An Efficient Approach to.. - Angkawattanawit.. (2002)
(Correct)
Similar documents (at the sentence level): More
61.2%: Efficient Web Spidering with Reinforcement Learning - Rennie, McCallum (1999)
(Correct)
11.6%: Using Reinforcement Learning to Spider the Web Efficiently - Rennie, McCallum (1999)
(Correct)
8.2%: Building Domain-Specific Search Engines with Machine .. - McCallum, Nigam.. (1999)
(Correct)
Active bibliography (related documents): More All
0.2: Text Classification from Labeled and Unlabeled.. - Nigam, Mccallum.. (1999)
(Correct)
0.2: Accountability in a Computerized Society - Nissenbaum
(Correct)
0.2: Segmentation of Range Images Via Data Fusion and.. - Baccar, Gee.. (1996)
(Correct)
Similar documents based on text: More All
0.5: A Machine Learning Approach to Building.. - McCallum, Nigam.. (1999)
(Correct)
BibTeX entry: (Update)
J. Rennie and A. McCallum. "Efficient Web Spidering with Reinforcement Learning." In Proceedings of the 16th International Conference on Machine Learning, 1999. http://citeseer.ist.psu.edu/rennie99efficient.html More
@misc{ rennie-efficient,
author = "Jason Rennie and Andrew {McCallum}",
title = "Efficient Web Spidering with Reinforcement Learning",
url = "citeseer.ist.psu.edu/rennie99efficient.html" }
Citations (may not include all citations):
976
Machine Learning (context) - Mitchell - 1997
374
Reinforcement learning: A survey
- Kaelbling, Littman et al. - 1996
189
Webwatcher: A tour guide for the World Wide Web
- Joachims, Freitag et al. - 1997
103
at forty: The independence assumption in information retriev.. (context) - Lewis - 1998
29
Text classication from labeled and unlabeled documents using..
- Nigam, McCallum et al. - 1999
7
Building domain-specic search engines with machine learning ..
- McCallum, Nigam et al. - 1999
4
Elements of Information Theory (context) - Networks, Systems et al. - 1991
2
ARACHNID: Adaptive retrieval agents choosing heuristic neigh.. (context) - AAAI-, on et al. - 1997
2
and Shoham (context) - Balabanovic - 1995
1
Regression using classication algorithms (context) - Learning, Torgo et al. - 1997
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC