Alternate document:   Details   Using Reinforcement Learning to Spider the Web Efficiently (99) Jason Rennie, Andrew McCallum

See this document in CiteSeerX!

Using Reinforcement Learning to Spider the Web Efficiently (1999)  (Make Corrections)  (41 citations)
Jason Rennie, Andrew Kachites McCallum
Proceedings of ICML-99, 16th International Conference on Machine Learning



  Home/Search   Context   Related

 
View or download:
mit.edu/~jrennie/papers/icml99.ps.gz
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  mit.edu/~jrennie/ (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Consider the task of exploring the Web in order to find pages of a particular kind or on a particular topic. This task arises in the construction of search engines and Web knowledge bases. This paper argues that the creation of efficient web spiders is best framed and solved by reinforcement learning, a branch of machine learning that concerns itself with optimal sequential decision making. (Update)

Similar documents based on text:   More   All
3.0:   Using Reinforcement Learning to Spider the Web Efficiently - Rennie, McCallum (1999)   (Correct)
1.8:   Efficient Web Spidering with Reinforcement Learning - Rennie, McCallum (1999)   (Correct)
0.8:   A Machine Learning Approach to Building.. - McCallum, Nigam.. (1999)   (Correct)

BibTeX entry:   (Update)

Jason Rennie and Andrew McCallum. Using reinforcement learning to spider the Web efficiently. In ICML-99, 1999. http://citeseer.ist.psu.edu/rennie99using.html   More

@inproceedings{ rennie99using,
    author = "Jason Rennie and Andrew Kachites McCallum",
    title = "Using reinforcement learning to spider the {W}eb efficiently",
    booktitle = "Proceedings of {ICML}-99, 16th International Conference on Machine Learning",
    publisher = "Morgan Kaufmann Publishers, San Francisco, US",
    address = "Bled, SL",
    editor = "Ivan Bratko and Saso Dzeroski",
    pages = "335--343",
    year = "1999",
    url = "citeseer.ist.psu.edu/rennie99using.html" }
Citations (may not include all citations):
976   Machine Learning (context) - Mitchell - 1997
408   Princeton University Press (context) - Bellman - 1957
189   Webwatcher: A tour guide for the World Wide Web - Joachims, Freitag et al. - 1997
149   Learning to extract symbolic knowledge from the world wide w.. - Craven, DiPasquo et al. - 1998
103   at forty: The independence assumption in information retriev.. (context) - Lewis, Bayes - 1998
81   Reinforcement learning: A survey (context) - Kaelbling, Littman et al. - 1996
40   ARACHNID: Adaptive retrieval agents choosing heuristic neigh.. - Menczer - 1997
36   A machine learning architecture for optimizing web search en.. - Boyan, Freitag et al. - 1996
32   A comparison of event models for naive Bayes text classi cat.. (context) - McCallum, Nigam - 1998
10   Ecient crawling through URL ordering - Cho, Garcia-Molina et al. - 1998
7   Building domain-speci c search engines with machine learning.. - McCallum, Nigam et al. - 1999
5   Improving text clasi cation by shrinkage in a hierarchy of c.. (context) - McCallum, Rosenfeld et al. - 1998
3   Regression using classi cation algorithms (context) - Torgo, Gama - 1997



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://www.ai.mit.edu/~jrennie/):
Building Domain-Specific Search Engines with Machine .. - McCallum, Nigam.. (1999)   (Correct)
Using Reinforcement Learning to Spider the Web Efficiently - Rennie, McCallum (1999)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC