Abstract: Content and link information is used by virtually all search engines
to crawl, index, retrieve, and rank Web pages. The correlations
between similarity measures based on these cues and on semantic
associations between pages are crucial in determining the performance
of any search tool. A great deal of research is under way
to understand how to effectively extract semantic information from
Web pages by mining their text and links. A brute force approach
has been used to build semantic maps, that...
Similar documents based on text:
0.3: Exploration versus Exploitation in Topic Driven Crawlers - Pant, Srinivasan, Menczer (2002)
0.2: MySpiders : Evolve your own intelligent Web crawlers - Pant, Menczer (2002)
0.2: Evaluating Topic-Driven Web Crawlers - Menczer, Pant, Srinivasan, Ruiz (2001)
BibTeX entry:
Menczer, F. Semi-Supervised Evaluation of Search Engines via Semantic Mapping. Submitted to WWW'03 (Budapest, Hungary, 2003), ACM Press. http://dollar.biz.uiowa.edu/~fil/Papers/engines.pdf http://citeseer.ist.psu.edu/menczer03semisupervised.html
@misc{ menczer03semisupervised,
author = "F. Menczer",
title = "Semi-Supervised Evaluation of Search Engines via Semantic Mapping",
text = "Menczer, F. Semi-Supervised Evaluation of Search Engines via Semantic Mapping.
Submitted to WWW'03 (Budapest, Hungary, 2003), ACM Press. http://dollar.biz.uiowa.edu/~fil/Papers/engines.pdf",
year = "2003",
url = "citeseer.ist.psu.edu/menczer03semisupervised.html" }
Citations (may not include all citations):
1256: An Introduction to Modern Information Retrieval - Salton, McGill - 1983
641: The anatomy of a large-scale hypertextual Web search engine - Brin, Page - 1998
576: Authoritative sources in a hyperlinked environment - Kleinberg - 1999
416: Information Retrieval - van Rijsbergen - 1979
372: An algorithm for suffix stripping - Porter - 1980
180: Combining labeled and unlabeled data with co-training - Blum, Mitchell - 1998
163: Improved algorithms for topic distillation in hyperlinked en.. - Bharat, Henzinger - 1998
154: Automatic resource compilation by analyzing hyperlink struct.. - Chakrabarti, Dom et al. - 1998
150: Accessibility of information on the Web - Lawrence, Giles - 1999 - http://www.wwwmetrics.com/
106: Inferring Web communities from link topology - Gibson, Kleinberg et al. - 1998
106: Trawling the Web for emerging cyber-communities - Kumar, Raghavan et al. - 1999
82: Topic-sensitive PageRank - Haveliwala - 2002
82: Finding related pages in the World Wide Web - Dean, Henzinger - 1999
72: Finding what people want: Experiences with the WebCrawler - Pinkerton - 1994
61: Mining the Web: Discovering knowledge from hypertext data - Chakrabarti - 2003
[Article contains additional citations not shown here]
Documents on the same site (http://dollar.biz.uiowa.edu/~fil/papers.html):
Latent Energy Environments - Menczer, Belew (1993)
Changing Latent Energy Environments: A Case for the Evolution of.. - Menczer (1994)
ARACHNID: Adaptive Retrieval Agents Choosing Heuristic.. - Menczer (1997)