(Enter summary)
Abstract: The Web has been rapidly "deepened" by the prevalence of databases
online. With the potentially unlimited information hidden behind
their query interfaces, this "deep Web" of searchable databases is
clearly an important frontier for data access. This paper surveys
this relatively unexplored frontier, measuring characteristics pertinent
to both exploring and integrating structured Web sources. On
one hand, our "macro" study surveys the deep Web at large, in April
2004, adopting the random... (Update)
Cited by: More
Light-weight Domain-based Form Assistant: - Querying Web Databases (2005)
(Correct)
Fully Automatic Wrapper Generation For Search Engines - Hongkun Zhao Weiyi (2005)
(Correct)
Downloading Hidden Web Content - Ntoulas, Zerfos, Cho (2004)
(Correct)
Active bibliography (related documents): More All
0.5: Toward Large Scale Integration: Building a MetaQuerier over.. - Chang, He, Zhang (2004)
(Correct)
0.3: Organizing Structured Web Sources by Query Schemas: A.. - Bin He Tao (2004)
(Correct)
0.3: Light-weight Domain-based Form Assistant: Querying Databases .. - Zhang, He, Chang (2005)
(Correct)
Similar documents based on text: More All
0.3: MetaQuerier over the Deep Web: Shallow Integration across.. - Chang, He, Zhang (2004)
(Correct)
0.3: Query Routing: Finding Ways in the Maze of the Deep Web - Govind Kabra Chengkai (2005)
(Correct)
0.2: Knocking the Door to the Deep Web: Integrating Web Query.. - He, Zhang, Chang (2004)
(Correct)
Related documents from co-citation: More All
15: Statistical schema matching across web query interfaces
- He, Chang - 2003
11: The deep Web: Surfacing hidden value
- Bergman - 2000
11: Understanding web query interfaces: Best-effort parsing with hidden syntax (context) - Zhang, He et al. - 2004
BibTeX entry: (Update)
K. C.-C. Chang, B. He, C. Li, and Z. Zhang. Structured databases on the web: Observations and implications. Report UIUCDCS-R-2003-2321, Dept. of Computer Science, UIUC, Feb. 2003. http://citeseer.ist.psu.edu/chang04structured.html More
@misc{ chang03structured,
author = "K. Chang and B. He and C. Li and Z. Zhang",
title = "Structured databases on the web: Observations and implications",
text = "K. C.-C. Chang, B. He, C. Li, and Z. Zhang. Structured databases on the
web: Observations and implications. Report UIUCDCS-R-2003-2321, Dept. of
Computer Science, UIUC, Feb. 2003.",
year = "2003",
url = "citeseer.ist.psu.edu/chang04structured.html" }
Citations (may not include all citations):
432
Querying heterogeneous information sources using source desc..
- Levy, Rajaraman et al. - 1996
266
Information integration using logical views
- Ullman - 1997
217
Human Behavior and the Principle of Least Effort (context) - Zipf - 1949
198
Database techniques for the world-wide web: A survey
- Florescu, Levy et al. - 1998
49
Automatic discovery of language models for text databases
- Callan, Connell et al. - 1999
48
Roadrunner: Towards automatic data extraction from large web..
- Crescenzi, Mecca et al. - 2001
44
Crawling the hidden web
- Raghavan, Garcia-Molina - 2001
40
Record-boundary discovery in Web documents
- Embley, Jiang et al. - 1999
37
Merging ranks from heterogeneous internet sources
- Gravano, Garca-Molina - 1997
35
Determining text databases to search in the internet
- Meng, Liu et al. - 1998
33
Methods for information server selection
- Hawking, Thistlewaite - 1999
33
The Clio project: managing heterogeneity
- Miller, Hernandez et al. - 2001
30
STARTS: Stanford protocol proposal for internet retrieval an.. (context) - Gravano, Chang et al. - 1996
22
Accessibility of information on the web (context) - Lawrence, Giles - 1999
21
Statistical schema matching across web query interfaces
- He, Chang - 2003
19
Query routing for web search engines: architecture and exper.. (context) - Sugiura, Etzioni - 2000
12
Understanding web query interfaces: Best effort parsing with.. (context) - Zhang, He et al. - 2004
11
Discovering complex matchings across web query interfaces: A..
- He, Chang et al. - 2004
9
and classify: Categorizing hidden web databases (context) - Ipeirotis, Gravano et al. - 2001
9
Medmaker: A mediation system based on declarative specificat..
- Papakonstantinou, Garca-Molina et al. - 1996
8
The deep web: Surfacing hidden value
- com - 2000
5
Some practical observations on integration of web informatio..
- Cohen - 1999
4
controversies: Information integration (context) - Hearst - 1998
4
and discover: Focused extraction of qa-pagelets from the dee.. (context) - Caverlee, Liu et al. - 2004
1
The UIUC web integration repository (context) - Chang, He et al. - 2003
1
Modeling interactive web sources for information mediation
- Ludascher, Gupta - 1999
http://wcp.oclc.org"
www.gnu.org/software/wget/wget.html"
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://metaquerier.cs.uiuc.edu/): More
Organizing Structured Web Sources by Query Schemas: A.. - Bin He Tao (2004)
(Correct)
On-the-Fly Constraint Mapping across Web Query Interfaces - Zhang, He, Chang (2004)
(Correct)
A Holistic Paradigm for Schema Matching - He, Chang (2004)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC