Structured Databases on the Web: Observations and Implications (2004)  (20 citations)
Kevin Chen-Chuan Chang, Bin He, Chengkai Li, Mitesh Patel, Zhen Zhang



View or download:
uiuc.edu/pubs/2004...rdchlpzaug04.pdf

From:  uiuc.edu/

Abstract: The Web has been rapidly "deepened" by the prevalence of databases online. With the potentially unlimited information hidden behind their query interfaces, this "deep Web" of searchable databases is clearly an important frontier for data access. This paper surveys this relatively unexplored frontier, measuring characteristics pertinent to both exploring and integrating structured Web sources. On one hand, our "macro" study surveys the deep Web at large, in April 2004, adopting the random...

Cited by:
Light-weight Domain-based Form Assistant: Querying Web Databases (2005)
Fully Automatic Wrapper Generation For Search Engines - Hongkun Zhao Weiyi (2005)
Downloading Hidden Web Content - Ntoulas, Zerfos, Cho (2004)

Active bibliography (related documents):
0.5:   Toward Large Scale Integration: Building a MetaQuerier over.. - Chang, He, Zhang (2004)
0.3:   Organizing Structured Web Sources by Query Schemas: A.. - Bin He Tao (2004)
0.3:   Light-weight Domain-based Form Assistant: Querying Databases .. - Zhang, He, Chang (2005)

Similar documents based on text:
0.3:   MetaQuerier over the Deep Web: Shallow Integration across.. - Chang, He, Zhang (2004)
0.3:   Query Routing: Finding Ways in the Maze of the Deep Web - Govind Kabra Chengkai (2005)
0.2:   Knocking the Door to the Deep Web: Integrating Web Query.. - He, Zhang, Chang (2004)

Related documents from co-citation:
15:   Statistical schema matching across web query interfaces - He, Chang - 2003
11:   The deep Web: Surfacing hidden value - Bergman - 2000
11:   Understanding web query interfaces: Best-effort parsing with hidden syntax - Zhang, He et al. - 2004

BibTeX entry:

K. C.-C. Chang, B. He, C. Li, and Z. Zhang. Structured databases on the web: Observations and implications. Report UIUCDCS-R-2003-2321, Dept. of Computer Science, UIUC, Feb. 2003. http://citeseer.ist.psu.edu/chang04structured.html

@misc{ chang03structured,
  author = "K. Chang and B. He and C. Li and Z. Zhang",
  title = "Structured databases on the web: Observations and implications",
  text = "K. C.-C. Chang, B. He, C. Li, and Z. Zhang. Structured databases on the
    web: Observations and implications. Report UIUCDCS-R-2003-2321, Dept. of
    Computer Science, UIUC, Feb. 2003.",
  year = "2003",
  url = "citeseer.ist.psu.edu/chang04structured.html" }
Citations (may not include all citations):
432   Querying heterogeneous information sources using source desc.. - Levy, Rajaraman et al. - 1996
266   Information integration using logical views - Ullman - 1997
217   Human Behavior and the Principle of Least Effort - Zipf - 1949
198   Database techniques for the world-wide web: A survey - Florescu, Levy et al. - 1998
49   Automatic discovery of language models for text databases - Callan, Connell et al. - 1999
48   Roadrunner: Towards automatic data extraction from large web.. - Crescenzi, Mecca et al. - 2001
44   Crawling the hidden web - Raghavan, Garcia-Molina - 2001
40   Record-boundary discovery in Web documents - Embley, Jiang et al. - 1999
37   Merging ranks from heterogeneous internet sources - Gravano, Garcia-Molina - 1997
35   Determining text databases to search in the internet - Meng, Liu et al. - 1998
33   Methods for information server selection - Hawking, Thistlewaite - 1999
33   The Clio project: managing heterogeneity - Miller, Hernandez et al. - 2001
30   STARTS: Stanford protocol proposal for internet retrieval an.. - Gravano, Chang et al. - 1996
22   Accessibility of information on the web - Lawrence, Giles - 1999
21   Statistical schema matching across web query interfaces - He, Chang - 2003
19   Query routing for web search engines: architecture and exper.. - Sugiura, Etzioni - 2000
12   Understanding web query interfaces: Best effort parsing with.. - Zhang, He et al. - 2004
11   Discovering complex matchings across web query interfaces: A.. - He, Chang et al. - 2004
9   and classify: Categorizing hidden web databases - Ipeirotis, Gravano et al. - 2001
9   Medmaker: A mediation system based on declarative specificat.. - Papakonstantinou, Garcia-Molina et al. - 1996
8   The deep web: Surfacing hidden value - Bergman - 2000
5   Some practical observations on integration of web informatio.. - Cohen - 1999
4   controversies: Information integration - Hearst - 1998
4   and discover: Focused extraction of qa-pagelets from the dee.. - Caverlee, Liu et al. - 2004
1   The UIUC web integration repository - Chang, He et al. - 2003
1   Modeling interactive web sources for information mediation - Ludascher, Gupta - 1999
http://wcp.oclc.org
www.gnu.org/software/wget/wget.html





Documents on the same site (http://metaquerier.cs.uiuc.edu/):
Organizing Structured Web Sources by Query Schemas: A.. - Bin He Tao (2004)
On-the-Fly Constraint Mapping across Web Query Interfaces - Zhang, He, Chang (2004)
A Holistic Paradigm for Schema Matching - He, Chang (2004)


CiteSeer.IST - Copyright Penn State and NEC