Abstract:
The Web has been rapidly “deepened ” by myriad searchable databases online, where data are hidden behind query forms. Helping users query alternative “deep Web ” sources in the same domain (e.g., Books, Airfares) is an important task with broad applications. As a core component of those applications, dynamic query translation (i.e., translating a user’s query across dynamically selected sources) has not been extensively explored. While existing works focus on isolated subproblems (e.g., schema matching, query rewriting) to study, we target at building a complete query translator and thus face new challenges: 1) To complete the translator, we need to solve the predicate mapping problem (i.e., map a source predicate to target predicates), which is largely unexplored by existing works; 2) To satisfy our application requirements, we need to design a customizable system architecture to assemble various components addressing respective subproblems (i.e., schema matching, predicate mapping, query rewriting). Tackling these challenges, we develop a light-weight domain-based form assistant, which can generally handle alternative sources in the same domain and is easily customizable to new domains. Our experiment shows the effectiveness of our form assistant in translating queries for real Web sources. 1
Citations
|
603
|
Querying Heterogeneous Information Sources Using Source Descriptions
– Levy, Rajaraman, et al.
- 1996
|
|
273
|
Answering queries using views: A survey
– Halevy
- 2001
|
|
271
|
Generic Schema Matching with Cupid
– Madhavan, Bernstein, et al.
|
|
223
|
Reconciling schemas of disparate data sources: A machine-learning approach,” SIGMOD
– Doan, Domingos, et al.
- 2001
|
|
200
|
Your mediators need data conversion
– Cluet, Delobel, et al.
- 1998
|
|
183
|
Infomaster: An information integration system
– Genesereth, Keller, et al.
- 1997
|
|
181
|
Answering queries using templates with binding patterns
– Rajaraman, Sagiv, et al.
- 1995
|
|
116
|
A Query Translation Scheme for Rapid Implementation of Wrappers
– Papakonstantinou
- 1995
|
|
79
|
Capabilities-based query rewriting in mediator systems
– Papakonstantinou, Gupta, et al.
- 1996
|
|
69
|
Statistical schema matching across Web query interfaces
– He, Chang
- 2003
|
|
48
|
Capability Based Mediation in TSIMMIS
– Li
|
|
47
|
On Schema Matching with Opaque Column Names and Data Values
– Kang, Naughton
|
|
36
|
An Interactive Clustering-based Approach to Integrating Source Query interfaces on the Deep Web
– Wu, Yu, et al.
|
|
35
|
Understanding web query interfaces: best-effort parsing with hidden syntax
– Zhang, He, et al.
- 2004
|
|
34
|
Structured databases on the Web: Observations and implications
– Chang, He, et al.
- 2003
|
|
28
|
Clio: A semi-automatic tool for schema mapping
– Hernandez, Miller, et al.
- 2001
|
|
26
|
Discovering complex matchings across web query interfaces: A correlation mining approach
– He, Chang, et al.
- 2004
|
|
18
|
Toward large scale integration: Building a metaquerier over databases on the web
– Chang, He, et al.
- 2005
|
|
9
|
Light-weight domain-based form assistant: querying web databases on the fly
– Zhang, He, et al.
- 2005
|
|
7
|
Approximate Query Mapping: Accounting for Translation Closeness
– Chang, García-Molina
- 2001
|
|
5
|
The deep web: Surfacing hidden value. Accessible at http://brightplanet.com
– com
- 2000
|
|
2
|
The UIUC web integration repository. http://metaquerier.cs.uiuc.edu/repository
– Chang, He, et al.
|