On-the-fly constraint mapping across web query interfaces (2004) [3 citations — 2 self]
Abstract:
Recently, the Web has been rapidly “deepened ” with the prevalence of databases online and becomes an important frontier for data integration. On this deep Web, a significant amount of information can only be accessed as response to dynamically issued queries to the query interface of a back-end database, instead of by traversing static URL links. Such a query interface expresses a set of constraint templates, where each constraint template states how an attribute can be queried. To enable automatic query mediation among heterogenous deep Web sources, it is critical to automatically translate those constraints, which we name as constraint mapping. In particular, this paper aims at enabling on-the-fly constraint mapping, which is a critical task for integrating the large scale and dynamic deep Web. Such on-the-fly query translation poses a significant new challenge on the generality and extensibility of the translation framework. Existing works pursue a per-source rule-driven framework and thus cannot satisfy such requirements. In contrast, we propose a generic type-based search-driven translation framework by considering the constraint mapping for each data type as a search problem. In particular, in this paper, we develop search algorithms for text and numeric types. Our experiments over real deep Web sources show that our approach is promising to mediate queries for large scale integration. 1.
Citations
| 510 | A survey of approaches to automatic schema matching – Rahm, Bernstein |
| 273 | Answering queries using views: A survey – Halevy - 2001 |
| 200 | Word problems requiring exponential time: Preliminary report – Stockmeyer, Meyer - 1973 |
| 70 | The Deep Web: Surfacing Hidden Value – Bergman - 2000 |
| 69 | Statistical schema matching across Web query interfaces – He, Chang - 2003 |
| 65 | Data Driven Understanding and Refinement of Schema Mappings – Yan, Miller, et al. - 2001 |
| 47 | Mind your vocabulary: Query mapping across heterogeneous information sources – Chang, Garcia-Molina - 1999 |
| 36 | An Interactive Clustering-based Approach to Integrating Source Query interfaces on the Deep Web – Wu, Yu, et al. |
| 35 | Understanding web query interfaces: best-effort parsing with hidden syntax – Zhang, He, et al. - 2004 |
| 16 | The UIUC web integration repository – Chang, He, et al. - 2003 |

