MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  The webspace method: On the integration of database technology with information retrieval (2000) [7 citations — 4 self]

Download:
Download as a PDF
by Roelof Van Zwol
In proceedings of CIKM’00
http://www16.brinkster.com/cs425ir/CIKM00-web-quering-db.pdf
Add To MetaCart

Abstract:

Large collections of documents containing various types of multimedia, are made available to the WWW. Unfortunately, due to the un-structuredness of Internet environments it is hard to find specific information when one is looking for it. Search engines available can only rely their results on information retrieval techniques and most of the time they lack the desired power in query formulation. Modelling data on the web, as if it was designed for use within databases, should provide us with the necessary basis for enhancing this query formulation. This of course requires special care for dealing with the included multimedia data and the semi-structured aspects of data on the web. Modelling the entire web would be too ambitious, therefore we focus on a more feasible environment, like the intranet, where one can find large collections of related data. With the webspace method we have already shown how to deal with the various aspects of semi-structured data in large collections of related documents. In this paper we focus on the integration of our webspace method for concept-based search with content-based multimedia information retrieval (IR). A webspace consists of two levels. At the document level, a webspace is considered to be a collection of related documents. At the semantical level, concepts are defined to be used in the documents at the document level. By modelling these concepts using a webspace schema a semantical level of abstraction is gained. This supplies the necessary platform for querying data available within a specific webspace. For the integration with content-based information retrieval an existing IR model is adopted. We will discuss how this is used in the context of Mirror, a Multimedia DBMS, and how this framework is used for the integration with the webspace method for concept-based search.

Citations

286 XML-QL: A Query Language for XML – Deutsch, Fernandez, et al. - 1998
249 Querying the world wide web – Mendelzon, Mihaila, et al. - 1996
238 Web Modeling Language (WebML): a modeling language for designing Web sites," presented at – Ceri, Fraternali, et al. - 2000
227 The POSTGRES next-generation database management system – Stonebraker, Kemnitz - 1991
190 Catching the boat with strudel: Experiences with a web-site management system – Fernandez, Florescu, et al. - 1998
152 Semistructured Data to XML: Migrating the Lore data Model and Query Language – Goldman, Widom - 1999
135 WebOQL: Restructuring Documents, Databases and Webs – Arocena, Mendelzon - 1998
108 A performance evaluation of alternative mapping schemes for storing XML data in a relational database. INRIA – Florescu, Kossmann - 1999
100 XMLGL: A Graphical Language for Querying and Restructuring – Ceri, Comai, et al. - 1999
76 Efficient Relational Storage and Retrieval of XML Documents – Schmidt - 2000
62 Extensible Markup Language (XML – W3C
52 Flattening an object algebra to provide performance – Boncz, Wilschut, et al. - 1998
43 Integrating Keyword Search into XML Query Processing – Florescu, Kossmann, et al. - 2000
32 Atwolevel hypertext retrieval model for legal data – Agosti, Colotti, et al. - 1991
26 Developing Hypermedia Applications using OOHDM," presented at – Schwabe, Rossi - 1998
21 Araneus in the Era of XML – Mecca, Merialdo, et al. - 1999
15 On the integration of IR and databases – Vries, Wilschut - 1999
5 Modelling and querying semistructured data with MOA. Workshop on Query processing for semistructured data and non-standard data formats – Zwol, Apers, et al. - 1999
3 The Araneus guide to web-site development. Technical report, Dipartimento di Informatica e Automazione, Universita’ di Roma Tre – Mecca, Merialdo, et al. - 1999
3 Using webspaces to model document collections on the web – Zwol, Apers - 2000
2 Modelling the webspace of an intranet – Zwol, Apers - 2000
1 Twenty one at trec-7: Ad-hoc and cross-language track – Hiemstra, Kraaij - 1999
1 On views and xml. symposium on – S - 1999
1 Content and multimedia database management systems – Vries - 1999