Abstract:
Large collections of documents containing various types of multimedia are made available on the WWW. Unfortunately, because Internet environments are unstructured, it is hard to find specific information. Available search engines can only base their results on information retrieval techniques, and most of the time they lack the desired power in query formulation. Modelling data on the web as if it were designed for use within databases should provide the necessary basis for enhancing query formulation. This requires special care in dealing with the included multimedia data and the semi-structured aspects of data on the web. Modelling the entire web would be too ambitious; we therefore focus on a more feasible environment, such as an intranet, where one can find large collections of related data. With the webspace method we have already shown how to deal with the various aspects of semi-structured data in large collections of related documents. In this paper we focus on the integration of our webspace method for concept-based search with content-based multimedia information retrieval (IR). A webspace consists of two levels. At the document level, a webspace is considered to be a collection of related documents. At the semantic level, the concepts used in the documents at the document level are defined. By modelling these concepts using a webspace schema, a semantic level of abstraction is gained. This supplies the necessary platform for querying the data available within a specific webspace. For the integration with content-based information retrieval, an existing IR model is adopted. We discuss how this model is used in the context of Mirror, a multimedia DBMS, and how this framework is used for the integration with the webspace method for concept-based search.