MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  On the Information Content of Semi-Structured Databases

Download:
Download as a PDF
by Mark Levene
http://www.dcs.bbk.ac.uk/~mark/download/cfg.pdf
Add To MetaCart

Abstract:

In a semi-structured database there is no clear separation between the data and the schema, and the degree to which it is structured depends on the application. Semi-structured data is naturally modelled in terms of graphs which contain labels which give semantics to its underlying structure. Such databases subsume the modelling power of recent extensions of flat relational databases, to nested databases which allow the nesting (or encapsulation) of entities, and to object databases which, in addition, allow cyclic references between objects. Due to the flexibility of data modelling in a semi-structured environment, in any given application there may be different ways in which to enter the data, but it is not always clear when the semantics are the same. In order to compare different approaches to modelling the data we investigate a measure of the information content of typical semi-structured databases in order to test whether such databases are information-wise equivalent. For the purpose of our investigation we use a graph-based data model, called the hypernode model, as our model for semi-structured data and formalise flat, nested and object databases as subclasses of hypernode databases.

Citations

7709 Computers and Intractability: A Guide to the Theory of NP-Completeness – Garey, Johnson - 1979
2770 Introduction to Automata Theory, Language, and Computation – Hopcroft, Ullman - 1979
1059 The entity-relationship model: Toward a unified view of data – Chen - 1976
469 Querying semi-structured data – Abiteboul - 1997
461 Querying objectoriented databases – Kiier, Kim, et al. - 1992
289 Process Modeling – Curtis, Kellner, et al. - 1992
224 Semistructured data – Buneman - 1997
189 Extending the database relational model to capture more meaning – Codd - 1979
134 GraphLog: A Visual Formalism for Real Life Recursion – Consens, Mendelzon
116 Distance in Graphs – Buckley, Harary - 1990
92 A Graph-Oriented Object Database Model – Gyssens, Paredaens, et al. - 1991
83 Relative Information Capacity of Simple Relational Database Schemata – Hull - 1986
57 Equivalence of datalog queries is undecidable – SHMUELI - 1993
50 The format model: A theory of database organization – Hull, Yap - 1982
42 Restructuring hierarchical database objects. TCS – Abiteboul, Hull - 1988
36 Identifying extended entityrelationship object structures in relational schemas – Markowitz, Makowsky - 1990
31 On the equivalence, containment, and covering problems for the regular and context-free languages – Hunt, Rosenkrantz, et al. - 1976
31 A nested-graph model for the representation and manipulation of complex objects – Poulovassilis, Levene - 1994
21 The Logical Data Model – Kuper, Vardi - 1993
14 Database Modelling and Design: The Fundamental Principles – Teorey - 1994
10 A Graph-Based Data Model and its Ramifications – Levene, Loizou - 1995
9 An algebra for a directional binary entity-relationship model – Chen - 1984
6 On the expressive power of the logical data model – Kuper, Vardi - 1985
4 On the time and tape complexity of languages – Hunt - 1973
4 The interface between language theory and database theory – Ullman - 1992
1 A framework for seamless conceptual data and process modelling – Levene, Scott, et al. - 1997