Abstract:
In a semi-structured database there is no clear separation between the data and the schema, and the degree to which it is structured depends on the application. Semi-structured data is naturally modelled in terms of graphs which contain labels which give semantics to its underlying structure. Such databases subsume the modelling power of recent extensions of flat relational databases, to nested databases which allow the nesting (or encapsulation) of entities, and to object databases which, in addition, allow cyclic references between objects. Due to the flexibility of data modelling in a semi-structured environment, in any given application there may be different ways in which to enter the data, but it is not always clear when the semantics are the same. In order to compare different approaches to modelling the data we investigate a measure of the information content of typical semi-structured databases in order to test whether such databases are information-wise equivalent. For the purpose of our investigation we use a graph-based data model, called the hypernode model, as our model for semi-structured data and formalise flat, nested and object databases as subclasses of hypernode databases.
Citations
|
7709
|
Computers and Intractability: A Guide to the Theory of NP-Completeness
– Garey, Johnson
- 1979
|
|
2770
|
Introduction to Automata Theory, Language, and Computation
– Hopcroft, Ullman
- 1979
|
|
1059
|
The entity-relationship model: Toward a unified view of data
– Chen
- 1976
|
|
469
|
Querying semi-structured data
– Abiteboul
- 1997
|
|
461
|
Querying objectoriented databases
– Kiier, Kim, et al.
- 1992
|
|
289
|
Process Modeling
– Curtis, Kellner, et al.
- 1992
|
|
224
|
Semistructured data
– Buneman
- 1997
|
|
189
|
Extending the database relational model to capture more meaning
– Codd
- 1979
|
|
134
|
GraphLog: A Visual Formalism for Real Life Recursion
– Consens, Mendelzon
|
|
116
|
Distance in Graphs
– Buckley, Harary
- 1990
|
|
92
|
A Graph-Oriented Object Database Model
– Gyssens, Paredaens, et al.
- 1991
|
|
83
|
Relative Information Capacity of Simple Relational Database Schemata
– Hull
- 1986
|
|
57
|
Equivalence of datalog queries is undecidable
– SHMUELI
- 1993
|
|
50
|
The format model: A theory of database organization
– Hull, Yap
- 1982
|
|
42
|
Restructuring hierarchical database objects. TCS
– Abiteboul, Hull
- 1988
|
|
36
|
Identifying extended entityrelationship object structures in relational schemas
– Markowitz, Makowsky
- 1990
|
|
31
|
On the equivalence, containment, and covering problems for the regular and context-free languages
– Hunt, Rosenkrantz, et al.
- 1976
|
|
31
|
A nested-graph model for the representation and manipulation of complex objects
– Poulovassilis, Levene
- 1994
|
|
21
|
The Logical Data Model
– Kuper, Vardi
- 1993
|
|
14
|
Database Modelling and Design: The Fundamental Principles
– Teorey
- 1994
|
|
10
|
A Graph-Based Data Model and its Ramifications
– Levene, Loizou
- 1995
|
|
9
|
An algebra for a directional binary entity-relationship model
– Chen
- 1984
|
|
6
|
On the expressive power of the logical data model
– Kuper, Vardi
- 1985
|
|
4
|
On the time and tape complexity of languages
– Hunt
- 1973
|
|
4
|
The interface between language theory and database theory
– Ullman
- 1992
|
|
1
|
A framework for seamless conceptual data and process modelling
– Levene, Scott, et al.
- 1997
|