Grammars provide a convenient means to describe the set of valid strings in a language, and thus they seem natural for describing the set of valid instances in a text database. It is well-known that a given language can be described by many grammars, and similarly text database designers have a choice of grammar for specifying valid documents. This flexibility can be exploited to provide information modelling capability by designing productions in the grammar to represent entities and relationships of interest to the database applications. Additional constraints can be specified by attaching predicates to selected non-terminals in the grammar. In this paper, we formalize and illustrate the use of extended grammars for text databases. When used for database definition, grammars can provide the functionality that users have come to expect of database schemas. Extended grammars can also be used to specify database manipulation, including query, update, view definition, and index specification.
|
402
|
The SGML handbook
– Goldfarb, Rubinsky
- 1990
|
|
400
|
Dataguides: Enabling query formulation and optimization in semistructured databases
– Goldman, Widom
- 1997
|
|
389
|
A Query Language and Optimization Techniques for Unstructured Data
– Buneman, Davidson, et al.
- 1996
|
|
278
|
Lore: A Database Management System for Semistructured Data
– McHugh, Abiteboul, et al.
- 1997
|
|
246
|
From structured documents to novel query facilities
– Christophides, Abiteboul, et al.
- 1994
|
|
155
|
The Theory of Parsing
– Aho, Ullman
- 1972
|
|
105
|
Querying semistructured heterogeneous information
– Quass, Rajaraman, et al.
- 1995
|
|
103
|
Change detection in hierarchically structured information
– Chawathe, Rajaraman, et al.
- 1996
|
|
102
|
Querying and updating the file
– Abiteboul, Cluet, et al.
- 1993
|
|
83
|
Guidelines for Electronic Text Encoding and Interchange. Chicago and
– Sperberg-McQueen, Burnard
- 1994
|
|
76
|
Mind your grammar | a new approach to modelling text
– Tompa
- 1987
|
|
75
|
Correspondence and translation for heterogeneous data
– Abiteboul, Cluet, et al.
- 2002
|
|
64
|
Proximal nodes: A model to query document databases by content and structure
– NAVARRO, BAEZA-YATES
- 1997
|
|
50
|
Expressions: an Algebra for Text Search
– PAT
- 1992
|
|
45
|
den Bussche. “Expressiveness of Structured Document Query Languages Based on Attribute Grammars
– Neven, van
- 2002
|
|
42
|
Data Models
– Tsitchizris, Lochovsky
- 1982
|
|
36
|
Storage and retrieval of structured documents
– Macleod
- 1990
|
|
36
|
Database systems for structured documents
– Sacks-Davis, Arnold-Moore, et al.
- 1994
|
|
32
|
A query language for retrieving information from hierarchic text structures
– Macleod
- 1991
|
|
32
|
Guidelines for Electronic Text Encoding and
– Sperberg-McQueen, Burnard
- 1994
|
|
32
|
Text databases: A survey of text models and systems
– Loeffen
- 1994
|
|
30
|
The role of semantics in translating view updates
– Keller
- 1986
|
|
27
|
Integrating a structured-text retrieval system with an objectoriented database system
– Yan, Annevelink
- 1994
|
|
26
|
A grammar-based approach towards unifying hierarchical data models
– Gyssens, Paredaens, et al.
- 1989
|
|
26
|
Pair grammars, graph languages and string-to-graph translations
– Pratt
- 1971
|
|
24
|
Atlas: A nested relational database system for text applications
– Sacks-Davis, Kent, et al.
- 1995
|
|
23
|
An algebra for hierarchically organized text-dominated databases
– Burkowski
- 1992
|
|
20
|
Rita---An editor and user interface for manipulating structured documents
– Cowan, Mackie, et al.
- 1991
|
|
20
|
Structured answers for a large structured document collection
– Fuller, Mackie, et al.
- 1993
|
|
20
|
Regular right part grammars and their parsers
– LALONDE
- 1977
|
|
19
|
Schema-Independent Retrieval from Heterogeneous Structured Text
– Clarke, Cormack, et al.
- 1995
|
|
18
|
Specifying structured document transformations
– Furuta, Stotts
- 1988
|
|
17
|
Hypertext and the Oxford English dictionary
– Raymond, Tomps
- 1988
|
|
17
|
PAT expressions: an algebra for text search
– Salminen, Tompa
- 1992
|
|
16
|
A transient hypergraph-based model for data access
– Watters, Shepherd
- 1990
|
|
15
|
From text to hypertext by indexing
– Salminen, Tague-Sutcliffe, et al.
- 1995
|
|
13
|
Partial Order Databases
– Raymond
- 1996
|
|
11
|
Shortening the OED: Experience with a grammar-defined database
– Blake, Bray, et al.
- 1992
|
|
11
|
A data model for use with formatted and textual data
– Desai, Goyal, et al.
- 1986
|
|
9
|
From data representation to data model: meta-semantic issues
– Raymond, Tompa, et al.
- 1996
|
|
8
|
Concepts for modeling and querying list-structured data
– Colby, Saxton, et al.
- 1994
|
|
8
|
Examples of Pat applied to the Oxford English Dictionary
– Gonnet
- 1987
|
|
8
|
Transformation of structured documents with the use of grammar
– Kuikka, Penttonen
- 1993
|
|
8
|
Two-dimensional filters for structured text
– Kuikka, Salminen
- 1995
|
|
8
|
A two-level structure for textual databases to support hypertext access
– Salminen, Watters
- 1992
|
|
8
|
Complete formal model for information retrieval systems
– Tague, Salminen, et al.
- 1991
|
|
8
|
Hypertext by link-resolving components
– Tompa, Blake, et al.
- 1993
|
|
8
|
A Structured Text ADT for Object-Relational Databases. Theory and Practice of Object Systems 4(4
– Brown, Consens, et al.
- 1998
|
|
7
|
Adding Structured Text to SQL/MM Part 2: Full Text
– Davis
- 1996
|
|
7
|
Indexing structured text for queries on containment relationships
– Dao, Sacks-Davis, et al.
- 1996
|