See this document in CiteSeerX!

Extracting Schema from Semistructured Data (1998)  (Make Corrections)  (66 citations)
SVETLOZAR NESTOROV* evtimov @db.stanford.edu SERGE AB1TEBOUL t...



  Home/Search   Context   Related

 
View or download:
uta.edu/~alp/ix/rea...p295nestorov.pdf
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  uta.edu/~alp/ix/readings/ (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Semistructured data is characterized by the lack of any fixed and rigid schema, although typically the data has some implicit structure. While the lack of fixed schema makes extracting semistructured data fairly easy and an attractive goal, presenting and querying such data is greatly impaired. Thus, a critical problem is the discovery of the structure implicit in semistructured data and, subsequently, the recasting of the raw data in terms of this structure. In this paper, we consider a very... (Update)

Cited by:   More
Automatic Discovery of Semantic Structures in HTML Documents - Saikat Mukherjee Guizhen   (Correct)
Fast Mining of Frequent Tree Structures By Hashing and.. - Dimitrios Katsaros..   (Correct)
XStruct: Ecient Schema Extraction - From Multiple And   (Correct)

Similar documents (at the sentence level):
79.5%:   Extracting Schema from Semistructured Data - Nestorov, Abiteboul, Motwani (1998)   (Correct)

Active bibliography (related documents):   More   All
0.0:   MedLan: a Logic-based Mediator Language - Aquilino, Asirelli, Renso, Turini   (Correct)
0.0:   Constraint Databases: A Survey - Revesz (1998)   (Correct)
0.0:   On the Formalization of Actions Using Transaction Logic - Santos (1996)   (Correct)

Similar documents based on text:   More   All
0.5:   Finding Structure and Characteristics of Web Documents for.. - Wong, Fu (2000)   (Correct)
0.5:   Inferring Structure in Semistructured Data - Nestorov, Abiteboul, Motwani (1997)   (Correct)
0.4:   Query Flocks: A Generalization of Association-Rule Mining - Tsur, Ullman.. (1997)   (Correct)

Related documents from co-citation:   More   All
29:   Querying semi-structured data - Abiteboul - 1997
27:   Dataguides: Enabling query formulation and optimization in semistructured databa.. - Goldman, Widom - 1977
25:   The lorel query language for semistructured data - Abiteboul, Quass et al. - 1997

BibTeX entry:   (Update)

S. Nestorov, S. Abiteboul, and R. Motwani. Extracting schema from semistructured data. In Proceedings of the ACM SIGMOD International Conference on Management of Data, Seattle, Washington, May 1998. http://citeseer.ist.psu.edu/nestorov98extracting.html   More

@inproceedings{ nestorov98extracting,
    author = "Svetlozar Nestorov and Serge Abiteboul and Rajeev Motwani",
    title = "Extracting schema from semistructured data",
    pages = "295--306",
    year = "1998",
    url = "citeseer.ist.psu.edu/nestorov98extracting.html" }
Citations (may not include all citations):
775   Foundations of Databases (context) - Abiteboul, Hull et al. - 1995
373   Querying semi-structured data - Abiteboul - 1997
22   Handbook of Theoretical Computer Science (context) - Apt - 1991



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://ranger.uta.edu/~alp/ix/readings/):   More
Dynamic Topic Identification: Towards Combination of.. - Bigi, Brun, Haton..   (Correct)
Text Categorization with Support Vector Machines: Learning with.. - Joachims (1997)   (Correct)
A Statistical Information Extraction System for Turkish - Tür (2000)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC