
A Data Model for Information Extraction from the Web
Luca Iocchi
WebNet (1)



Source: dis.uniroma1.it/~ioc...jnca99sub.ps.gz

Abstract: The enormous amount of information available through the World Wide Web requires the development of effective tools for extracting and summarizing relevant data from Web sources. In this article we present a data model for representing Web documents and an associated SQL-like query language. Our framework provides an easy-to-use and well-formalized method for the automatic generation of wrappers that extract data from Web documents.

BibTeX entry:

@inproceedings{ iocchi99data,
    author = "Luca Iocchi",
    title = "A Data Model for Information Extraction from the Web",
    booktitle = "WebNet (1)",
    pages = "538-543",
    year = "1999",
    url = "citeseer.ist.psu.edu/731810.html" }

Citations (may not include all citations):
501   The Lorel query language for semistructured data - Abiteboul, Quass et al. - 1997
316   Object exchange across heterogeneous information sources - Papakonstantinou, Garcia-Molina et al. - 1995
244   Querying the World Wide Web - Mendelzon, Mihaila et al. - 1997
198   Database techniques for the World Wide Web: a survey - Florescu, Levy et al. - 1998
104   Object fusion in mediator systems - Papakonstantinou, Abiteboul et al. - 1996
80   Template-based wrappers in the TSIMMIS system - Hammer, Garcia-Molina et al. - 1997
77   A query translation scheme for rapid implementation of wrapp.. - Papakonstantinou, Gupta et al. - 1995
53   Evaluating queries with generalized path expression - Christophides, Cluet et al. - 1996
27   JEDI: Extracting and Synthesizing Information from the Web - Huck, Fankhauser et al.
26   WebL a programming language for the Web - Kistler, Marais - 1997
17   Wrapper generation for Web Accessible Data Sources - Gruser, Raschid et al. - 1998
5   Information access in the Web - Iocchi, Nardi - 1997
5   Web Ecology: Recycling HTML pages as XML documents using W4F - Sahuguet, Azavant - 1999
5   Information extraction and database techniques: a user-orien.. - Lacroix, Sahuguet et al. - 1998
2   Knowledge representation techniques for information extracti.. - De Rosa, Iocchi et al. - 1998
http://www.muc.saic.com/
