Abstract: The enormous amount of information available through the
World Wide Web requires the development of effective tools for extracting
and summarizing relevant data from Web sources. In this article we present
a data model for representing Web documents and an associated SQL-like
query language. Our framework provides an easy-to-use and well-formalized
method for automatic generation of wrappers extracting data
from Web documents.
Active bibliography (related documents):
0.9: A Data Model for Information Extraction from the Web - Iocchi (1999)
0.2: Building Intelligent Web Applications Using Lightweight Wrappers - Sahuguet, Azavant (2000)
0.2: RoadRunner: Towards Automatic Data Extraction from Large.. - Crescenzi, Mecca.. (2001)
Similar documents based on text:
0.3: A Framework for Filtering News and Managing Distributed.. - Amati, D'Aloisi.. (1997)
0.3: Improving The Effectiveness Of Web Search Engines.. - Berenci, Carpineto.. (1998)
0.3: Probabilistic Learning for Information Filtering - Amati, Crestani, Ubaldini, al.
BibTeX entry:
@inproceedings{ iocchi99data,
author = "Luca Iocchi",
title = "A Data Model for Information Extraction from the Web",
booktitle = "WebNet (1)",
pages = "538-543",
year = "1999",
url = "citeseer.ist.psu.edu/731810.html" }
Citations (may not include all citations):
501: The Lorel query language for semistructured data - Abiteboul, Quass et al. - 1997
316: Object exchange across heterogeneous information sources - Papakonstantinou, Garcia-Molina et al. - 1995
244: Querying the World Wide Web - Mendelzon, Mihaila et al. - 1997
198: Database techniques for the World Wide Web: a survey - Florescu, Levy et al. - 1998
104: Object fusion in mediator systems - Papakonstantinou, Abiteboul et al. - 1996
80: Template-based wrappers in the TSIMMIS system - Hammer, Garcia-Molina et al. - 1997
77: A query translation scheme for rapid implementation of wrapp.. - Papakonstantinou, Gupta et al. - 1995
53: Evaluating queries with generalized path expression - Christophides, Cluet et al. - 1996
27: JEDI: Extracting and Synthesizing Information from the Web - Huck, Fankhauser et al.
26: WebL a programming language for the Web - Kistler, Marais - 1997
17: Wrapper generation for Web Accessible Data Sources - Gruser, Raschid et al. - 1998
5: Information access in the Web - Iocchi, Nardi - 1997
5: Web Ecology: Recycling HTML pages as XML documents using W4F - Sahuguet, Azavant - 1999
5: Information extraction and database techniques: a user-orien.. - Lacroix, Sahuguet et al. - 1998
2: Knowledge representation techniques for information extracti.. - De Rosa, Iocchi et al. - 1998
http://www.muc.saic.com/
Documents on the same site (http://www.dis.uniroma1.it/~iocchi/publications/):
Self-Localization in the RoboCup Environment - Luca Iocchi And (1999)
Task Assignment with dynamic perception and.. - Farinelli, Iocchi.. (2005)
Planning With Sensing for a Mobile Robot - De Giacomo, Iocchi, Nardi, Rosati (1997)