(Enter summary)
Abstract: . This paper describes the methodology and the software development of XWRAP, an XML-enabled wrapper construction system for semi-automatic generation of wrapper programs. By XML-enabled we mean that the metadata about information content that are implicit in the original web pages will be extracted and encoded explicitly as XML tags in the wrapped documents. In addition, the query-based content filtering process is performed against the XML documents. The XWRAP wrapper generation framework has ... (Update)
Cited by: More
Automatic Discovery of Semantic Structures in HTML Documents - Saikat Mukherjee Guizhen
(Correct)
Monadic Datalog and the Expressive Power of - Languages For Web
(Correct)
ATwo-Phase Rule Generation and Optimization Approach for - Wrapper Generation Yanan
(Correct)
Similar documents (at the sentence level):
68.8%: XWRAP: An XML-enabled Wrapper Construction System for Web.. - Liu, Pu, Han (2000)
(Correct)
Active bibliography (related documents): More All
0.4: A Fully Automated Object Extraction System for the World Wide.. - Buttler, Liu, Pu (2001)
(Correct)
0.3: Versus: a Web Data Repository with Time Support - Campos (2003)
(Correct)
0.3: Building an Extensible Wrapper Repository System: A Metadata.. - Calton
(Correct)
Similar documents based on text: More All
0.5: Wrapping Web Data into XML - Han, Buttler, Pu (2001)
(Correct)
0.3: An XML-based Wrapper Generator for Web Information.. - Liu, Han, Buttler, Pu, Tang (1999)
(Correct)
0.2: Adaptation Space: A Design Framework for - Adaptive Web Services
(Correct)
Related documents from co-citation: More All
18: Visual Web Information Extraction with Lixto
- Baumgartner, Flesca et al. - 2001
12: Building Intelligent Web Applications Using Lightweight Wrappers
- Sahuguet, Azavant - 2000
11: A hierarchical approach to wrapper induction
- Muslea, Minton et al. - 1999
BibTeX entry: (Update)
L. Liu, C. Pu, and W. Han. XWRAP: An XML-enabled wrapper construction system for web information sources. International Conference on Data Engineering (ICDE), pages 611--621, 2000. http://citeseer.ist.psu.edu/liu00xwrap.html More
@inproceedings{ liu00xwrap,
author = "Ling Liu and Calton Pu and Wei Han",
title = "{XWRAP}: An {XML}-Enabled Wrapper Construction System for Web Information Sources",
booktitle = "{ICDE}",
pages = "611-621",
year = "2000",
url = "citeseer.ist.psu.edu/liu00xwrap.html" }
Citations (may not include all citations):
228
Wrapper induction for information extraction
- Kushmerick, Weil et al. - 1997
228
Wrapper induction for information extraction
- Kushmerick - 1997
140
The TSIMMIS approach to mediation: data models and languages (context) - Garcia-Molina - 1995
101
Modeling web sources for information integration
- Knoblock, Minton et al. - 1998
97
Continual queries for internet-scale event-driven informatio..
- Liu, Pu et al. - 1999
80
Template-based wrappers in the tsimmis system
- Hammer, Brennig et al. - 1997
68
Learning to extract text-based information from the world wi..
- Soderland - 1997
64
Semi-automatic wrapper generation for internet information s..
- Ashish, Knoblock - 1997
62
Nodose - a tool for semi-automatically extracting structured..
- Adelberg - 1998
50
Cut and paste
- Atzeni, Mecca - 1997
17
CQ: A Personalized Update Monitoring Toolkit
- Liu, Pu et al. - 1998
17
Microsoft repository version 2 and the open information mode.. (context) - Bernstein, Bergstraesser et al. - 1999
13
WysiWyg Web Wrapper Factory (context) - Sahuguet, Azavant - 1999
4
Extracting semi-structured data from the web (context) - Hammer, Garcia-Molina et al. - 1997
3
Versions and workspaces in microsoft repositorys (context) - Bergstraesser, Bernstein et al. - 1999
2
VLDB'97 Tutorial and ACM SIGMOD'96 Tutorial (context) - Bernstein - 1997
2
Clean Up Your Web Pahes with HTML TIDY (context) - Raggett - 1999
2
WD-html-in-xml (context) - HTML, XML et al. - 1999
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://www.cc.gatech.edu/~lingliu/CQ/publication.html):
CONQUER: A Continual Query System for Update Monitoring in.. - Liu, Pu, Tang, Han (1999)
(Correct)
Continual Queries for Internet Scale Event-Driven Information.. - Liu (1999)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC