(Enter summary)
Abstract: Information mediators are systems capable of providing a unified view of several information sources. Central to any mediator that accesses Web-based sources is a set of wrappers that can extract relevant information from Web pages. In this paper, we present a wrapper-induction algorithm that generates extraction rules for Web-based information sources. We introduce landmark automata, a formalism that describes classes of extraction rules. Our wrapper induction algorithm, stalker, generates... (Update)
Similar documents based on text: More All
0.4: Wrapper Induction for Semistructured, Web-based.. - Muslea, Minton, Knoblock (1998)
(Correct)
0.4: Hierarchical Wrapper Induction for Semistructured.. - Muslea, Minton, Knoblock (2001)
(Correct)
0.3: A Hierarchical Approach to Wrapper Induction - Muslea, Minton, Knoblock (1999)
(Correct)
Related documents from co-citation: More All
13: Wrapper induction for information extraction
- Kushmerick, Weld et al. - 1997
7: Semi-automatic wrapper generation for internet information sources
- Ashish, Knoblock - 1997
6: Learning to extract text-based information from the world wide web
- Soderland - 1997
BibTeX entry: (Update)
I. Muslea, S. Minton, and C.A. Knoblock. STALKER: Learning extraction rules for semistructured, Web-based information sources. In Proceedings of AAAI-98 Workshop on AI and Information Integration, Technical Report WS-98-01, AAAI Press, Menlo Park, CA (1998). http://citeseer.ist.psu.edu/muslea98stalker.html More
@misc{ muslea98stalker,
author = "I. Muslea and S. Minton and C. Knoblock",
title = "STALKER: Learning extraction rules for semistructured",
text = "I. Muslea, S. Minton, and C.A. Knoblock. STALKER: Learning extraction rules
for semistructured, Web-based information sources. In Proceedings of AAAI-98
Workshop on AI and Information Integration, Technical Report WS-98-01, AAAI
Press, Menlo Park, CA (1998).",
year = "1998",
url = "citeseer.ist.psu.edu/muslea98stalker.html" }
Citations (may not include all citations):
300
The tsimmis project: integration of heterogeneous informatio..
- Chawathe, Garcia-Molina et al. - 1994
228
Wrapper induction for information extraction
- Kushmerick - 1997
101
Modeling web sources for information integration
- Knoblock, Minton et al. - 1998
64
Semi-automatic wrapper generation for internet information s..
- Ashish, Knoblock - 1997
43
Semistructured and structured data in the web: going back an..
- Atzeni, Mecca et al. - 1997
2
Cut and paste (context) - Cooperative, Systems et al. - 1997
2
and Chevalier (context) - Chidlovskii, Borghoff - 1997
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://www.isi.edu/ariadne/):
A Hierarchical Approach to Wrapper Induction - Muslea, Minton, Knoblock (1999)
(Correct)
Wrapper Induction for Semistructured, Web-based.. - Muslea, Minton, Knoblock (1998)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC