See this document in CiteSeerX!

STALKER: Learning Extraction Rules for Semistructured, Web-based Information Sources (1998)  (Make Corrections)  (20 citations)
Ion Muslea, Steve Minton, Craig Knoblock



  Home/Search   Context   Related

 
View or download:
isi.edu/ariadne/papers/98AIII.ps
isi.edu/~muslea/PS/a3iWII.ps
isi.edu/~knoblock/papers/98aiii.ps
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  isi.edu/ariadne/ (more)
From:  isi.edu/~knoblock/
Homepages:  I.Muslea  

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Information mediators are systems capable of providing a unified view of several information sources. Central to any mediator that accesses Web-based sources is a set of wrappers that can extract relevant information from Web pages. In this paper, we present a wrapper-induction algorithm that generates extraction rules for Web-based information sources. We introduce landmark automata, a formalism that describes classes of extraction rules. Our wrapper induction algorithm, stalker, generates... (Update)

Similar documents based on text:   More   All
0.4:   Wrapper Induction for Semistructured, Web-based.. - Muslea, Minton, Knoblock (1998)   (Correct)
0.4:   Hierarchical Wrapper Induction for Semistructured.. - Muslea, Minton, Knoblock (2001)   (Correct)
0.3:   A Hierarchical Approach to Wrapper Induction - Muslea, Minton, Knoblock (1999)   (Correct)

Related documents from co-citation:   More   All
13:   Wrapper induction for information extraction - Kushmerick, Weld et al. - 1997
7:   Semi-automatic wrapper generation for internet information sources - Ashish, Knoblock - 1997
6:   Learning to extract text-based information from the world wide web - Soderland - 1997

BibTeX entry:   (Update)

I. Muslea, S. Minton, and C.A. Knoblock. STALKER: Learning extraction rules for semistructured, Web-based information sources. In Proceedings of AAAI-98 Workshop on AI and Information Integration, Technical Report WS-98-01, AAAI Press, Menlo Park, CA (1998). http://citeseer.ist.psu.edu/muslea98stalker.html   More

@misc{ muslea98stalker,
  author = "I. Muslea and S. Minton and C. Knoblock",
  title = "STALKER: Learning extraction rules for semistructured",
  text = "I. Muslea, S. Minton, and C.A. Knoblock. STALKER: Learning extraction rules
    for semistructured, Web-based information sources. In Proceedings of AAAI-98
    Workshop on AI and Information Integration, Technical Report WS-98-01, AAAI
    Press, Menlo Park, CA (1998).",
  year = "1998",
  url = "citeseer.ist.psu.edu/muslea98stalker.html" }
Citations (may not include all citations):
300   The tsimmis project: integration of heterogeneous informatio.. - Chawathe, Garcia-Molina et al. - 1994
228   Wrapper induction for information extraction - Kushmerick - 1997
101   Modeling web sources for information integration - Knoblock, Minton et al. - 1998
64   Semi-automatic wrapper generation for internet information s.. - Ashish, Knoblock - 1997
43   Semistructured and structured data in the web: going back an.. - Atzeni, Mecca et al. - 1997
2   Cut and paste (context) - Cooperative, Systems et al. - 1997
2   and Chevalier (context) - Chidlovskii, Borghoff - 1997



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://www.isi.edu/ariadne/):
A Hierarchical Approach to Wrapper Induction - Muslea, Minton, Knoblock (1999)   (Correct)
Wrapper Induction for Semistructured, Web-based.. - Muslea, Minton, Knoblock (1998)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC