• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

DMCA

Wrapper Induction for Information Extraction (1997)

Cached

  • Download as a PDF

Download Links

  • [www.cs.ucd.ie]
  • [ftp.cs.washington.edu]
  • [www.cs.wisc.edu]
  • [www.cs.wisc.edu]
  • [pages.cs.wisc.edu]
  • [pages.cs.wisc.edu]
  • [pages.cs.wisc.edu]
  • [pages.cs.wisc.edu]

  • Other Repositories/Bibliography

  • DBLP
  • Save to List
  • Add to Collection
  • Correct Errors
  • Monitor Changes
by Nicholas Kushmerick
Citations:624 - 30 self
  • Summary
  • Citations
  • Active Bibliography
  • Co-citation
  • Clustered Documents
  • Version History

BibTeX

@MISC{Kushmerick97wrapperinduction,
    author = {Nicholas Kushmerick},
    title = {Wrapper Induction for Information Extraction},
    year = {1997}
}

Share

Facebook Twitter Reddit Bibsonomy

OpenURL

 

Abstract

The Internet presents numerous sources of useful information---telephone directories, product catalogs, stock quotes, weather forecasts, etc. Recently, many systems have been built that automatically gather and manipulate such information on a user's behalf. However, these resources are usually formatted for use by people (e.g., the relevant content is embedded in HTML pages), so extracting their content is difficult. Wrappers are often used for this purpose. A wrapper is a procedure for extracting a particular resource's content. Unfortunately, hand-coding wrappers is tedious. We introduce wrapper induction, a technique for automatically constructing wrappers. Our techniques can be described in terms of three main contributions. First, we pose the problem of wrapper construction as one of inductive learn...

Keyphrases

wrapper induction    information extraction    many system    particular resource    hand-coding wrapper    relevant content    product catalog    useful information telephone directory    inductive learn    weather forecast    html page    stock quote    wrapper construction    main contribution    internet present numerous source   

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University