(Enter summary)
Abstract: We present a method for learning wrappers for multislot extraction from semi-structured documents. The presented method learns how to construct automatically wrappers from positive examples, consisting of text tuples occurring in the document. These wrappers (T-wrappers) are based on a feature structure unification based pattern language for information extraction. The presented technique is an inductive machine learning method based on a modified version of least general generalization... (Update)
Similar documents based on text: More All
0.3: Learning T-Wrappers for Information Extraction - Thomas (1999)
(Correct)
0.2: Information Extraction in Structured Documents.. - Kosala, Van den.. (2002)
(Correct)
0.2: Core Technologies For Information Agents - Kushmerick, Thomas (2003)
(Correct)
BibTeX entry: (Update)
B. Thomas, `Anti-unification based learning of T-Wrappers for information extraction ', in Proc. of AAAI Workshop on Machine Learning for IE, pp. 15--20. AAAI, (1999). http://citeseer.ist.psu.edu/thomas99antiunification.html More
@inproceedings{ thomas99anti,
author = "Bernd Thomas",
title = "Anti Unification Based Learning of {T}-Wrappers for Information Extraction",
booktitle = "Proceedings of the Workshop on Machine Learning for Information Extraction",
year = "1999",
url = "citeseer.ist.psu.edu/thomas99antiunification.html" }
Citations (may not include all citations):
1838
Foundations of Logic Programming (context) - Lloyd - 1987
460
Mediators in the architecture of future information systems
- Wiederhold - 1992
105
Records for Logic Programming
- Smolka, Treinen - 1994
103
Automated Deduction by Theory Resolution
- Stickel - 1985
68
Unification: A multidisciplinary survey (context) - Knight - 1989
10
Logic programs for intelligent web search
- Thomas
7
Wrapper generation for semi-structured information sources (context) - Asish, Knoblock - 1997
2
Automatic Methods of Inductive Inference (context) - edition - 1971
2
Relational Learning of Pattern-Match Rules for Information E.. (context) - Management, Data et al. - 1997
1
Learning to extract text-based information from the World-Wi.. (context) - Logic, -- - 1997
1
University of Koblenz (context) - Automated, -- et al.
1
An Introduction to Unification-Based Approaches to Grammar (context) - Dissertation, Edinburgh - 1986
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://www.uni-koblenz.de/~bthomas/MIA_HTML/): More
Ubiquitous Web Information Agents - Beuster, Thomas, Wolff (2000)
(Correct)
MIA - An Ubiquitous Multi-Agent Web Information System - Beuster, Thomas, Wolff (2000)
(Correct)
Token-Templates and Logic Programs for Intelligent Web Search - Thomas (2000)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC