Classification, and dictionary learning, using small amounts of seed text, and unlabeled text.
Abstract: When applying text learning algorithms to complex tasks, it is tedious and expensive to hand-label the large amounts of training data necessary for good performance. This paper presents bootstrapping as an alternative approach to learning from large sets of labeled data. Instead of a large quantity of labeled data, this paper advocates using a small amount of seed information and a large collection of easily-obtained unlabeled data. Bootstrapping initializes a learner with the seed information; ... (Update)
Similar documents based on text:
0.2: Secure Information Flow in Mobile Bootstrapping Process - Liu, Mickunas, Campbell (2000)
(Correct)
BibTeX entry: (Update)
Jones, R., McCallum, A., Nigam, K., & Riloff, E. (1999). Bootstrapping for text learning tasks. Working Notes of the IJCAI-99 Workshop on Text Mining: Foundations, Techniques and Applications (pp. 52--63). http://citeseer.ist.psu.edu/jones99bootstrapping.html More
@inproceedings{ jones99bootstrapping,
author = "Rosie Jones and Andrew McCallum and Kamal Nigam and Ellen Riloff",
title = "Bootstrapping for Text Learning Tasks",
booktitle = "IJCAI-99 Workshop on Text Mining: Foundations, Techniques and Applications",
year = 1999,,
url = "citeseer.ist.psu.edu/jones99bootstrapping.html" }
Citations (may not include all citations):
2528
Maximum likelihood from incomplete data via the EM algorithm (context) - Dempster, Laird et al. - 1977
2133
Pattern classification and scene analysis (context) - Duda, Hart - 1973
976
Machine Learning (context) - Mitchell - 1997
376
Text categorization with Support Vector Machines: Learning w..
- Joachims - 1998
180
Combining labeled and unlabeled data with co-training
- Blum, Mitchell - 1998
149
An evaluation of statistical approaches to text categorizati..
- Yang - 1999
149
Learning to Extract Symbolic Knowledge from the World Wide W..
- Craven, DiPasquo et al. - 1998
140
A comparison of event models for naive Bayes text classifica..
- McCallum, Nigam - 1998
140
Text classification from labeled and unlabeled documents usi..
- Nigam, McCallum et al. - 1999
110
Unsupervised word sense disambiguation rivaling supervised m..
- Yarowsky - 1995
110
Contextsensitive learning methods for text categorization
- Cohen, Singer - 1996
105
Learning Information Extraction Rules for Semi-structured an..
- Soderland - 1999
103
at forty: The independence assumption in information retriev.. (context) - Lewis, Bayes - 1998
97
A comparison of two learning algorithms for text categorizat..
- Lewis, Ringuette - 1994
88
Learning trees and rules with set-valued features
- Cohen - 1996
[Article contains additional citations not shown here]
The graph only includes citing articles where the year of publication is known.
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC