See this document in CiteSeerX!

Bootstrapping for Text Learning Tasks (1999)  (Make Corrections)  (13 citations)
Rosie Jones, Andrew McCallum, Kamal Nigam, Ellen Riloff
IJCAI-99 Workshop on Text Mining: Foundations, Techniques and Applications



  Home/Search   Context   Related

 
View or download:
cmu.edu/~knigam/pa...strapijcaiws99.ps
cmu.edu/People/kni...strapijcaiws99.ps
cmu.edu/afs/cs/use...strapijcaiws99.ps
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  cmu.edu/afs/cs/project/theo11... (more)
From:  cmu.edu/afs/cs/user/rosie...index
Homepages:  R.Jones  A.Mccallum
  

Rate this article: (best)
  Comment on this article  
Classification, and dictionary learning, using small amounts of seed text, and unlabeled text.

Abstract: When applying text learning algorithms to complex tasks, it is tedious and expensive to hand-label the large amounts of training data necessary for good performance. This paper presents bootstrapping as an alternative approach to learning from large sets of labeled data. Instead of a large quantity of labeled data, this paper advocates using a small amount of seed information and a large collection of easily-obtained unlabeled data. Bootstrapping initializes a learner with the seed information; ... (Update)

Similar documents based on text:
0.2:   Secure Information Flow in Mobile Bootstrapping Process - Liu, Mickunas, Campbell (2000)   (Correct)

BibTeX entry:   (Update)

Jones, R., McCallum, A., Nigam, K., & Riloff, E. (1999). Bootstrapping for text learning tasks. Working Notes of the IJCAI-99 Workshop on Text Mining: Foundations, Techniques and Applications (pp. 52--63). http://citeseer.ist.psu.edu/jones99bootstrapping.html   More

@inproceedings{ jones99bootstrapping,
  author =       "Rosie Jones and Andrew McCallum and Kamal Nigam and Ellen Riloff",
  title =        "Bootstrapping for Text Learning Tasks",
  booktitle =    "IJCAI-99 Workshop on Text Mining: Foundations, Techniques and Applications",
  year =         1999,,
  url = "citeseer.ist.psu.edu/jones99bootstrapping.html" }
Citations (may not include all citations):
2528   Maximum likelihood from incomplete data via the EM algorithm (context) - Dempster, Laird et al. - 1977
2133   Pattern classification and scene analysis (context) - Duda, Hart - 1973
976   Machine Learning (context) - Mitchell - 1997
376   Text categorization with Support Vector Machines: Learning w.. - Joachims - 1998
180   Combining labeled and unlabeled data with co-training - Blum, Mitchell - 1998
149   An evaluation of statistical approaches to text categorizati.. - Yang - 1999
149   Learning to Extract Symbolic Knowledge from the World Wide W.. - Craven, DiPasquo et al. - 1998
140   A comparison of event models for naive Bayes text classifica.. - McCallum, Nigam - 1998
140   Text classification from labeled and unlabeled documents usi.. - Nigam, McCallum et al. - 1999
110   Unsupervised word sense disambiguation rivaling supervised m.. - Yarowsky - 1995
110   Contextsensitive learning methods for text categorization - Cohen, Singer - 1996
105   Learning Information Extraction Rules for Semi-structured an.. - Soderland - 1999
103   at forty: The independence assumption in information retriev.. (context) - Lewis, Bayes - 1998
97   A comparison of two learning algorithms for text categorizat.. - Lewis, Ringuette - 1994
88   Learning trees and rules with set-valued features - Cohen - 1996

[Article contains additional citations not shown here]



The graph only includes citing articles where the year of publication is known.


Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC