(Enter summary)
Abstract: Recently there has been significantinterest in supervised
learning algorithms that combine labeled and unlabeled data
for text learning tasks. The co-training setting [1] applies to
datasets that have a natural separation of their features into
two disjoint sets. We demonstrate that when learning from
labeled and unlabeled data, algorithms explicitly leveraging
a natural independent split of the features outperform algorithms
that do not. When a natural split does not exist,
co-training... (Update)
Similar documents based on text:
0.6: Analyzing the Effectiveness and Applicability of Co-training - Nigam, Ghani (2000)
(Correct)
BibTeX entry: (Update)
Kamal Nigam and Rayid Ghani. 2000. Analyzing the effectiveness and applicability of co-training. In Proc. of Ninth International Conference on Information and Knowledge (CIKM-2000). http://citeseer.ist.psu.edu/nigam00analyzing.html More
@inproceedings{ nigam00analyzing,
author = "Kamal Nigam and Rayid Ghani",
title = "Analyzing the Effectiveness and Applicability of Co-training",
booktitle = "{CIKM}",
pages = "86-93",
year = "2000",
url = "citeseer.ist.psu.edu/nigam00analyzing.html" }
Citations (may not include all citations):
2528
Maximum likelihood from incomplete data via the EM algorithm (context) - Dempster, Laird et al. - 1977
180
Combining labeled and unlabeled data with co-training
- Blum, Mitchell - 1998
140
Text classification from labeled and unlabeled documents usi..
- Nigam, McCallum et al. - 2000
140
A comparison of event models for naiveBayes text classificat..
- McCallum, Nigam - 1998
130
A probabilistic analysis of the Rocchio algorithm with TFIDF..
- Joachims - 1997
119
Exploiting generative models in discriminative classifiers
- Jaakkola, Haussler - 1999
111
Active learning with statistical models
- Cohn, Ghahramani et al. - 1996
110
Unsupervised word sense disambiguation rivaling supervised m..
- Yarowsky - 1995
103
at forty: The independence assumption in information retriev.. (context) - Lewis, Bayes - 1998
86
Transductive inference for text classification using support..
- Joachims - 1999
55
Using probabilistic models of document retrieval without rel.. (context) - Croft, Harper - 1979
51
New retrieval approaches using SMART: TREC
- Buckley, Singhal et al. - 1996
42
Unsupervised models for named entity classification
- Collins, Singer - 1999
35
Employing EM in pool-based active learning for text classifi..
- McCallum, Nigam - 1998
32
Learning dictionaries for information extraction using multi..
- Riloff, Jones - 1999
[Article contains additional citations not shown here]
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://www.accenture.com/xd/xd.asp?it=enWeb&xd=services%5Ctechnology%5Cpeople%5Crayid_ghani.xml): More
Combining Labeled and Unlabeled Data for MultiClass Text.. - Ghani (2002)
(Correct)
A Study of Approaches to Hypertext - Categorization Yiming Yang
(Correct)
Under considera tion for publica( on in Knowledgeag.. - Rosiejones And Dunja (2003)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC