See this document in CiteSeerX!

Analyzing the Effectiveness and Applicability Of Co-Training (2000)  (Make Corrections)  (12 citations)
Kamal Nigam, Rayid Ghani
CIKM



  Home/Search   Context   Related

 
View or download:
accenture.com/xdoc/en/services/...6.pdf


From:  accenture.com/x...rayid_ghani.xml (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Recently there has been significantinterest in supervised learning algorithms that combine labeled and unlabeled data for text learning tasks. The co-training setting [1] applies to datasets that have a natural separation of their features into two disjoint sets. We demonstrate that when learning from labeled and unlabeled data, algorithms explicitly leveraging a natural independent split of the features outperform algorithms that do not. When a natural split does not exist, co-training... (Update)

Cited by:   More
Adaptive View Validation: A First Step Towards Automatic.. - Ion Muslea Muslea   (Correct)
Gene Functional classification by Semi-supervised - Learning From Heterogeneous   (Correct)
Co-Training and Expansion: Towards Bridging Theory and Practice - Balcan, Blum, Yang (2004)   (Correct)

Similar documents (at the sentence level):
15.9%:   Using Error-Correcting Codes for Efficient Text Classification.. - Ghani (2001)   (Correct)
14.1%:   Understanding the Behavior of Co-training - Nigam, Ghani (2000)   (Correct)
9.9%:   Combining Labeled and Unlabeled Data for MultiClass Text.. - Ghani (2002)   (Correct)

Active bibliography (related documents):   More   All
0.7:   Analyzing the Effectiveness and Applicability of Co-training - Nigam, Ghani (2000)   (Correct)
0.1:   Using Unlabeled Data to Improve Text Classification - Nigam (2001)   (Correct)
0.0:   Active Learning with Multiple Views - Muslea (2002)   (Correct)

Related documents from co-citation:   More   All
12:   Combining labeled and unlabeled data with co-training - Blum, Mitchell - 1998
6:   Unsupervised models for named entity classification - Collins, Singer - 1999
5:   Text classification from labeled and unlabeled documents using EM - Nigam, McCallum et al. - 1999

BibTeX entry:   (Update)

Kamal Nigam and Rayid Ghani. 2000. Analyzing the effectiveness and applicability of co-training. In Proc. of Ninth International Conference on Information and Knowledge (CIKM-2000). http://citeseer.ist.psu.edu/nigam00analyzing.html   More

@inproceedings{ nigam00analyzing,
    author = "Kamal Nigam and Rayid Ghani",
    title = "Analyzing the Effectiveness and Applicability of Co-training",
    booktitle = "{CIKM}",
    pages = "86-93",
    year = "2000",
    url = "citeseer.ist.psu.edu/nigam00analyzing.html" }
Citations (may not include all citations):
2528   Maximum likelihood from incomplete data via the EM algorithm (context) - Dempster, Laird et al. - 1977
180   Combining labeled and unlabeled data with co-training - Blum, Mitchell - 1998
140   Text classification from labeled and unlabeled documents usi.. - Nigam, McCallum et al. - 2000
140   A comparison of event models for naiveBayes text classificat.. - McCallum, Nigam - 1998
130   A probabilistic analysis of the Rocchio algorithm with TFIDF.. - Joachims - 1997
119   Exploiting generative models in discriminative classifiers - Jaakkola, Haussler - 1999
111   Active learning with statistical models - Cohn, Ghahramani et al. - 1996
110   Unsupervised word sense disambiguation rivaling supervised m.. - Yarowsky - 1995
103   at forty: The independence assumption in information retriev.. (context) - Lewis, Bayes - 1998
86   Transductive inference for text classification using support.. - Joachims - 1999
55   Using probabilistic models of document retrieval without rel.. (context) - Croft, Harper - 1979
51   New retrieval approaches using SMART: TREC - Buckley, Singhal et al. - 1996
42   Unsupervised models for named entity classification - Collins, Singer - 1999
35   Employing EM in pool-based active learning for text classifi.. - McCallum, Nigam - 1998
32   Learning dictionaries for information extraction using multi.. - Riloff, Jones - 1999
21   Relational learning with statistical predicate invention: Be.. - Craven, Slattery
2   Linkoping Electronic Atricles in Computer and Information Sc.. (context) - Fjallstrom, graph et al. - 1998
1   the optimalityofthe simple Bayesian classifier under zero-on.. (context) - Domingos, Pazzani - 1997



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://www.accenture.com/xd/xd.asp?it=enWeb&xd=services%5Ctechnology%5Cpeople%5Crayid_ghani.xml):   More
Combining Labeled and Unlabeled Data for MultiClass Text.. - Ghani (2002)   (Correct)
A Study of Approaches to Hypertext - Categorization Yiming Yang   (Correct)
Under considera tion for publica( on in Knowledgeag.. - Rosiejones And Dunja (2003)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC