See this document in CiteSeerX!

Analyzing the Effectiveness and Applicability Of Co-Training (2000)  (Make Corrections)  (12 citations)
Kamal Nigam, Rayid Ghani
CIKM



  Home/Search   Context   Related

 
View or download:
accenture.com/xdoc/en/services/...6.pdf


From:  accenture.com/x...rayid_ghani.xml (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Recently there has been significantinterest in supervised learning algorithms that combine labeled and unlabeled data for text learning tasks. The co-training setting [1] applies to datasets that have a natural separation of their features into two disjoint sets. We demonstrate that when learning from labeled and unlabeled data, algorithms explicitly leveraging a natural independent split of the features outperform algorithms that do not. When a natural split does not exist, co-training... (Update)

Similar documents based on text:
0.6:   Analyzing the Effectiveness and Applicability of Co-training - Nigam, Ghani (2000)   (Correct)

BibTeX entry:   (Update)

Kamal Nigam and Rayid Ghani. 2000. Analyzing the effectiveness and applicability of co-training. In Proc. of Ninth International Conference on Information and Knowledge (CIKM-2000). http://citeseer.ist.psu.edu/nigam00analyzing.html   More

@inproceedings{ nigam00analyzing,
    author = "Kamal Nigam and Rayid Ghani",
    title = "Analyzing the Effectiveness and Applicability of Co-training",
    booktitle = "{CIKM}",
    pages = "86-93",
    year = "2000",
    url = "citeseer.ist.psu.edu/nigam00analyzing.html" }
Citations (may not include all citations):
2528   Maximum likelihood from incomplete data via the EM algorithm (context) - Dempster, Laird et al. - 1977
180   Combining labeled and unlabeled data with co-training - Blum, Mitchell - 1998
140   Text classification from labeled and unlabeled documents usi.. - Nigam, McCallum et al. - 2000
140   A comparison of event models for naiveBayes text classificat.. - McCallum, Nigam - 1998
130   A probabilistic analysis of the Rocchio algorithm with TFIDF.. - Joachims - 1997
119   Exploiting generative models in discriminative classifiers - Jaakkola, Haussler - 1999
111   Active learning with statistical models - Cohn, Ghahramani et al. - 1996
110   Unsupervised word sense disambiguation rivaling supervised m.. - Yarowsky - 1995
103   at forty: The independence assumption in information retriev.. (context) - Lewis, Bayes - 1998
86   Transductive inference for text classification using support.. - Joachims - 1999
55   Using probabilistic models of document retrieval without rel.. (context) - Croft, Harper - 1979
51   New retrieval approaches using SMART: TREC - Buckley, Singhal et al. - 1996
42   Unsupervised models for named entity classification - Collins, Singer - 1999
35   Employing EM in pool-based active learning for text classifi.. - McCallum, Nigam - 1998
32   Learning dictionaries for information extraction using multi.. - Riloff, Jones - 1999

[Article contains additional citations not shown here]



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://www.accenture.com/xd/xd.asp?it=enWeb&xd=services%5Ctechnology%5Cpeople%5Crayid_ghani.xml):   More
Combining Labeled and Unlabeled Data for MultiClass Text.. - Ghani (2002)   (Correct)
A Study of Approaches to Hypertext - Categorization Yiming Yang   (Correct)
Under considera tion for publica( on in Knowledgeag.. - Rosiejones And Dunja (2003)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC