See this document in CiteSeerX!

B-EM: A Classifier Incorporating Bootstrap with EM Approach for Data Mining  (Make Corrections)  
Xintao Wu, Jianping Fan, Kalpathi R. Subramanian



  Home/Search   Context   Related

 
View or download:
uncc.edu/~xwu/classify/iclassify.pdf
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  uncc.edu/~xwu/pub (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: This paper investigates the problem of augmenting labeled data with unlabeled data to improve classification accuracy. This is significant for many applications such as image classification where obtaining classification labels is expensive, while large unlabeled examples are easily available. We investigate an Expectation Maximization (EM) algorithm for learning from labeled and unlabeled data. The reason why unlabeled data boosts learning accuracy is because it provides the information about... (Update)

Active bibliography (related documents):   More   All
0.8:   B-EM: A Classifier Incorporating Bootstrap with EM.. - Wu, Fan, Subramanian (2002)   (Correct)
0.1:   Dynamic Concept Graph Generation From Stream-Based Cases - Mastenbrook, Berkowitz (2003)   (Correct)
0.1:   A Web Personalization System based on Web Usage.. - Albanese.. (2004)   (Correct)

Similar documents based on text:   More   All
0.6:   Graphical Modeling Based Gene Interaction Analysis for.. - Wu, Ye, Zhang   (Correct)
0.5:   Screening and Interpreting Multi-item Associations Based On.. - Wu, Barbara, Ye (2003)   (Correct)
0.3:   Gene Interaction Analysis Using k-way Interaction.. - Wu, Barbará.. (2003)   (Correct)

BibTeX entry:   (Update)

@misc{ wu-bem,
  author = "Xintao Wu and Jianping Fan and Kalpathi R. Subramanian",
  title = "B-EM: A Classifier Incorporating Bootstrap with EM Approach for Data Mining",
  url = "citeseer.ist.psu.edu/707980.html" }
Citations (may not include all citations):
2528   Maximum likelihood from incomplete data via the em algorithm (context) - Dempster, Laird et al. - 1977
1359   Induction of decision trees (context) - Quinlan - 1986
1051   Optimization and Machine Learning (context) - Goldberg, in - 1999
227   An introduction to computing with neural nets (context) - Lippmann - 1987
145   Sprint: A scalable parallel classifier for data mining - Shafer, Agrawal et al. - 1996
140   Text classification from labeled and unlabeled documents - Nigam, Mccallum et al. - 2000
111   Sliq: A fast scalable classifier for data mining - Mehta, Agrawal et al. - 1996
111   Scaling clustering algorithms to large databases - Bradley, Fayyad et al. - 1998
82   Continuous queries over data streams (context) - Babu, Widom - 2001
76   Supervised learning from incomplete data via an em approach - Ghaharmani, Jordan - 1994
44   Classification Algorithms (context) - James - 1985
37   Boat-optimistic decision tree construction - Gehrke, Ganti et al. - 1999
36   The relative value of labeled and unlabeled samples in patte.. (context) - Castelli, Cover - 1996
35   the exponential value of labeled samples (context) - Castelli, Cover - 1995
13   Chapman and Hall (context) - Chapmann, Tibshirani et al. - 1993
8   Advances in Knowledge Discovery and Data Mining (context) - Cheeseman, Stutz et al. - 1996
2   ect of unlabeled samples in reducing the small sample size p.. (context) - Shanhshanhani, Landgrebe - 1994
http://kdd.ics.uci.edu/databases/CorelFeatures/CorelFeatures.html

Documents on the same site (http://www.cs.uncc.edu/~xwu/pub.htm):   More
GenExplore: Interactive Exploration of Gene Interactions.. - University Of North   (Correct)
Privacy Preserving Database Application Testing - Wu, Wang, Zheng (2003)   (Correct)
Journal of Intelligent Information Systems, 16, 255--276.. - Loglinear-Based Quasi..   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC