
Mining Time-Changing Data Streams (2001)  (53 citations)
Geoff Hulten, Laurie Spencer, Pedro Domingos
Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining




Abstract: Most statistical and machine-learning algorithms assume that the data is a random sample drawn from a stationary distribution. Unfortunately, most of the large databases available for mining today violate this assumption. They were gathered over months or years, and the underlying processes generating them changed during this time, sometimes radically. Although a number of algorithms have been proposed for learning time-changing concepts, they generally do not scale well to very large...

Cited by:
Boosting Classifiers for Drifting Concepts - Scholz, Klinkenberg (2006)
An Ensemble Classifier for Drifting Concepts - Scholz, Klinkenberg (2005)
Issues in Data Stream Management - Lukasz Golab And (2003)

Similar documents (at the sentence level):
6.8%:   Mining High-Speed Data Streams - Domingos, Hulten (2000)

Active bibliography (related documents):
1.3:   Thesis Proposal - Ruoming Jin Department
0.5:   E4 - Machine Learning - Domingos
0.5:   MetaCost: A General Method for Making Classifiers Cost-Sensitive - Domingos (1999)

Similar documents based on text:
0.8:   Drifting Concepts as Hidden Factors in Clinical Studies - Kukar
0.3:   Mining Complex Models from Arbitrarily Large Databases in.. - Hulten, Domingos (2002)
0.2:   Mining Massive Relational Databases - Hulten, Domingos, Abe

Related documents from co-citation:
31:   Mining High-Speed Data Streams - Domingos, Hulten - 2000
27:   Clustering data streams - Guha, Mishra et al. - 2000
23:   Models and Issues in Data Stream Systems - Babcock, Babu et al. - 2002

BibTeX entry:

Hulten, G., Spencer, L., and Domingos, P. Mining time-changing data streams. KDD-01, San Francisco, CA, 2001. http://citeseer.ist.psu.edu/hulten01mining.html

@inproceedings{ hulten-mining,
  author = "G. Hulten and L. Spencer and P. Domingos",
  title = "Mining Time-Changing Data Streams",
  booktitle="Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining",
  year="2001",
  pages="97-106",
  address="San Francisco, CA",
  publisher="ACM Press",
  url = "citeseer.ist.psu.edu/hulten01mining.html" }
Citations (may not include all citations):
2177   Programs for Machine Learning - Quinlan - 1993
157   Probability inequalities for sums of bounded random variable.. - Hoeffding - 1963
121   Classification and Regression Trees - Breiman, Friedman et al. - 1984
106   Maintenance of discovered association rules in large databas.. - Cheung, Han et al. - 1996
92   Mining high-speed data streams - Domingos, Hulten - 2000
62   Megainduction: Machine Learning on Very Large Databases - Catlett - 1991
58   Organization-based analysis of Web-object sharing and cachin.. - Wolman, Voelker et al. - 1999
57   Learning in the presence of concept drift and hidden context.. - Widmer, Kubat - 1996
37   BOAT: optimistic decision tree construction - Gehrke, Ganti et al. - 1999
33   Activity monitoring: Noticing interesting changes in behavio.. - Fawcett, Provost - 1999
28   Decision theoretic subsampling for induction on large databa.. - Musick, Catlett et al. - 1993
27   Mining surprising patterns using temporal description length - Chakrabarti, Sarawagi et al. - 1998
26   Beyond incremental processing: Tracking concept drift - Schlimmer, Granger - 1986
24   Simultaneous Statistical Inference - Miller - 1981
19   Active data mining - Agrawal, Psaila - 1995
17   SPRINT: A scalable parallel classifier for data mining - Shafer, Agrawal et al. - 1996
17   DEMON: Mining and monitoring evolving data - Ganti, Gehrke et al. - 2000
10   SLIQ: A fast scalable classifier for data mining - Mehta, Agrawal et al. - 1996
9   The complexity of learning according to two models of a drif.. - Long - 1999
7   Learning changing concepts by exploiting the structure of ch.. - Bartlett, Ben-David et al. - 2000
5   An adaptive algorithm for incremental mining of association .. - Sarda, Srinivas - 1998
5   An efficient algorithm to update large itemsets with early prun.. - Ayan, Tansel et al. - 1999
2   Density-adaptive learning and forgetting - Salganicoff - 1993
2   Special issue on context sensitivity and concept drift - Widmer, Kubat - 1998
2   The impact of changing populations on classifier performance - Kelly, Hand et al. - 1999
2   Institute for Information Technology of the National Researc.. - Turney, bibliography - 1998





Documents on the same site (http://www.cs.washington.edu/homes/pedrod/):
Context-Sensitive Feature Selection for Lazy Learners - Domingos (1997)
Why Does Bagging Work? A Bayesian Account and its Implications - Domingos
Two-Way Induction - Domingos (1995)
