See this document in CiteSeerX!

Mining Massive Relational Databases  (Make Corrections)  
Geoff Hulten, Pedro Domingos, Yeuhi Abe



  Home/Search   Context   Related

 
View or download:
washington.edu/homes/pedro...srl03b.pdf
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  washington.edu/homes/pedrod/ (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: There is a large and growing mismatch between the size of the relational data sets available for mining and the amount of data our relational learning systems can process. In particular, most relational learning systems can operate on data sets containing thousands to tens of thousands of objects, while many real-world data sets grow at a rate of millions of objects a day. In this paper we explore the challenges that prevent relational learning systems from operating on massive data... (Update)

Active bibliography (related documents):   More   All
0.6:   Research on Statistical Relational Learning at the University of .. - Domingos (2003)   (Correct)
0.5:   A Note on the Unification of Information Extraction and Data .. - McCallum, Jensen (2003)   (Correct)
0.3:   Automated Modeling and Nonlinear Axis Scaling - Leejay Wu (2005)   (Correct)

Similar documents based on text:   More   All
0.3:   Mining High-Speed Data Streams - Domingos, Hulten (2000)   (Correct)
0.2:   Mining Time-Changing Data Streams - Hulten, Spencer, Domingos (2001)   (Correct)
0.2:   Momentum-based Parameterization of Dynamic Character Motion - Abe, Liu, Popovic (2004)   (Correct)

BibTeX entry:   (Update)

@misc{ hulten-mining,
  author = "Geoff Hulten and Pedro Domingos and Yeuhi Abe",
  title = "Mining Massive Relational Databases",
  url = "citeseer.ist.psu.edu/592179.html" }
Citations (may not include all citations):
2177   Programs for Machine Learning (context) - Quinlan - 1993
576   Authoritative sources in a hyperlinked environment - Kleinberg - 1998
492   Learning logical definitions from relations (context) - Quinlan - 1990
375   Probability inequalities for sums of bounded random variable.. (context) - Hoeffding - 1963
344   The PageRank citation ranking: Bringing order to the web - Page, Brin et al. - 1998
342   Wrappers for feature subset selection - Kohavi, John - 1997
212   Inductive logic programming: techniques and applications (context) - Lavrac, Dzeroski - 1994
194   Online aggregation - Hellerstein, Hass et al. - 1997
145   SPRINT: A scalable parallel classifier for data mining - Shafer, Agrawal et al. - 1996
103   Learning probabilistic relational models - Friedman, Getoor et al. - 1999
92   Mining high-speed data streams - Domingos, Hulten - 2000
43   Cached sufficient statistics for efficient machine learning .. - Moore, Lee - 1997
26   Scale-free characteristics of random networks: The topology .. - Barabasi, Albert et al. - 2000
22   Scaling up inductive logic programming by learning from inte.. - Blockeel, Raedt et al. - 1999
21   Relational learning with statistical predicate invention: be.. - Slattery, Craven - 2001
16   Learning probabilistic models of relational structure - Getoor, Friedman et al. - 2001
12   Linkage and autocorrelation cause feature selection bias in .. - Jensen, Neville - 2002
6   Mining complex models from arbitrarily large databases in co.. - Hulten, Domingos - 2002
4   Schemas and models - Jensen, Neville - 2002
3   Autocorrelation and linkage cause bias in evaluation of rela.. (context) - Jensen, Neville - 2002
2   Statistical challenges to inductive inference in linked data - Jensen - 1998
1   Estimating attributes: Analysys and extensions of relief (context) - Kononenko - 1994

Documents on the same site (http://www.cs.washington.edu/homes/pedrod/):   More
Context-Sensitive Feature Selection for Lazy Learners - Domingos (1997)   (Correct)
Why Does Bagging Work? A Bayesian Account and its Implications - Domingos   (Correct)
Two-Way Induction - Domingos (1995)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC