Abstract: Because the knowledge discovery process is ill-defined, iterative, and requires intense interaction, algorithm flexibility is crucial. In this paper, we present a straightforward, heuristic generate-and-test search algorithm for knowledge discovery. An analysis of the literature shows that this basic algorithm underlies many of the systems that have had practical success in data mining and knowledge discovery over the past twenty years. We argue that this search algorithm has persevered because...
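The abstract names a heuristic generate-and-test search over rules but this record does not reproduce the algorithm itself. The following is only a minimal sketch of the general technique, not the authors' method. It assumes rules are conjunctions of attribute-value conditions, a beam-limited search, and a simple "positives minus negatives" heuristic; the names `covers`, `score`, `generate_and_test`, `beam_width`, and `max_conditions` are hypothetical and chosen for illustration.

```python
def covers(rule, example):
    """A rule (frozenset of (attribute, value) pairs) covers an example if every condition holds."""
    return all(example.get(attr) == value for attr, value in rule)


def score(rule, examples, target):
    """Assumed heuristic: covered positives minus covered negatives."""
    pos = neg = 0
    for example, label in examples:
        if covers(rule, example):
            if label == target:
                pos += 1
            else:
                neg += 1
    return pos - neg


def generate_and_test(examples, target, beam_width=3, max_conditions=2):
    """Beam-limited generate-and-test search for rules predicting `target`."""
    # Every attribute-value pair seen in the data is a candidate condition.
    conditions = sorted({(a, v) for ex, _ in examples for a, v in ex.items()})

    def rank(rule):
        # Higher score first; ties broken deterministically by the rule's conditions.
        return (-score(rule, examples, target), sorted(rule))

    beam, kept, seen = [frozenset()], [], set()
    for _ in range(max_conditions):
        # Generate: specialize each rule in the beam by one new condition.
        candidates = set()
        for rule in beam:
            used = {attr for attr, _ in rule}
            for cond in conditions:
                if cond[0] not in used:
                    candidates.add(rule | {cond})
        candidates -= seen
        seen |= candidates
        # Test: score the candidates and keep only the best few.
        beam = sorted(candidates, key=rank)[:beam_width]
        kept.extend(beam)
    return sorted(kept, key=rank)[:beam_width]


if __name__ == "__main__":
    # Toy data, invented for this illustration only.
    data = [
        ({"outlook": "sunny", "windy": "no"}, "play"),
        ({"outlook": "sunny", "windy": "yes"}, "play"),
        ({"outlook": "rain", "windy": "yes"}, "stay"),
        ({"outlook": "rain", "windy": "no"}, "stay"),
    ]
    for rule in generate_and_test(data, "play"):
        print(sorted(rule), score(rule, data, "play"))
```

In this sketch the heuristic and the beam width are the main points of flexibility: swapping `score` for another measure, or widening the beam, changes which rules survive without changing the search loop.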
Context of citations to this paper:
...branches. On one hand, a number of researchers explored techniques for identifying large numbers of classification rules [4, 8, 10, 12, 14, 16]. This work was distinguished by the removal of the objective of using the rules for classification and hence of the requirement...
...In [Web95] the authors provide detailed descriptions for efficient admissible search and dynamic search space reorderings. [Pro99] describe a generic heuristic generate and test rule space search algorithm (GAT) and argue that a wide variety of knowledge discovery...
Cited by:
Discovering Associations With Numeric Variables - Webb (2001)
On Detecting Differences Between Groups - Webb, Butler, Newlands (2003)
Cheese: A Generic Search Framework for Data Mining - Ludl (2002)
Similar documents (at the sentence level):
58.8%: Rule-Space Search for Knowledge-Based Discovery - Provost et al. (1999)
5.8%: Exploiting Background Knowledge in Automated Discovery - Aronis, Provost et al. (1996)
Active bibliography (related documents):
0.9: A Survey of Methods for Scaling Up Inductive Algorithms - Provost, Kolluri (1999)
0.8: Augmenting Medical Databases with Domain Knowledge - Aronis, Buchanan, Lee (1996)
0.7: A Survey of Methods for Scaling Up Inductive Learning Algorithms - Provost, Kolluri (1997)
Similar documents based on text:
0.3: Pointwise ROC Confidence Bounds: An Empirical Evaluation - Sofus Macskassy (2005)
0.3: ROC Confidence Bands: An Empirical Evaluation - Sofus Macskassy (2005)
0.3: Scaling Up Inductive Algorithms: An Overview - Provost, Kolluri (1997)
Related documents from co-citation:
3: Mining associations between sets of items in massive databases - Agrawal, Imielinski et al. - 1993
3: Search through systematic set enumeration - Rymon - 1992
3: RL4: A tool for knowledge-based induction - Clearwater, Provost - 1990
BibTeX entry:
F. Provost, J. Aronis, and B. Buchanan. Rule-space search for knowledge-based discovery. CIIO Working Paper IS 99-012, Stern School of Business, New York University, NY, NY 10012, 1999. http://citeseer.ist.psu.edu/article/provost99rulespace.html
@misc{ provost99rulespace,
author = "F. Provost and J. Aronis and B. Buchanan",
title = "Rule-space search for knowledge-based discovery",
note = "{CIIO} Working Paper IS 99-012, Stern School of Business, New York
University, NY, NY 10012.",
year = "1999",
url = "citeseer.ist.psu.edu/article/provost99rulespace.html" }
Citations (may not include all citations):
2177: Programs for Machine Learning - Quinlan - 1993
921: Mining association rules between sets of items in large data.. - Agrawal, Imielinski et al. - 1993
696: UCI repository of machine learning databases - Blake, Keogh et al. - 1998
388: Inductive Logic Programming - Muggleton - 1992
274: Generalization as search - Mitchell - 1982
162: Simplifying decision trees - Quinlan - 1987
83: Generating production rules from decision trees - Quinlan - 1987
80: What makes patterns interesting in knowledge discovery syste.. - Silberschatz, Tuzhilin - 1996
64: NETL: A System for Representing and Using Real-World Knowled.. - Fahlman - 1979
52: and presentation of strong rules - Piatetsky-Shapiro - 1991
49: Adaptive fraud detection - Fawcett, Provost - 1997
42: An information theoretic approach to rule induction from dat.. - Smyth, Goodman - 1992
35: A survey of methods for scaling up inductive algorithms - Provost, Kolluri - 1999
32: Small disjuncts in action: Learning to diagnose errors in th.. - Danyluk, Provost - 1993
31: An SE-tree based characterization of the induction problem - Rymon - 1993
30: Representation design and brute-force induction in a boeing .. - Riddle, Segal et al. - 1994
28: Inductive policy: The pragmatics of bias selection - Provost, Buchanan - 1995
27: Model-directed learning of production rules - Buchanan, Mitchell - 1978
24: Learning decision lists using homogeneous rules - Segal, Etzioni - 1994
23: Scaling up inductive learning with massive parallelism - Provost, Aronis - 1996
22: Incremental version-space merging: A general framework for c.. - Hirsh - 1989
21: Maximizing the predictive value of production rules - Weiss, Galen et al. - 1990
20: RL4: A tool for knowledge-based induction - Clearwater, Provost - 1990
17: SPRINT: A scalable parallel classifier for data mining - Shafer, Agrawal et al. - 1996
16: An evaluation of machine-learning methods for predicting pne.. - Cooper - 1997
15: The WoRLD: Knowledge discovery from multiple distributed dat.. - Aronis, Kolluri et al. - 1997
14: Unexpectedness as a measure of interestingness in knowledge .. - Padmanabhan, Tuzhilin - 1999
14: DENDRAL and META-DENDRAL: their applications dimension - Buchanan, Feigenbaum - 1978
13: Exploiting background knowledge in automated discovery - Aronis, Provost et al. - 1996
13: Linear time rule induction - Domingos - 1996
12: The use of background knowledge in decision tree induction - Núñez - 1991
11: Knowledge-based learning in exploratory science: Learning ru.. - Lee, Buchanan et al. - 1998
10: SLIQ: A fast scalable classifier for data mining - Mehta, Agrawal et al. - 1996
9: Expert-driven validation of rule-based user models in person.. - Adomavicius, Tuzhilin - 2001
6: Complete anytime beam search - Zhang - 1998
6: A rule-learning program in high energy physics event classific.. - Clearwater, Stern - 1991
5: OPUS: An efficient admissible algorithm for unordered search - Webb - 1995
5: Separate-and-conquer rule learning - Fürnkranz - 1999
4: commerce and data mining: Architecture and challenges - Ansari, Kohavi et al. - 2000
4: Beam search - Bisiani - 1987
4: poisoning and abuse - Krenzelok, Jacobsen et al.
4: Declarative bias: An overview - Russell, Grosof - 1990
3: Efficient search for association rules - Webb - 2000
3: On handling tree-structure attributes in decision tree learn.. - Almuallim, Akiba et al. - 1995
3: A heuristic programming study of theory formation in science - Buchanan, Feigenbaum et al. - 1971
2: Botanical scoundrels and emergency department visits - Krenzelok, Jacobsen et al. - 1995
2: Special issue on applications and the knowledge discovery pr.. - Kohavi, Provost - 1998
2: Combining data mining and machine learning for effective user.. - Fawcett, Provost - 1996
2: Problem solving and rule induction: A unified view - Simon, Lea - 1973
1: Hemlock ingestions: the most deadly plant exposures - Krenzelok, Jacobsen et al. - 1996
1: Abstract of presentation given at the 1995 North American Co.. - Krenzelok, Jacobsen et al. - 1995
1: Efficiently constructing relational features from background kn.. - Aronis, Provost - 1994
1: Quantifying inductive bias: AI learning algorithms and Valian.. - Haussler - 1988
1: RL: An innovative tool for predicting developmental toxicity - Gomez, Lee et al. - 1994
1: Use of a learning program for trigger sensitivity studies - Clearwater, Lee - 1993
1: Identification of developmental toxicants using a rule learnin.. - Gomez, Lee et al. - 1993
1: Efficiently inducing determinations: A complete and systematic .. - Schlimmer - 1993
1: Efficient mining of statistical dependencies - Oates, Schmill et al. - 1999
Documents on the same site (http://www.stern.nyu.edu/~fprovost/Classes/rolling-readings-syllabus.html):
Multiple Comparisons in Induction Algorithms - Jensen, Cohen (1999)
Robust Classification Analysis for Performance Evaluation - Provost, Fawcett (2001)