87 citations found. Retrieving documents...
A. J. Miller. Subset selection in regression. In Monographs on Statistics and Applied Probability 40. London: Chapman and Hall, 1990.

 Home/Search   Document Not in Database   Summary   Related Articles   Check  

This paper is cited in the following contexts:

First 50 documents  Next 50

Mining Taxation Data with Parallel BMARS - Sergey Bakin Department (1999)   (Correct)

....points N as well as the number of predictor variables. This makes BMARS suitable for performing an ecient large scale data analysis often referred to as Data Mining. Now we would like to contrast the MARS and BMARS algorithms with a standard statistical forward subset selection procedure (FSS) [11]. MARS builds a model by adding basis functions one at a step and so does the FSS algorithm. However, the former selects a new basis function from a relatively small subset of all available tensor product basis functions. Speci cally, at each step such a subset is comprised of basis functions ....

Miller, A.J., Subset Selection in Regression, Chapman and Hall, 1990.


DIRECT: a system for mining data value conversion rules.. - Fan, Lu, Madnick, Cheung   (Correct)

....of relevant attribute analysis and their scatter plot. A number of techniques have been developed to automatically select and rank models from a set of attributes, like t test, step wise regression. All of these methods are available in the popular statistical packages, like SAS, SPSS, S PLUS [24]. However, these model selection methods tend to favor those models that involve more attributes, which make them very vulnerable in the real modelling process [27] We adopt the approach proposed by Raftery [27] To assess a model, Raftery [27] introduced the Bayesian Information Criterion (BIC) ....

A. Miller, Subset Selection in Regression, Chapman & Hall, New York, 1990.


Computational Techniques and Applications Conference: Can MARS.. - Bakin, al. (1998)   (Correct)

....to enter the model, the forward stepwise procedure is allowed to produce an excess number of basis functions. In order to get rid of suboptimal functions one has to apply the second module of the MARS algorithm. This is essentially just a standard statistical backward subset selection algorithm [7]. This part of MARS selects a submodel which minimizes an estimate of prediction error. The Generalized Cross Validation criterion (GCV) 5] is suggested. GCV (J) m=1 [y m Gamma f J (x m ) 1 Gamma C(J) # 2 (2.5) The numerator in GCV is the average residual squared error and ....

Miller, A.J., Subset Selection in Regression. Chapman and Hall, 1990.


Geometrical Selection of Important Inputs with Feedforward Neural.. - Rossi (1997)   (Correct)

.... is also very important to notice that variable suppression speeds up the acquisition process (i.e. the measurement of the real world in order to produce the input variables) A huge work has been done by statisticians in order to study variable selection when the regression tool is linear (e.g. [5]) The purpose of this article is not to # This work was performed on Mrs Kim K. PHAM s responsibility, at THOMSON CSF AIRSYS. Published in ICANNGA 97 Proceedings. Available at http: apiacoa.org publications 1997 icannaga97.pdf review these methods but to work on problems that cannot be ....

A. J. Miller. Subset Selection in Regression. Chapman and Hall, 1990.


Converting A Trained Neural Network To A Decision Tree - Dectext -.. - Boz (2000)   (Correct)

....5.1.1.2 Sequential Search Algorithms Sequential Search algorithms have a complexity of O(n ) They add and or delete features to from the current subset sequentially. They usually use hill climbing strategy for the search. 104 5.1.1.2. 1 Sequential Forward Selection (SFS) In SFS [Miller, 1990, Devijver and Kittler, 1982] search starts with an empty set. First, feature subsets with only one feature are evaluated and the best feature (f is selected. Then two feature combinations of f with the other features are tested and the best feature subset is selected. The search goes on ....

Miller, A. J. (1990). Subset Selection in Regression. Chapman and Hall.


Feature Subset Selection by Using Sorted Feature Relevance - Boz (2002)   (1 citation)  (Correct)

....real world applications. 3.2 Sequential Search Algorithms Sequential Search algorithms have a complexity of O(n ) They add and or delete features to from the current subset sequentially. They usually use hill climbing strategy for the search. 3.2. 1 Sequential Forward Selection (SFS) In SFS [9, 5] search starts with an empty set. First, feature subsets with only one feature are evaluated and the best feature (f # ) is selected. Then, two feature combinations of f # with the other features are tested and the best feature subset is selected. The search goes on like that by adding one more ....

A. J. Miller. Subset Selection in Regression. Chapman and Hall, 1990.


The Problem of Sparse Image Coding - Pece (2001)   (2 citations)  (Correct)

....a sparse coecient vector, is that the prior densities of the sources have zero mode. This assumption is discussed in section 3. 2 1. 1 Related research The principle of nding an economical description of data can be traced back to Occam s razor and has important applications in statistics [24, 41] The generative model formulation outlined in section 2 has two advantages: following a Bayesian framework, the economy of a code can be conveniently expressed as the prior probability distribution of source values; the Bayesian (subjective) interpretation of probabilities is unnecessary, ....

....pursuit reduces the number of false positives by removing the image components likely to arise from the background. Matching pursuit has also been applied to the processing of seismic data [37] The problem of nding sparse linear expansions is common in numerical linear algebra [45] statistics [24, 41], system identi cation [12] and control [64] Matching pursuit and related methods have found applications in these elds. 7.2 Biological implications Responses of simple cells in visual cortical area V1 can be described, to a rst approximation, as (halfwave recti ed) projections of the retinal ....

A.J. Miller, Subset selection in regression. London: Chapman and Hall, 1990.


Data Mining At The Interface Of Computer Science And Statistics - Smyth (2001)   (1 citation)  (Correct)

....is always the possibility that any satisfactory results may simply be due to chance rather than to any merit inherent in the method yielding the results. 7.1. Variable Selection for Regression A classic example of data mining in this context occurs in variable selection for regression (e.g. [Mil90]) especially when applied to a relatively small training data set with no data used for holdout testing. Lets Data Mining At the Interface of Computer Science and Statistics 8 17 say we are trying to fit a simple linear regression model of the form Y = ff 0 ff 1 X i ff 2 X j e; 1 i; j ....

Miller, A. J. (1990) Subset Selection in Regression, London, Chapman and Hall.


Feature Subset Selection by Bayesian networks: a.. - Inza..   (Correct)

....lines of research. II. Feature Subset Selection as a search problem Even if we locate our work as a Machine Learning or Data Mining approach, the FSS literature includes plenty of works in other fields such as Pattern Recognition (Jain and Chandrasekaran [27] Kittler [29] Statistic (Miller [40], Narendra and Fukunaga [46] or TextLearning (Mladeni c [41] In the Bayesian network community we have the example of the work of Provan and Singh [51] who, using a Bayesian network as a classifier, build it selecting in a greedy manner the variables that should maximize the predictive ....

Miller, A.J., Subset Selection in Regression, Chapman and Hall, Washington, DC, 1990.


Data Splitting Strategies for Reducing the Effect of Model.. - Faraway (1995)   (Correct)

....the model. 2. Estimation of the parameters of the selected model. Point predictions can then be made. 3. Assessment of the variability in the predictions. Conceivably we could split the data three ways and use a different part for each of the above tasks. This has been suggested in passing by Miller (1990, p. 13) and is, perhaps, what Mosteller and Tukey (1977) meant by double cross validation. I investigated this strategy and found it clearly inferior to any of those discussed below, so henceforth I restrict attention to splitting the data into two parts. Which parts of the data should be used ....

....model. This suggests a second data splitting strategy (B) where the first sample is used only to select the model and the second sample is used both to estimate the parameters of the model leading to predictions and to assess the variability in those predictions. This is the approach mentioned by Miller (1990, p. 13) and Hurvich and Tsai (1990) and actually implemented by Faraway (1992) Interestingly, none of the proponents of these two approaches seems to have considered the other. A third strategy (C) can also be motivated by the arrival of new data. One might retain the originally selected model, ....

Miller, A. (1990). Subset Selection in Regression. Chapman and Hall, London.


Estimating the Genetic Architecture of Quantitative Traits - Zeng, Kao, Basten (1999)   (Correct)

....(It needs to be pointed out that the IC criteria can be used to compare any models, not necessarily the nested models under stepwise regression analysis. For example, it can be shown that the likelihood ratio test statistic can be expressed as LR k 2 log(L k L k # ) n log(c(n) n 1) E c(n) (Miller, 1990, p. 208) The criterion is basically defined by c(n) if interpreted under BIC. Using AIC with c(n) 2 would yield a final LOD threshold of 043 In reference to QTL analysis on markers, Broman (1997) suggested using c(n) # log n and recommended that # be between 2 and 3. For n 100C 500, the LOD ....

....model selection can be inflated about 25 in our analysis. However, when the heritability is high, say 05 or higher, the estimate is usually close to the parameter value. Also, part of the bias can be corrected by using adjusted R# to estimate the total variance explained by QTL (Miller, 1990). To obtain the sampling variances of architectural parameter estimates, Kao Zeng (1997) described a Estimating the genetic architecture of quantitatie traits 287 procedure based on the Fisher information matrix allowing for missing data. The confidence intervals of parameter estimates can also ....

Miller, A. J. (1990). Subset Selection in Regression. London : Chapman and Hall.


Overfit Bounds for Classification Algorithms - Gat (2000)   (Correct)

.... by n 2 (EP f L r=n 2 ) 2 k : Therefore, a level 1 2 UCB for the expected risk is P 2 k (1 ) n 1 E S f L v n k ( If a greedy search for a subset of regression variables within some candidate set is performed, such as the forward or backward stepping procedures ([14]) then the search tree methodology can be applied to obtain a UCB for the risk of the resulting regression function. The following adaptation for Algorithm 2 calculates a level 1 4 UCB for the risk of the forward stepping procdure (as above, P i i ) The forward stepping procedure adds the ....

A.J. Miller. Subset selection in regression. Chapman and Hall, NY, 1990.


Bayesian Networks for Feature Subset Selection - Inza, Larrañaga, Sierra (2000)   (Correct)

.... Subset Selection as a search problem Even if we locate our work as a Machine Learning or Data Mining approach, the FSS literature includes plenty of works in other fields such as Pattern Recognition (Jain and Chandrasekaran [27] Kittler [31] Stearns [61] Statistic (Boyce et al. 7] Miller [43], Narendra and Fukunaga [49] or TextLearning (Mladeni c [44] In the Bayesian network community we have the example of the work of Provan and Singh [55] who, using a Bayesian network as a classifier, build it selecting in a greedy manner the variables that should maximize the predictive ....

A.J. Miller, Subset Selection in Regression, Chapman and Hall, Washington, DC, 1990.


Journal of Machine Learning Research 1 (2002) 1-48.. - Support Vector Machines   (Correct)

No context found.

A. J. Miller. Subset selection in regression. In Monographs on Statistics and Applied Probability 40. London: Chapman and Hall, 1990.


A Sparse Signal Reconstruction Perspective for . . . - Malioutov (2003)   (Correct)

No context found.

A. J. Miller, Subset Selection in Regression, Chapman and Hall, 2002.


Validating Linear Restrictions in Linear - Regression Models With   (Correct)

No context found.

Miller, A. (2002), Subset selection in regression, 2nd edn., Chapman & Hall/CRC, Florida.


Online and Offline Character Recognition Using Alignment.. - Jonathan Alon Vassilis (2005)   (Correct)

No context found.

A. Miller. Subset Selection in Regression. Chapman and Hall, 1990.


Sparse Approximations for High Fidelity Compression.. - Aiello, Gilbert.. (2005)   (Correct)

No context found.

MILLER, A. J. Subset selection in regression, 2nd ed. Chapman and Hall, London, 2002.


Algorithmic Stability and Sanity-Check Bounds for Leave-One-Out .. - Kearns, Ron (1997)   (42 citations)  (Correct)

No context found.

A.J. Miller. Subset Selection in Regression. Chapman and Hall, 1990.


Feature Selection and Classifier Ensembles: A Study on.. - Yu (2003)   (Correct)

No context found.

A.J. Miller. Subset Selection in Regression. Chapman and Hall, Washtington D.C., 1990. 44


Simultaneous Feature Selection and Clustering Using Mixture.. - Law, Figueiredo, Jain (2004)   (Correct)

No context found.

A. J. Miller. Subset Selection in Regression. Chapman & Hall, London, 2002.


A Comparison of Pruning Algorithms for Sparse.. - Hoegaerts..   (Correct)

No context found.

A. J. Miller, Subset Selection in Regression. Chapman & Hall, 1990.


Least Squares and Maximum Likelihood - Osborne   (Correct)

No context found.

A. J. Miller, Subset selection in regression, Chapman and Hall, 1990.


Automatic Construction of Decision Trees from Data: A.. - Murthy (1997)   (37 citations)  (Correct)

No context found.

A. J. Miller. Subset Selection in Regression. Chapman and Hall, 1990.


Regression with Multiple Candidate Models: Selecting or Mixing? - Yang (2000)   (1 citation)  (Correct)

No context found.

Miller, A. J. (1990) Subset Selection in Regression, New York, Chapman-Hall.

First 50 documents  Next 50

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC