• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 10 of 265,409
Next 10 →

from small data sets

by Michael T. Rosenstein, James J. Collins, Carlo J. De Luca , 1992
"... A practical method for calculating largest Lyapunov exponents ..."
Abstract - Add to MetaCart
A practical method for calculating largest Lyapunov exponents

with model uncertainty on small data sets.

by A. A. Voinov, S. Lange, D. Bankamp (eds
"... Abstract: Datasets of population dynamics are typically characterized by a short temporal extension. In this condition, several alternative models typically achieve close accuracy, though returning quite different predictions (model uncertainty). Bayesian model averaging (BMA) addresses this issue b ..."
Abstract - Add to MetaCart
by averaging the prediction of the different models, using as weights the posterior probability of the models. However, an open problem of BMA is the choice of the prior probability of the models, which can largely impact on the inferences, especially when data are scarce. We present Credal Model Averaging

Muoz Entropy estimates of small data sets

by Juan A. Bonachela, Haye Hinrichsen, Miguel A. Muñoz - J. Phys. A Math. Theor , 2008
"... Abstract. Estimating entropies from limited data series is known to be a non-trivial task. Naïve estimations are plagued with both systematic (bias) and statistical errors. Here, we present a new “balanced estimator ” for entropy functionals (Shannon, Rényi and Tsallis) specially devised to provide ..."
Abstract - Cited by 15 (0 self) - Add to MetaCart
a compromise between low bias and small statistical errors, for short data series. This new estimator out-performs other currently available ones when the data sets are small and the probabilities of the possible outputs of the random variable are not close to zero. Otherwise, other well

Generalization And Maximum Likelihood From Small Data Sets

by William Byrne - Proc. IEEESP Workshop on Neural Networks for Signal Processing , 1993
"... INTRODUCTION An often encountered learning problem is maximum likelihood training of exponential models. When the state is only partially specified by the training data, iterative training algorithms are used to produce a sequence of models that assign increasing likelihood to the training data. Al ..."
Abstract - Cited by 8 (6 self) - Add to MetaCart
. Although the performance as measured on the training set continues to improve as the algorithms progress, performance on related data sets may eventually begin to deteriorate. The cause of this behavior can be seen when the training problem is stated in the Alternating Minimization framework [1]. A

Bagging Is A Small-Data-Set Phenomenon

by Nitesh Chawla , Thomas E. Moore, Jr., Kevin W. Bowyer, Lawrence O. Hall, Clayton Springer, Philip Kegelmeyer - IN INTERNATIONAL CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR , 2001
"... Bagging forms a committee of classifiers by bootstrap aggregation of training sets from a pool of training data. A simple alternative to bagging is to partition the data into disjoint subsets. Experiments on various datasets show that, given the same size partitions and bags, the use of disjoint par ..."
Abstract - Cited by 8 (3 self) - Add to MetaCart
Bagging forms a committee of classifiers by bootstrap aggregation of training sets from a pool of training data. A simple alternative to bagging is to partition the data into disjoint subsets. Experiments on various datasets show that, given the same size partitions and bags, the use of disjoint

On Small Data Sets Revealing Big Differences

by Thanasis Hadzilacos, Dimitris Kalles, Christos Pierrakeas, Michalis Xenos - Proceedings of the 4th Panhellenic conference on Artificial Intelligence, Heraklion, Greece, Springer LNCS 3955 , 2006
"... Abstract. We use decision trees and genetic algorithms to analyze the academic performance of students throughout an academic year at a distance learning university. Based on the accuracy of the generated rules, and on crossexaminations of various groups of the same student population, we surprising ..."
Abstract - Cited by 3 (1 self) - Add to MetaCart
Abstract. We use decision trees and genetic algorithms to analyze the academic performance of students throughout an academic year at a distance learning university. Based on the accuracy of the generated rules, and on crossexaminations of various groups of the same student population, we surprisingly observe that students ’ performance is clustered around tutors. 1

• Second session: Experiments – Experiments in small data settings

by Marcello Federico, Wade Shen, Nicola Bertoldi, Chris Callison-burch, Ondrej Bojar, Brooke Cowan, Chris Dyer, Hieu Hoang, Richard Zens, Ra Constantin, Evan Herbst, Christine Moran, Koehn Federico, Moses Toolkit, Confusion Network Experiments , 2006
"... • Open source toolkit – advances state-of-the-art of statistical machine translation models – best performance of European Parliament task – competitive on IWSLT and TC-Star • Factored models – outperform traditional phrase-based models – framework for a wide range of models – integrated approach to ..."
Abstract - Add to MetaCart
• Open source toolkit – advances state-of-the-art of statistical machine translation models – best performance of European Parliament task – competitive on IWSLT and TC-Star • Factored models – outperform traditional phrase-based models – framework for a wide range of models – integrated approach to morphology and syntax • Confusion networks – exploit ambiguous input and outperform 1-best – enable integrated approach to speech translation

A practical method for calculating largest Lyapunov exponents from small data sets

by Michael T. Rosenstein , James J. Collins, Carlo J. De Luca - PHYSICA D , 1993
"... Detecting the presence of chaos in a dynamical system is an important problem that is solved by measuring the largest Lyapunov exponent. Lyapunov exponents quantify the exponential divergence of initially close state-space trajectories and estimate the amount of chaos in a system. We present a new m ..."
Abstract - Cited by 181 (0 self) - Add to MetaCart
to changes in the following quantities: embedding dimension, size of data set, reconstruction delay, and noise level. Furthermore, one may use the algorithm to calculate simultaneously the correlation dimension. Thus, one sequence of computations will yield an estimate of both the level of chaos

Improving the Reliability of Causal Discovery from Small Data Sets Using Argumentation

by Facundo Bromberg, Dimitris Margaritis, Constantin Aliferis
"... We address the problem of improving the reliability of independence-based causal discovery algorithms that results from the execution of statistical independence tests on small data sets, which typically have low reliability. We model the problem as a knowledge base containing a set of independence ..."
Abstract - Cited by 5 (3 self) - Add to MetaCart
We address the problem of improving the reliability of independence-based causal discovery algorithms that results from the execution of statistical independence tests on small data sets, which typically have low reliability. We model the problem as a knowledge base containing a set of independence

Practice of Epidemiology Correcting for Optimistic Prediction in Small Data Sets

by Gordon C. S. Smith, Shaun R. Seaman, Angela M. Wood, Patrick Royston, Ian R. White , 2013
"... The C statistic is a commonly reported measure of screening test performance. Optimistic estimation of the C statistic is a frequent problem because of overfitting of statistical models in small data sets, and methods exist to correct for this issue. However, many studies do not use suchmethods, and ..."
Abstract - Add to MetaCart
The C statistic is a commonly reported measure of screening test performance. Optimistic estimation of the C statistic is a frequent problem because of overfitting of statistical models in small data sets, and methods exist to correct for this issue. However, many studies do not use suchmethods
Next 10 →
Results 1 - 10 of 265,409
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University