The adaptive nature of human categorization
- Psychological Review
, 1991
Cited by 159 (2 self)
A rational model of human categorization behavior is presented that assumes that categorization reflects the derivation of optimal estimates of the probability of unseen features of objects. A Bayesian analysis is performed of what optimal estimations would be if categories formed a disjoint partitioning of the object space and if features were independently displayed within a category. This Bayesian analysis is placed within an incremental categorization algorithm. The resulting rational model accounts for effects of central tendency of categories, effects of specific instances, learning of linearly nonseparable categories, effects of category labels, extraction of basic level categories, base-rate effects, probability matching in categorization, and trial-by-trial learning functions. Although the rational model considers just 1 level of categorization, it is shown how predictions can be enhanced by considering higher and lower levels. Considering prediction at the lower, individual level allows integration of this rational analysis of categorization with the earlier rational analysis of memory (Anderson & Milson, 1989). Anderson (1990) presented a rational analysis of human cognition. The term rational derives from similar "rational-man" analyses in economics. Rational analyses in other fields are sometimes called adaptationist analyses. Basically, they are efforts to explain the behavior in some domain on the assumption that the behavior is optimized with respect to some criteria of adaptive importance. This article begins with a general characterization of how one develops a rational theory of a particular cognitive phenomenon. Then I present the basic theory of categorization developed in Anderson (1990) and review the applications from that book. Since the writing of the book, the theory has been greatly extended and applied to many new phenomena. Most of this article describes these new developments and applications.
A Rational Analysis Several theorists have promoted the idea that psychologists might understand human behavior by assuming it is adapted to the environment (e.g., Brunswik, 1956; Campbell, 1974; Gib-
A neuropsychological theory of multiple systems in category learning
- PSYCHOLOGICAL REVIEW
, 1998
Cited by 131 (12 self)
A neuropsychological theory is proposed that assumes category learning is a competition between separate verbal and implicit (i.e., procedural-learning-based) categorization systems. The theory assumes that the caudate nucleus is an important component of the implicit system and that the anterior cingulate and prefrontal cortices are critical to the verbal system. In addition to making predictions for normal human adults, the theory makes specific predictions for children, elderly people, and patients suffering from Parkinson's disease, Huntington's disease, major depression, amnesia, or lesions of the prefrontal cortex. Two separate formal descriptions of the theory are also provided. One describes trial-by-trial learning, and the other describes global dynamics. The theory is tested on published neuropsychological data and on category learning data with normal adults.
A Review and Empirical Evaluation of Feature Weighting Methods for a Class of Lazy Learning Algorithms
- ARTIFICIAL INTELLIGENCE REVIEW
, 1997
Cited by 94 (0 self)
Many lazy learning algorithms are derivatives of the k-nearest neighbor (k-NN) classifier, which uses a distance function to generate predictions from stored instances. Several studies have shown that k-NN's performance is highly sensitive to the definition of its distance function. Many k-NN variants have been proposed to reduce this sensitivity by parameterizing the distance function with feature weights. However, these variants have not been categorized nor empirically compared. This paper reviews a class of weight-setting methods for lazy learning algorithms. We introduce a framework for distinguishing these methods and empirically compare them. We observed four trends from our experiments and conducted further studies to highlight them. Our results suggest that methods which use performance feedback to assign weight settings demonstrated three advantages over other methods: they require less pre-processing, perform better in the presence of interacting features, and generally require less training data to learn good settings. We also found that continuous weighting methods tend to outperform feature selection algorithms for tasks where some features are useful but less important than others.
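The idea of parameterizing a k-NN distance function with feature weights can be illustrated with a minimal sketch. This is not one of the weight-setting methods the paper reviews: the weighted Euclidean distance, the function names, the hand-picked weights, and the toy data are all assumptions made for illustration.

```python
import math
from collections import Counter


def weighted_distance(a, b, weights):
    """Weighted Euclidean distance; a weight of 0 ignores that feature."""
    return math.sqrt(sum(w * (x - y) ** 2 for w, x, y in zip(weights, a, b)))


def knn_predict(query, instances, labels, weights, k=1):
    """Majority vote among the k stored instances closest to the query."""
    ranked = sorted(
        range(len(instances)),
        key=lambda i: weighted_distance(query, instances[i], weights),
    )
    votes = Counter(labels[i] for i in ranked[:k])
    return votes.most_common(1)[0][0]


# Hypothetical data: feature 0 separates the classes, feature 1 is noise.
instances = [(0.0, 5.0), (0.1, -4.0), (1.0, 4.5), (0.9, -5.0)]
labels = ["a", "a", "b", "b"]
query = (0.2, -4.9)

unweighted = knn_predict(query, instances, labels, weights=(1.0, 1.0))
weighted = knn_predict(query, instances, labels, weights=(1.0, 0.0))
print(unweighted, weighted)
```

With equal weights the noisy feature dominates the distance and pulls the query toward the wrong class; down-weighting it restores the prediction that the informative feature supports. The weights here are set by hand, whereas the methods surveyed in the paper learn them.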
Information Foraging
- Psychological Review
, 1999
Cited by 93 (7 self)
Information foraging theory is an approach to understanding how strategies and technologies for information seeking, gathering, and consumption are adapted to the flux of information in the environment. The theory assumes that people, when possible, will modify their strategies or the structure of the environment to maximize their rate of gaining valuable information. The theory is developed by (a) adaptation (rational) analysis of information foraging problems and (b) a detailed process model (adaptive control of thought in information foraging [ACT-IF]). The adaptation analysis develops (a) information patch models, which deal with time allocation and information filtering and enrichment activities in environments in which information is encountered in clusters; (b) information scent models, which address the identification of information value from proximal cues; and (c) information diet models, which address decisions about the selection and pursuit of information items. ACT-IF is instantiated as a production system model of people interacting with complex information technology. Humans actively seek, gather, share, and consume information to a degree unapproached by other organisms. Ours might properly be characterized as a species of informavores (Dennett, 1991). Our adaptive success depends to a large extent on a vast and complex
Rules and Exemplars in Category Learning
- Journal of Experimental Psychology: General
, 1998
Cited by 92 (3 self)
haracterized by descriptions of each module and how each serves in those tasks for which it is best suited. However, these theories often do not emphasize how modules interact in producing responses and in learning. In this article we will develop a modular theory of categorization that follows from two distinct accounts of this behavior. The first account is that of rule-based theories of categorization. These theories emerge from a philosophical tradition in which concepts and categorization are described in terms of definitional rules. For example, if a living thing has a wide, flat tail and constructs dams by cutting down trees with its This work was supported by Indiana University Cognitive Science Program Fellowships and by NIMH Research Training Grant PHS-T32-MH19879-03 to Erickson, and in part by NIMH FIRST Award 1-R29-MH51572-01 to Kruschke. This research was reported as a poster at the 1996 Cognitive Science Society Conference in San Diego, CA. We than
Exemplar dynamics: Word frequency, lenition and contrast
- In
, 2001
Cited by 76 (4 self)
Exemplar theory was first developed as a model of similarity and classification in perception. In this paper, the theory is extended to model speech production as well as speech perception. Straightforward extension of the model provides a formal framework for thinking about the quantitative predictions of usage-based phonology, as proposed by Bybee. A model is proposed which allows us to derive the finding that leniting historical changes are more advanced in frequent words than in rarer ones. Calculations using this model are presented which reveal the interaction of production noise, lenition and entrenchment. A realistic treatment is also provided for the time course of a phonological merger which originates from lenition of a marked category.
Word Learning as Bayesian Inference
- In Proceedings of the 22nd Annual Conference of the Cognitive Science Society
, 2000
Cited by 75 (19 self)
The authors present a Bayesian framework for understanding how adults and children learn the meanings of words. The theory explains how learners can generalize meaningfully from just one or a few positive examples of a novel word’s referents, by making rational inductive inferences that integrate prior knowledge about plausible word meanings with the statistical structure of the observed examples. The theory addresses shortcomings of the two best known approaches to modeling word learning, based on deductive hypothesis elimination and associative learning. Three experiments with adults and children test the Bayesian account’s predictions in the context of learning words for object categories at multiple levels of a taxonomic hierarchy. Results provide strong support for the Bayesian account over competing accounts, in terms of both quantitative model fits and the ability to explain important qualitative phenomena. Several extensions of the basic theory are discussed, illustrating the broader potential for Bayesian models of word learning.
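How prior knowledge might combine with the statistical structure of a few positive examples can be sketched with a toy posterior computation. The abstract does not give the model's equations, so everything here is an assumption for illustration: the three nested hypotheses, their extensions, the uniform prior, and the likelihood (1/|h|)^n, which treats examples as sampled uniformly from the word's extension.

```python
def posterior(examples, hypotheses, prior=None):
    """Posterior over candidate word meanings given positive examples.

    Assumed likelihood: each example is drawn uniformly from the
    hypothesis's extension, so n consistent examples have probability
    (1 / size) ** n; an inconsistent hypothesis gets likelihood 0.
    """
    if prior is None:
        prior = {name: 1.0 / len(hypotheses) for name in hypotheses}
    scores = {}
    for name, extension in hypotheses.items():
        consistent = all(e in extension for e in examples)
        likelihood = (1.0 / len(extension)) ** len(examples) if consistent else 0.0
        scores[name] = prior[name] * likelihood
    total = sum(scores.values())
    if total == 0:
        raise ValueError("no hypothesis is consistent with the examples")
    return {name: s / total for name, s in scores.items()}


# Hypothetical taxonomy: subordinate, basic, and superordinate levels.
dalmatians = {"dalmatian1", "dalmatian2", "dalmatian3"}
dogs = dalmatians | {"poodle", "terrier"}
animals = dogs | {"cat", "pig"}
hypotheses = {"dalmatians": dalmatians, "dogs": dogs, "animals": animals}

one = posterior(["dalmatian1"], hypotheses)
three = posterior(["dalmatian1", "dalmatian2", "dalmatian3"], hypotheses)
print(one["dalmatians"], three["dalmatians"])
```

Under these assumptions, three examples that all fall in the narrowest category shift belief toward that category more strongly than a single example does, which is one way a learner could generalize sensibly from only positive examples.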
An exemplar-based random walk model of speeded classification
- Psychological Review
, 1997
Cited by 74 (22 self)
The authors propose and test an exemplar-based random walk model for predicting response times in tasks of speeded, multidimensional perceptual classification. The model combines elements of R. M. Nosofsky's (1986) generalized context model of categorization and G. D. Logan's (1988) instance-based model of automaticity. In the model, exemplars race among one another to be retrieved from memory, with rates determined by their similarity to test items. The retrieved exemplars provide incremental information that enters into a random walk process for making classification decisions. The model predicts correctly effects of within- and between-categories similarity, individual-object familiarity, and extended practice on classification response times. It also builds bridges between the domains of categorization and automaticity. Models of multidimensional perceptual classification have grown increasingly powerful and sophisticated in recent years, providing detailed quantitative accounts of patterns of classification learning, transfer, and generalization (e.g., Anderson,
Connectionist and Diffusion Models of Reaction Time
, 1997
Cited by 73 (10 self)
Two connectionist frameworks, GRAIN (McClelland, 1993) and BSB (Anderson, 1991), and the diffusion model (Ratcliff, 1978) were evaluated using data from a signal detection task. Subjects were asked to choose one of two possible responses to a stimulus and were provided feedback about whether the choice was correct. The dependent variables included response probabilities, reaction times for correct and error responses, and reaction time distributions, and the independent variables were stimulus value, stimulus probability, and lag from an abrupt switch in stimulus probability. The diffusion model accounted for all aspects of the asymptotic data, including error reaction times, which had previously been a problem. The connectionist models accounted for many aspects of the data adequately, but each failed to a greater or lesser degree in important ways except for one model very similar to the diffusion model. The connectionist learning mechanisms were unable to account for initial learning or abrupt changes in stimulus probability. The results provide an advance in the development of the diffusion model and show that the long tradition of reaction time research and theory is a fertile domain for development and testing of connectionist assumptions about how decisions are generated over time.
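A two-choice diffusion process of the general kind evaluated above can be simulated in a few lines. This is a bare-bones sketch rather than the full model fitted in the paper: the parameter names and values, the Euler time step, and the absence of across-trial variability and non-decision time are all simplifying assumptions.

```python
import math
import random


def simulate_trial(drift, boundary, start, rng, noise_sd=1.0, dt=0.001,
                   max_steps=100_000):
    """Accumulate noisy evidence until it reaches 0 or `boundary`.

    Returns (response, decision_time). The response is None if neither
    boundary is reached within max_steps.
    """
    x = start
    for step in range(1, max_steps + 1):
        # Euler step: deterministic drift plus Gaussian noise scaled by sqrt(dt).
        x += drift * dt + noise_sd * math.sqrt(dt) * rng.gauss(0.0, 1.0)
        if x >= boundary:
            return "upper", step * dt
        if x <= 0.0:
            return "lower", step * dt
    return None, max_steps * dt


rng = random.Random(0)  # fixed seed so the run is repeatable
trials = [simulate_trial(drift=2.0, boundary=1.0, start=0.5, rng=rng)
          for _ in range(500)]
upper_times = [t for r, t in trials if r == "upper"]
lower_times = [t for r, t in trials if r == "lower"]
print(len(upper_times) / len(trials))
```

Each simulated trial yields both a choice and a decision time, so the same process produces response probabilities and separate reaction time distributions for the two responses, which are the dependent variables the abstract lists.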

