Results 1 -
2 of
2
Integrating experiential and distributional data to learn semantic representations
- Psychological Review
, 2009
"... The authors identify 2 major types of statistical data from which semantic representations can be learned. These are denoted as experiential data and distributional data. Experiential data are derived by way of experience with the physical world and comprise the sensory-motor data obtained through s ..."
Abstract
-
Cited by 11 (1 self)
- Add to MetaCart
The authors identify 2 major types of statistical data from which semantic representations can be learned. These are denoted as experiential data and distributional data. Experiential data are derived by way of experience with the physical world and comprise the sensory-motor data obtained through sense receptors. Distributional data, by contrast, describe the statistical distribution of words across spoken and written language. The authors claim that experiential and distributional data represent distinct data types and that each is a nontrivial source of semantic information. Their theoretical proposal is that human semantic representations are derived from an optimal statistical combination of these 2 data types. Using a Bayesian probabilistic model, they demonstrate how word meanings can be learned by treating experiential and distributional data as a single joint distribution and learning the statistical structure that underlies it. The semantic representations that are learned in this manner are measurably more realistic—as verified by comparison to a set of human-based measures of semantic representation—than those available from either data type individually or from both sources independently. This is not a result of merely using quantitatively more data, but rather it is because experiential and distributional data are qualitatively distinct, yet intercorrelated, types of data. The semantic representations that are learned are based on statistical structures that exist both within and between the experiential and distributional data types.
Modelling language acquisition: Lexical grounding through perceptual features
- In Workshop on Developmental Embodied Cognition
, 2001
"... A neural network model of language acquisition is introduced, motivated by current research in psychology and linguistics. It uses both extra-linguistic perceptual features and symbolic representations of words. The network learns to auto-associate these inputs to their linguistic labels, as well as ..."
Abstract
-
Cited by 3 (1 self)
- Add to MetaCart
A neural network model of language acquisition is introduced, motivated by current research in psychology and linguistics. It uses both extra-linguistic perceptual features and symbolic representations of words. The network learns to auto-associate these inputs to their linguistic labels, as well as to predict the next word in the corpus. This is interpreted to model both the acquisition of a lexicon, and the beginnings of syntax or grammar (word order). Furthermore, the inclusion of the extralinguistic perceptual features is argued to be a form of direct developmental grounding in embodied concepts, which allows the later learning of more abstract concepts to be grounded indirectly in meaning through relations to the first words. Through this bootstrapping process, the entire network may be scalable to large vocabularies, and may bridge the gap between high-dimensional and embodied theories of meaning.

