Download:
|
by Walter Daelemans, Steven Gillis, Gert Durieux
Enschede. Twente University
http://cnts.uia.ac.be/~walter/papers/1994/dgd94.ps
Add To MetaCart
Abstract:
We provide a qualitative and empirical comparison of Skousen's Analogical Modeling algorithm (AM) with Lazy Learning (LL) on a typical Natural Language Processing task. AM incorporates an original approach to feature selection and to the handling of symbolic, unordered feature values. More specifically, it provides a method to dynamically compute an optimally-sized set of nearest neighbours (the analogical set) for each test item, on the basis of which the most plausible category can be selected. We investigate the algorithm's generalisation accuracy and its tolerance to noise and compare it to Lazy Learning techniques on a primary stress assignment task in Dutch. The latter problem is typical for a large amount of classification problems in Natural Language Processing. It is shown that AM is highly successful in performing the task: it outperforms Lazy Learning in its basic scheme. However, LL can be augmented so that it performs at least as well as AM and becomes as noise tolerant as well. Keywords: Analogy-based NLP, Example- and Memory-based NLP, Statistical methods. 1
Citations
|
3214
|
C4.5: Programs for Machine Learning
– Quinlan
- 1993
|
|
2488
|
Induction of Decision Trees
– Quinlan
- 1986
|
|
792
|
Instance-Based Learning Algorithms
– Kibler
- 1991
|
|
414
|
Toward memory-based reasoning
– Stanfill, C, et al.
- 1986
|
|
410
|
An algorithm for finding best matches in logarithmic expected time
– Friedman, Bentley, et al.
- 1977
|
|
220
|
A practical approach to feature selection
– Rendell, Kira
- 1992
|
|
153
|
A Nearest Hyperrectangle Learning Method
– Salzberg
- 1991
|
|
69
|
A Study of Instance-Based Algorithms for Supervised Learning Tasks: Mathematical, Empirical, and Psychological Observations
– Aha
- 1990
|
|
66
|
A Case-Based Approach to Knowledge Acquisition for DomainSpeci c Sentence Analysis
– Cardie
- 1993
|
|
62
|
Generalisation performance of backpropagation learning on a syllabification task
– Daelemans, Bosch
- 1992
|
|
59
|
A Weighted Nearest Neighbour Algorithm for Learning with Symbolic Features
– Cost, Salzberg
- 1993
|
|
55
|
The Acquisition of Stress, a dataoriented approach
– Daelemans, Gillis, et al.
- 1994
|
|
52
|
Analogical Modeling of Language
– Skousen
- 1989
|
|
51
|
Memory-based lexical acquisition and processing
– Daelemans
- 1995
|
|
27
|
Challenges of massive parallelism
– Kitano
- 1993
|
|
22
|
An algorithm for best matches in logarithmic expected time
– Friedman, Bentley, et al.
- 1977
|
|
14
|
Real Time Morphology: Symbolic Rules or Analogical Networks'. Berkeley Linguistic Society 15
– Derwing, Skousen
- 1989
|
|
11
|
Are rules and modules really necessary for explaining language
– Chandler
- 1992
|