Abstract: This article presents a combination of unsupervised and supervised
learning techniques for generating word segmentation
rules from a raw list of words. First, a language bias for word segmentation
is introduced, and a simple genetic algorithm searches for the
segmentation that corresponds to the best bias value. In the second
phase, the words segmented by the genetic algorithm are used as input
for the first-order decision list learner Clog. The result is a set of
first ...
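The first phase described in the abstract can be illustrated with a minimal sketch. The abstract does not define the language bias, so the code below assumes a stand-in: each word is cut once into a prefix and a suffix, and the cost to minimize is the total number of characters in the distinct prefixes plus the distinct suffixes. The names `lexicon_cost` and `evolve`, the GA parameters, and the selection scheme are all hypothetical choices for illustration and are not taken from the paper.

```python
import random


def lexicon_cost(words, cuts):
    """Stand-in bias (an assumption, not the paper's definition):
    total characters across the distinct prefixes and distinct suffixes."""
    prefixes = {w[:c] for w, c in zip(words, cuts)}
    suffixes = {w[c:] for w, c in zip(words, cuts)}
    return sum(map(len, prefixes)) + sum(map(len, suffixes))


def evolve(words, pop_size=30, generations=60, mutation_rate=0.1, seed=0):
    """Simple elitist GA: one cut position per word, lower cost is better."""
    rng = random.Random(seed)

    def random_cuts():
        # A cut at 0 or len(w) leaves the word unsegmented.
        return [rng.randint(0, len(w)) for w in words]

    population = [random_cuts() for _ in range(pop_size)]
    for _ in range(generations):
        # Truncation selection: the better half survives unchanged.
        population.sort(key=lambda cuts: lexicon_cost(words, cuts))
        parents = population[: pop_size // 2]
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            point = rng.randint(0, len(words))  # one-point crossover
            child = a[:point] + b[point:]
            # Mutation: occasionally re-draw a word's cut position.
            child = [
                rng.randint(0, len(w)) if rng.random() < mutation_rate else c
                for w, c in zip(words, child)
            ]
            children.append(child)
        population = parents + children
    return min(population, key=lambda cuts: lexicon_cost(words, cuts))


words = ["walked", "talked", "walking", "talking"]
best = evolve(words)
print([(w[:c], w[c:]) for w, c in zip(words, best)])
```

Under this stand-in cost, cutting all four example words after the fourth letter gives `lexicon_cost(words, [4, 4, 4, 4]) == 13` (walk, talk, ed, ing), against 26 for the unsegmented list. The segmented words returned by such a search would then serve as training input for the second, supervised phase.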
Context of citations to this paper:
... induction, robust parsing, anaphora resolution and morphological analysis [Losee 1995] [Ros 1998] [Orasan et al. 2000] [Kazakov and Manandhar 2000]. To the author's knowledge there has not been any report on using GA-based techniques to extract bilingual dictionaries. 2.1...
...their correct classification. Later on, learning can be used to replace these exceptions with rules, if possible. Foidl [24] and Clog [15] are two of the first order decision list learners. It is also worth mentioning that Clog, unlike Progol, is an incremental learner. Eager...
Cited by:
Resources for Morphology Learning and Evaluation - Mike Maxwell (2002)
Evolutionary Sentence Combination for Chatterbots - Vrajitoru, Ratkiewicz
Evolutionary Sentence Building for Chatterbots - Vrajitoru
Similar documents (at the sentence level):
36.9%: Unsupervised Learning of Word Segmentation Rules with.. - Kazakov, Manandhar (2001)
32.4%: Natural Language Processing Applications of Machine Learning - Kazakov (1999)
10.0%: A Hybrid Approach to Word Segmentation - Kazakov, Manandhar (1998)
Active bibliography (related documents):
0.3: Incorporating Linkage Learning into the GeLog Framework - Fühner, Kokai (2002)
0.3: Induction of Defeasible Logic Theories in the Legal Domain - Johnston, Governatori
0.3: Using Induced Rules as Complex Features in Memory-Based.. - van den Bosch (2000)
Similar documents based on text:
0.3: Achievements and Prospects of Learning Word Morphology with.. - Kazakov
0.2: On Constraint-Based Lambek Calculi - Dörre, Manandhar (1995)
0.2: Inductive Learning of Lexical Semantics with Typed.. - Kazakov, Dobnik
Related documents from co-citation:
2: Crossover Improvement for the Genetic Algorithm in Information Retrieval - Vrajitoru - 1998
2: Dialogues with colorful personalities of early AI - Güzeldere, Franchi - 1995
2: ELIZA -- A computer program for the study of natural language communications bet.. - Weizenbaum - 1966
BibTeX entry:
Kazakov, D. and Manandhar, S. (2000) Unsupervised Learning of Word Segmentation Rules with Genetic Algorithms and Inductive Logic Programming. To appear in Journal of Machine Learning. http://citeseer.ist.psu.edu/kazakov00unsupervised.html
@article{kazakov01unsupervised,
  author = "Dimitar Kazakov and Suresh Manandhar",
  title = "Unsupervised Learning of Word Segmentation Rules with Genetic Algorithms and Inductive Logic Programming",
  journal = "Machine Learning",
  volume = "43",
  number = "1/2",
  pages = "121--162",
  year = "2001",
  url = "http://citeseer.ist.psu.edu/kazakov00unsupervised.html"
}
Citations (may not include all citations):
2138: Genetic Algorithms in Search - Goldberg - 1989
157: The Art of Computer Programming: Sorting and Searching - Knuth - 1973
149: Inverse entailment and Progol (New Generation Computing) - Muggleton - 1995
106: Some advances in transformation-based part of speech tagging - Brill - 1994
55: Induction of first-order decision lists: Results on learnin.. - Mooney, Califf - 1995
54: Learning semantic grammars with constructive inductive logic.. - Zelle, Mooney - 1993
44: Machine Learning - Mitchell - 1997
42: Part-of-speech tagging using Progol - Cussens - 1997
22: Course in General Linguistics - de Saussure - 1959
20: Two-level morphology - Koskenniemi - 1983
20: Automatic rule induction for unknown word guessing - Mikheev - 1994
16: From phoneme to morpheme - Harris - 1955
12: On the notions "lexically related" and "head of a word" - Williams - 1981
10: Learning Multilingual Morphology with CLOG - Manandhar, Dzeroski et al. - 1998
9: A hybrid approach to word segmentation - Kazakov, Manandhar - 1998
9: Analogical prediction - Muggleton, Bain - 1999
7: Discovering morphemic suffixes: A case study in minimum desc.. - Brent, Lundberg et al. - 1995
5: Unsupervised learning of naive morphology with genetic algor.. - Kazakov - 1997
5: An inductive approach to natural language parser design - Kazakov - 1996
5: Morphology: an Introduction to the Theory of Word-Structure - Matthews - 1974
4: Modèles de séquences de longueurs variables: Application au tr.. - Deligne - 1996
4: .. à deux niveaux en morphologie computationnelle et les d.. - Fradin - 1994
4: The Concise Oxford Dictionary of Linguistics - Matthews - 1997
4: Morphological analysis as classification: an inductive learn.. - van den Bosch, Daelemans et al. - 1996
3: Inductive Logic Programming Techniques and Applications - Lavrac, Dzeroski - 1994
3: Learning to Pronounce Written Words: A Study in Inductive La.. - van den Bosch - 1997
2: Analogy and Machine Translation - Pirelli - 1993
2: Prononcer par analogie: motivations - Yvon - 1996
2: Application of inductive logic programming to natural langua.. - Blockeel - 1994
1: Learning word segmentation rules for tag prediction - Kazakov, Manandhar et al. - 1999
1: University of Hawai - Bender, Morphology - 1997
Documents on the same site (http://www-users.cs.york.ac.uk/~kazakov/my-publications.html):
Evolving the Game of Life - Kazakov, Sweet
Combining LAPIS and WordNet for the Learning of LR Parsers with.. - Kazakov
Achievements and Prospects of Learning Word Morphology with.. - Kazakov
CiteSeer.IST - Copyright Penn State and NEC