(Enter summary)
Abstract: Palmer ([4]) demonstrated how Brill's Transformation-based Error-Driven Learning can be applied
to word segmentation in various languages. We present experimental results which show that such algorithms
can achieve satisfactory performance even with a a very simple initial state annotator We also
present two preliminary studies, which suggest that even higher performancemight be achieved if simple
morphological information is available to the system, and that segmentation performance might... (Update)
Similar documents based on text: More All
0.4: The Sparse Data Problem in Statistical Language Modeling and.. - Peng
(Correct)
0.2: Entertaining Agents: A Sociological Case Study - Foner (1997)
(Correct)
0.2: A Self-Organizing Japanese Word Segmenter using Heuristic Word.. - Nagata (1997)
(Correct)
BibTeX entry: (Update)
Hockenmaier, J. & Brew, C. 1998. \Error-driven learning of Chinese word segmentation" in 12th Pacic Conference on Language and Information, edited by Guo, J., Lua, K.T. & Xu, J., Singapore, Chinese and Oriental Languages Processing Society, 218-229. http://citeseer.ist.psu.edu/hockenmaier98errordriven.html More
@misc{ hockenmaier98errordriven,
author = "J. Hockenmaier and C. Brew",
title = "Error-driven learning of Chinese word segmentation",
text = "Hockenmaier, J. & Brew, C. 1998. \Error-driven learning of Chinese word
segmentation in 12th Pacic Conference on Language and Information, edited
by Guo, J., Lua, K.T. & Xu, J., Singapore, Chinese and Oriental Languages
Processing Society, 218-229.",
year = "1998",
url = "citeseer.ist.psu.edu/hockenmaier98errordriven.html" }
Citations (may not include all citations):
307
Information Retrieval
- van Rijsbergen - 1979
187
Transformation-based Error-Driven Learning and Natural Langu..
- Brill - 1995
57
Bayesian Learning of Probabilistic Language Models
- Stolcke - 1994
23
A Stochastic Finite-State Word-Segmentation Algorithm for Ch..
- Sproat, Shih et al. - 1996
20
A statistical method for finding word boundaries in Chinese .. (context) - Sproat, Shih - 1993
16
Word Identification for Mandarin Chinese Sentences COLING-92.. (context) - Chen, Liu - 1992
15
Improving Chinese Tokenization with Linguistic Filters on St..
- Wu, Fung - 1994
11
A Trainable Rule-Based Algorithm for Word Segmentation Proce..
- Palmer - 1997
1
The Chinese Language (context) - DeFrancis - 1984
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC