Two statistical parsing models applied to the Chinese Treebank (2000) [17 citations — 4 self]
by Daniel M. Bikel, David Chiang
In Proceedings of the Second Chinese Language Processing Workshop
http://www.cis.upenn.edu/~dbikel/chinese-parsing.ps
Add To MetaCart
Abstract:
This paper presents the rst-ever results of applying statistical parsing models to the newly-available Chinese Treebank. We have employed two models, one extracted and adapted from BBN's SIFT System (Miller et al., 1998) and a TAGbased parsing model, adapted from (Chiang, 2000). On sentences with 40 words, the former model performs at 69 % precision, 75 % recall, and the latter at 77 % precision and 78 % recall.
Citations
| 1133 | Building a large annotated corpus of English: The Penn Treebank – Marcus, Santorini, et al. - 1993 |
| 622 | A stochastic parts program and noun phrase parser for unrestricted text – Church - 1988 |
| 418 | Tree-Adjoining Grammars – Joshi, Schabes - 1997 |
| 358 | Three generative, lexicalised models for statistical parsing – Collins - 1997 |
| 209 | Nymble: A high performance learning name-finder – Bikel, M, et al. - 1997 |
| 114 | Stochastic Lexicalized Tree-Adjoining Grammars – Schabes - 1992 |
| 98 | Another Concept of Tree-Adjoining Derivation – Schabes, Shieber - 1994 |
| 95 | D-Tree Grammars – RAMBOW, VIJAY-SHANKER, et al. - 1995 |
| 78 | Probabilistic Tree-Adjoining Grammar as a Framework for Statistical Natural Language Processing – Resnik - 1992 |
| 36 | A Simple Introduction to Maximum Entropy Models for Natural Language Processing – Ratnaparkhi - 1997 |
| 7 | Nymble: a High-Performance Learning Namender – Bikel, Schwartz, et al. - 1997 |

