|
103
|
An efficient, probabilistically sound algorithm for segmentation and word discovery
– Michael R. Brent
- 1999
|
|
48
|
A Compression-based Algorithm for Chinese Word Segmentation
– W. J. Teahan, Yingying Wen, Rodger Mcnab, Ian H. Witten
|
|
22
|
Self-supervised Chinese Word Segmentation
– Fuchun Peng, Dale Schuurmans
- 2001
|
|
99
|
A Stochastic Finite-State Word-Segmentation Algorithm For Chinese
– Richard Sproat, Chilin Shih, William Gale, Nancy Chang
- 1996
|
|
28
|
USeg: A Retargetable Word Segmentation Procedure for Information Retrieval
– Jay M. Ponte, W. Bruce Croft
- 1996
|
|
201
|
Unsupervised Learning of the Morphology of a Natural Language
– John Goldsmith
- 2001
|
|
131
|
Identifying hierarchical structure in sequences: A linear-time algorithm
– Craig G. Nevill-manning, Ian H. Witten
- 1997
|
|
6234
|
Maximum likelihood from incomplete data via the EM algorithm
– A. P. Dempster, N. M. Laird, D. B. Rubin
- 1977
|
|
32
|
An Unsupervised Iterative Method for Chinese New Lexicon Extraction
– Jing-shin Chang, Keh-yih Su
- 1997
|
|
26
|
Discovering Chinese Words from Unsegmented Text
– Xianping Ge, Wanda Pratt, A Pratt, Padhraic Smyth
- 1999
|
|
31
|
The Unsupervised Acquisition of a Lexicon from Continuous Speech
– Carl De Marcken
- 1995
|
|
631
|
An Empirical Study of Smoothing Techniques for Language Modeling
– Stanley F. Chen
- 1998
|
|
130
|
SPIRIT: Sequential Pattern Mining with Regular Expression Constraints
– Minos N. Garofalakis, Rajeev Rastogi, Kyuseok Shim
- 1999
|
|
72
|
Parsing a Natural Language Using Mutual Information Statistics
– David M. Magerman, Mitchell P. Marcus
- 1990
|
|
250
|
Discovery of Frequent Episodes in Event Sequences
– Heikki Mannila, Hannu Toivonen, A. Inkeri Verkamo
- 1997
|
|
17
|
On the discovery of novel word-like units from utterances: An artificial-language study with implications for native-language acquisition
– D Dahan, M R Brent
- 1999
|
|
5
|
Extracting key terms from Chinese and Japnese text
– P Fung
- 1998
|
|
12
|
Chinese Segmentation Disambiguation
– W Jin
- 1994
|
|
27
|
An Algorithm for Segmenting Categorical Time Series into Meaningful Episodes
– Paul Cohen, Niall Adams
- 2001
|