Mostly-Unsupervised Statistical Segmentation of Japanese: Applications to Kanji (2000)

by Rie Kubota Ando , Lillian Lee
Citations:32 - 1 self

Documents Related by Co-Citation

103 An efficient, probabilistically sound algorithm for segmentation and word discovery – Michael R. Brent - 1999
48 A Compression-based Algorithm for Chinese Word Segmentation – W. J. Teahan, Yingying Wen, Rodger Mcnab, Ian H. Witten
22 Self-supervised Chinese Word Segmentation – Fuchun Peng, Dale Schuurmans - 2001
99 A Stochastic Finite-State Word-Segmentation Algorithm For Chinese – Richard Sproat, Chilin Shih, William Gale, Nancy Chang - 1996
28 USeg: A Retargetable Word Segmentation Procedure for Information Retrieval – Jay M. Ponte, W. Bruce Croft - 1996
201 Unsupervised Learning of the Morphology of a Natural Language – John Goldsmith - 2001
131 Identifying hierarchical structure in sequences: A linear-time algorithm – Craig G. Nevill-manning, Ian H. Witten - 1997
6234 Maximum likelihood from incomplete data via the EM algorithm – A. P. Dempster, N. M. Laird, D. B. Rubin - 1977
32 An Unsupervised Iterative Method for Chinese New Lexicon Extraction – Jing-shin Chang, Keh-yih Su - 1997
26 Discovering Chinese Words from Unsegmented Text – Xianping Ge, Wanda Pratt, A Pratt, Padhraic Smyth - 1999
31 The Unsupervised Acquisition of a Lexicon from Continuous Speech – Carl De Marcken - 1995
631 An Empirical Study of Smoothing Techniques for Language Modeling – Stanley F. Chen - 1998
130 SPIRIT: Sequential Pattern Mining with Regular Expression Constraints – Minos N. Garofalakis, Rajeev Rastogi, Kyuseok Shim - 1999
72 Parsing a Natural Language Using Mutual Information Statistics – David M. Magerman, Mitchell P. Marcus - 1990
250 Discovery of Frequent Episodes in Event Sequences – Heikki Mannila, Hannu Toivonen, A. Inkeri Verkamo - 1997
17 On the discovery of novel word-like units from utterances: An artificial-language study with implications for native-language acquisition – D Dahan, M R Brent - 1999
5 Extracting key terms from Chinese and Japnese text – P Fung - 1998
12 Chinese Segmentation Disambiguation – W Jin - 1994
27 An Algorithm for Segmenting Categorical Time Series into Meaningful Episodes – Paul Cohen, Niall Adams - 2001