(Enter summary)
Abstract: A Corpus-Based Statistics-Oriented (CBSO) methodology, which is an attempt to avoid the drawbacks
of traditional rule-based approaches and purely statistical approaches, is introduced in this paper. Rule-based
approaches, with rules induced by human experts, had been the dominant paradigm in the natural language
processing community. Such approaches, however, suffer from serious difficulties in knowledge acquisition
in terms of cost and consistency. Therefore, it is very difficult for such... (Update)
Context of citations to this paper: More
.... computation power, and to the increasing availability of machine readable corpora, corpus based statistics oriented (CBSO) approaches [Su 1996] have been gaining prevalence in the community of computational linguistics recently. Many computational and statistical tools have...
Cited by: More
Computational Tools and Resources for Linguistic Studies - Hsu, Chang, Su
(Correct)
Active bibliography (related documents): More All
2.8: A Corpus-Based Statistics-Oriented Two-Way Design for.. - Su, Chang, Hsu (1995)
(Correct)
1.7: A Multivariate Gaussian Mixture Model for Automatic Compound.. - Chang, Su
(Correct)
1.6: GPSM: A Generalized Probabilistic Semantic Model for.. - Chang, Luo, Su (1992)
(Correct)
Similar documents based on text: More All
0.7: Statistical Models for Deep-structure Disambiguation - Chiang, Su (1996)
(Correct)
0.5: BehaviorTran: It's Current Status, Prospects and Philosophy - Chang, Su
(Correct)
0.3: Automatic Construction of a Chinese Electronic Dictionary - Jing-Shin Chang Yi-Chung (1995)
(Correct)
BibTeX entry: (Update)
Su, Keh-Yih, Tung-Hui Chiang and Jing-Shin Chang, "An Overview of Corpus-Based Statistics-Oriented (CBSO) Techniques for Natural Language Processing," Intl. Journal of Computational Linguistics and Chinese Language Processing (CLCLP), vol. 1, no. 1, pp. 101-157, Taipei, August 1996. http://citeseer.ist.psu.edu/article/su96overview.html More
@misc{ keh-yih96overview,
author = "S. Keh-Yih and T. Chiang and J. Chang",
title = "An Overview of Corpus-Based Statistics-Oriented (CBSO) Techniques for Natural
Language Processing",
text = "Su, Keh-Yih, Tung-Hui Chiang and Jing-Shin Chang, An Overview of Corpus-Based
Statistics-Oriented (CBSO) Techniques for Natural Language Processing, Intl.
Journal of Computational Linguistics and Chinese Language Processing (CLCLP),
vol. 1, no. 1, pp. 101-157, Taipei, August 1996.",
year = "1996",
url = "citeseer.ist.psu.edu/article/su96overview.html" }
Citations (may not include all citations):
2528
Maximum Likelihood from Incomplete Data via the EM Algorithm (context) - Dempster, Laird et al. - 1977
1362
A Tutorial on Hidden Markov Models and Selected Applications.. (context) - Rabiner - 1989
1262
Classification And Regression Trees (context) - Breiman, Friedman et al. - 1984
1256
Introduction to Modern Information Retrieval (context) - Salton, McGill - 1983
653
Fundamentals of Speech Recognition (context) - Lawrence, Juang - 1993
471
and Stochastic Processes (context) - Papoulis - 1984
328
A Maximum Likelihood Approach to Continuous Speech Recogniti.. (context) - Bahl, Jelinek et al. - 1993
274
Estimation of Probabilities From Sparse Data for the Languag.. (context) - Katz - 1987
263
A Stochastic Parts Program and Noun Phrase Parser for Unrest.. (context) - Church - 1988
239
Pattern Recognition: A Statistical Approach (context) - Devijver - 1982
219
A Statistical Approach to Machine Translation
- Peter, John et al. - 1990
182
Mathematics of Statistical Machine Translation: Parameter Es..
- Peter, Vincent et al. - 1993
163
Principles and Practice of Information Theory (context) - Blahut - 1987
148
Word-Sense Disambiguation Using Statistical Models of Roget'..
- Yarowsky - 1992
145
Machine Learning: An Artificial Intelligence Approach (context) - Michalski, Carbonell et al. - 1983
134
A Program for Aligning Sentences in Bilingual Corpora
- Gale - 1991
124
The population frequencies of species and the estimation of .. (context) - Good - 1953
110
Unsupervised Word Sense Disambiguation Rivaling Supervised M..
- Yarowsky - 1995
94
Grammatical Category Disambiguation by Statistical Optimizat..
- Steven - 1988
88
Class-Based N-gram Models of Natural Language
- Peter, Vincent et al. - 1992
87
Robust part-of-speech Tagging Using a Hidden Markov Model (context) - Kupiec - 1992
82
Generalized Probabilistic LR Parsing of Natural Language (Co.. (context) - Ted - 1993
65
Word-Sense Disambiguation Using Statistical Methods
- Brown - 1991
62
Aligning Sentences in Parallel Corpora
- Brown - 1991
57
Decision Lists for Lexical Ambiguity Resolution: Application..
- Yarowsky
57
Large-Vocabulary Speaker-Independent Continuous Speech Recog.. (context) - Lee - 1988
52
A theory of adaptive pattern classifiers (context) - Amari - 1967
49
The Computational Analysis of English: A Corpus-Based Approa.. (context) - Roger, Leech et al. - 1987
47
Identifying Word Correspondences in Parallel Texts (context) - Gale - 1991
43
Word Association Norms, Mutual Information, and Lexicography
- Church, Hanks - 1989
41
Tagging Text with a Probabilistic Model (context) - Bernard - 1991
32
Termight: Identifying and Translating Technical Terminology (context) - Ido - 1994
31
Robust Bilingual Word Alignment for Machine-Aided Translatio..
- Ido, Gale - 1993
31
Translating Collocations for Bilingual Lexicons: A Statistic.. (context) - Smadja, McKeown et al. - 1996
28
Aligning A Parallel English-Chinese Corpus Statistically Wit..
- Dekai - 1994
27
A Connectionist Approach to Word Sense Disambiguation (context) - Cottrell - 1989
26
Two Language Are More Informative Than One
- Itai, Schwall - 1991
25
An Algorithm for Finding Noun Phrase Correspondences in Bili..
- Kupiec - 1993
21
Machine Translation: Past (context) - Hutchins - 1986
20
Using Bilingual Materials to Develop Word Sense Disambiguati.. (context) - Gale, Church et al. - 1992
18
A Bayesian Hybrid Method for Context-Sensitive Spelling Corr..
- Golding - 1995
17
On smoothing Techniques for Bigram-based Natural Language Mo.. (context) - Ney, Essen - 1991
17
New Discriminative Training Algorithm Based on the Generaliz.. (context) - Katagiri, Lee et al. - 1991
16
Word Identification for Mandarin Chinese Sentences (context) - Keh-Jiann, Liu - 1992
15
A Segmental k-Means Training Procedure for Connected Word Re.. (context) - Rabiner, Wilpon et al. - 1986
15
GPSM: A Generalized Probabilistic Semantic Model for Ambigui..
- Chang, Luo - 1992
13
Context based Spelling Correction (context) - Mays, Damerau et al. - 1991
13
Combining Trigram-based and Feature-based Methods for - 47/5..
- Golding, Schabes - 1996
12
Grammarless Extraction of Phrasal Translation Examples from ..
- Dekai - 1995
10
Prentice Hall (context) - Papoulis - 1990
9
A Corpus-based Approach to Automatic Compound Extraction
- Keh-Yih, Wu et al. - 1994
9
Statistical Models for Word Segmentation and Unknown Word Re.. (context) - Chiang, Chang et al. - 1992
9
GLR Parsing with Probability (context) - Wright, Wrigley - 1991
8
Identification of Unknown Words from Corpus (context) - Cheng-Huang, Lee - 1994
8
A Preliminary Study on Unknown Word Problem in Chinese Word .. (context) - Ming-Yu, Chiang et al. - 1993
7
GLR Parsing with Scoring (context) - Su, Wang et al. - 1991
7
A New Quantitative Quality Measure for Machine Translation S..
- Su, Wu - 1992
7
Discrimination Oriented Probabilistic Tagging (context) - Yi-Chung, Chiang et al. - 1992
6
A Corpus-Based Statistics-Oriented Transfer and Generation M.. (context) - Chang - 1993
5
Why Corpus-Based Statistics-Oriented Machine Translation (context) - Keh-Yih, Chang - 1992
5
Robustness and Discrimination Oriented Speech Recognition Us.. (context) - Su - 1991
5
Some Key Issues in Designing MT Systems (context) - Su - 1990
5
ArchTran: A Corpus-Based Statistics-Oriented English-Chinese.. (context) - Chen, Chang et al. - 1991
4
Semantic and Syntactic Aspects of Score Function (context) - Su - 1988
4
Introduction to Statistical Theory (context) - Hoel, Port et al. - 1971
4
Speech Recognition Using Weighted HMM and Subspace Projectio.. (context) - Keh-Yih, Lee - 1994
4
The Semantic Score Approach to the Disambiguation of PP Atta.. (context) - Liu, Chang - 1990
3
A Sequential Truncation Parsing Algorithm Based on the Score.. (context) - Su, Wang et al. - 1989
3
The effects of Learning, Parameter Tying, and Model Refineme.. (context) - Yi-Chung, Chiang et al. - 1992
2
Constructing A Phrase Structure Grammar By Incorporating Lin.. (context) - Keh-Yih, Hsu et al. - 1991
2
Corpus-based Automatic Rule Selection in Designing a Grammar.. (context) - Yuan-Ling, Wang et al. - 1993
2
Automatic Clustering of Chinese Characters and Words (context) - Chao-Huang - 1993
2
Statistical Models for Deep-structure Disambiguation
- Chiang, Su - 1996
2
Introduction to Corpus-based Statistics-oriented (CBSO) Tech.. (context) - Keh-Yih, Chiang et al. - 1994
2
Hohn Wiley and Sons (context) - Duda, Hart - 1973
1
Syntactic Ambiguity Resolution Using A Discrimination and Ro.. (context) - Chiang, Lin et al. - 1992
1
Smoothing Statistic Databases in a Machine Translation Syste.. (context) - Su, Su - 1989
1
Machine Translation: an Integration Approach (context) - Chen, Chen - 1995
1
Robust Learning, Smoothing, and Parameter Tying on Syntactic.. (context) - Chiang, Lin et al. - 1995
1
From N-Grams to Collocations: An Evaluation of Xtract
- Smadja - 1991
1
A Level Synchronous Approach to Ill-formed Sentence Parsing .. (context) - Yi-Chung - 1995
1
An Introduction to MT (context) - Maghi - 1995
1
Pattern Recognition: statistical, structural and neural appr.. (context) - Schalkoof - 1992
1
Machine Learning: An - 48/50 - Artificial Intelligence Appro.. (context) - Michalski, Carbonell et al. - 1986
1
The New Generation BehaviorTran: Design Philosophy and syste.. (context) - Yu-Ling, Keh-Yih - 1995
Documents on the same site (http://www.bdc.com.tw/~shin/shin.bib.html): More
Automatic Construction of a Chinese Electronic Dictionary - Chang (1995)
(Correct)
A Corpus-Based Statistics-Oriented Two-Way Design for.. - Su, Chang, Hsu (1995)
(Correct)
An Unsupervised Iterative Method for Chinese New Lexicon.. - Jing-Shin Chang (1997)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC