Abstract: Information retrieval is concerned with how to classify information and how to judge the similarity
between two objects, such as written documents. As the amount of information available in digital form has
grown, so too has the need for accurate and scalable algorithms for handling this information. Information
theory is concerned with the production and transmission of information. Using a framework known as
the source-channel model of communication, information theory has established...
Similar documents (at the sentence level):
31.4%: Error-Correcting Output Coding for Text Classification - Berger (1999)
10.5%: Information Retrieval as Statistical Translation - Berger (1999)
Active bibliography (related documents):
0.3: On the Learnability and Design of Output Codes for Multiclass .. - Crammer, Singer (2000)
0.3: Tagging English text with a probabilistic model - Merialdo (1993)
0.2: WebMate: A Personal Agent for Browsing and Searching - Chen, Sycara (1998)
Similar documents based on text:
0.2: Statistical Machine Learning for Information Retrieval - Berger (2001)
0.1: Grammatical Trigrams: A New Approach To Statistical Language .. - Sleator, Lafferty (1997)
0.1: Some Error-Correcting Codes and Their Applications - Key
BibTeX entry:
@inproceedings{ berger99information,
author = "Adam Berger and John D. Lafferty",
title = "Information Retrieval as Statistical Translation",
booktitle = "Research and Development in Information Retrieval",
pages = "222--229",
year = "1999",
url = "citeseer.ist.psu.edu/article/berger99information.html" }
Citations (may not include all citations):
2528: Maximum likelihood from incomplete data via the EM algorithm - Dempster, Laird et al. - 1977
860: The theory of error-correcting codes - MacWilliams, Sloane - 1977
657: Bagging predictors - Breiman - 1996
509: A decision-theoretic generalization of on-line learning and .. - Freund, Schapire - 1997
463: Term-weighting approaches in automatic text retrieval - Salton, Buckley - 1988
340: Learning representations by back-propagating errors - Rumelhart, Hinton et al. - 1986
328: A maximum likelihood approach to continuous speech recogniti.. - Bahl, Jelinek et al. - 1983
219: A statistical approach to machine translation - Brown, Cocke et al. - 1990
214: Improved boosting algorithms using confidence-rated predicti.. - Singer, Schapire - 1999
202: Statistical methods in speech recognition - Jelinek - 1998
201: Relevance weighting of search terms - Robertson, Jones - 1976
182: The mathematics of statistical machine translation: Paramete.. - Brown, Pietra et al. - 1993
164: Syskill & Webert: Identifying interesting web sites - Pazzani, Muramatsu et al. - 1996
163: A language modeling approach to information retrieval - Ponte - 1998
163: A language modeling approach to information retrieval - Ponte, Croft - 1998
155: An empirical comparison of voting classification algorithms:.. - Bauer, Kohavi - 1999
149: Learning to extract symbolic knowledge from the World Wide W.. - Craven, DiPasquo et al. - 1998
140: A comparison of event models for Naive Bayes text classifica.. - McCallum, Nigam - 1998
103: Naive (Bayes) at forty: The independence assumption in information retriev.. - Lewis - 1998
95: Hancock-Beaulieu - Robertson, Walker - 1992
89: Bias, variance, and arcing classifiers - Breiman - 1996
82: Error-correcting output coding corrects bias and variance - Kong, Dietterich - 1995
75: Combining instance-based and model-based learning - Quinlan - 1993
68: Improving regression estimation: Averaging methods for varia.. - Perrone - 1993
55: Using probabilistic models of document retrieval without rel.. - Croft, Harper - 1979
36: Newsweeder: Learning to filter news - Lang - 1995
32: Informedia digital video library - Christel, Kanade et al. - 1995
26: Efficient probabilistic inference for text retrieval - Turtle, Croft - 1991
21: Probabilistic models for automatic indexing - Bookstein, Swanson - 1974
21: The candide system for machine translation - Berger, Brown et al. - 1994
19: Cloud classification using error-correcting output codes - Aha, Bankert - 1997
16: Achieving high-accuracy text-to-speech with machine learning - Bakiri, Dietterich - 1999
11: But dictionaries are data too - Brown, Pietra et al. - 1993
11: Translingual information retrieval: Learning from bilingual .. - Yang, Carbonell et al. - 1997
10: Majority vote classifiers: theory and applications - James - 1998
6: An application of expert network to clinical classification .. - Yang, Chute - 1994
5: ECRSM -- erasure correcting scalable reliable multicast - Gemmell - 1997
5: The error coding method and PiCTs - James, Hastie - 1997
4: Solving the word mismatch problem through automatic text ana.. - Xu - 1997
4: Distributional clustering for text classification - Baker, McCallum - 1998
4: Information retrieval on the web: Tools and algorithmic issu.. - Broder, Henzinger - 1998
1: SIGIR Workshop on cross-linguistic information retrieval - Grefenstette - 1996
Documents on the same site (http://www.cs.cmu.edu/~aberger/):
Statistical Machine Learning for Information Retrieval - Berger (2001)
CiteSeer.IST - Copyright Penn State and NEC