Results 1 - 10
of
10
Frequency Estimates for Statistical Word Similarity Measures
, 2003
"... Statistical measures of word similarity have application in many areas of natural language processing, such as language modeling and information retrieval. We report a comparative study of two methods for estimating word cooccurrence frequencies required by word similarity measures. Our frequency es ..."
Abstract
-
Cited by 64 (2 self)
- Add to MetaCart
Statistical measures of word similarity have application in many areas of natural language processing, such as language modeling and information retrieval. We report a comparative study of two methods for estimating word cooccurrence frequencies required by word similarity measures. Our frequency estimates are generated from a terabyte-sized corpus of Web data, and we study the impact of corpus size on the effectiveness of the measures. We base the evaluation on one TOEFL question set and two practice questions sets, each consisting of a number of multiple choice questions seeking the best synonym for a given target word.
Performance Issues and Error Analysis in an Open-Domain Question Answering System
- ACM Trans. Inf. Syst
, 2002
"... This paper presents an in-depth analysis of a state-of-the-art Question Answering system. Several scenarios are examined: (1) the performance of each module in a serial baseline system, (2) the impact of feedbacks and the insertion of a logic prover, and (3) the impact of various lexical resources. ..."
Abstract
-
Cited by 59 (2 self)
- Add to MetaCart
This paper presents an in-depth analysis of a state-of-the-art Question Answering system. Several scenarios are examined: (1) the performance of each module in a serial baseline system, (2) the impact of feedbacks and the insertion of a logic prover, and (3) the impact of various lexical resources. The main conclusion is that the overall performance depends on the depth of natural language processing resources and the tools used for answer finding.
Improving the Estimation of Relevance Models Using Large External Corpora
, 2006
"... Information retrieval algorithms leverage various collection statistics to improve performance. Because these statistics are often computed on a relatively small evaluation corpus, we believe using larger, non-evaluation corpora should improve performance. Specifically, we advocate incorporating ext ..."
Abstract
-
Cited by 22 (2 self)
- Add to MetaCart
Information retrieval algorithms leverage various collection statistics to improve performance. Because these statistics are often computed on a relatively small evaluation corpus, we believe using larger, non-evaluation corpora should improve performance. Specifically, we advocate incorporating external corpora based on language modeling. We refer to this process as external expansion. When compared to traditional pseudo-relevance feedback techniques, external expansion is more stable across topics and up to 10% more e#ective in terms of mean average precision. Our results show that using a high quality corpus that is comparable to the evaluation corpus can be as, if not more, e#ective than using the web. Our results also show that external expansion outperforms simulated relevance feedback. In addition, we propose a method for predicting the extent to which external expansion will improve retrieval performance. Our new measure demonstrates positive correlation with improvements in mean average precision.
Web Text Corpus for Natural Language Processing
, 2006
"... Web text has been successfully used as training data for many NLP applications. ..."
Abstract
-
Cited by 10 (0 self)
- Add to MetaCart
Web text has been successfully used as training data for many NLP applications.
Scoring missing terms in information retrieval tasks
- In ACM Thirteenth Conference on Information and Knowledge Management (CIKM-2004
, 2004
"... An usual approach to address mismatching vocabulary problem is to augment the original query using dictionaries and other lexical resources and/or by looking at pseudo-relevant documents. Either way, terms are added to form a new query that will be used to score all documents in a subsequent retriev ..."
Abstract
-
Cited by 7 (0 self)
- Add to MetaCart
An usual approach to address mismatching vocabulary problem is to augment the original query using dictionaries and other lexical resources and/or by looking at pseudo-relevant documents. Either way, terms are added to form a new query that will be used to score all documents in a subsequent retrieval pass, and as consequence the original query’s focus may drift because of the newly added terms. We propose a new method to address the mismatching vocabulary problem, expanding original query terms only when necessary and complementing the user query for missing terms while scoring documents. It allows related semantic aspects to be included in a conservative and selective way, thus reducing the possibility of query drift. Our results using replacements for the missing query terms in modified document and passages retrieval methods show significant improvement over the original ones.
How Much Data is Needed for Reliable MT Evaluation? Using Bootstrapping to Study Human and Automatic Metrics
- Proc. of Machine Translation Summit XI
, 2007
"... Evaluating the output quality of machine translation system requires test data and quality metrics to be applied. Based on the results of the French MT evaluation campaign CESTA, this paper studies the statistical reliability of the scores depending on the amount of test data used to obtain them. Bo ..."
Abstract
-
Cited by 3 (2 self)
- Add to MetaCart
Evaluating the output quality of machine translation system requires test data and quality metrics to be applied. Based on the results of the French MT evaluation campaign CESTA, this paper studies the statistical reliability of the scores depending on the amount of test data used to obtain them. Bootstrapping is used to compute standard deviation of scores assigned by human judges (mainly of adequacy) as well as of five automatic metrics. The reliability of the scores is measured using two formal criteria, and the minimal number of documents or segments needed to reach reliable scores is estimated. This number does not depend on the exact subset of documents that is used.
2003) Quantum, a French/English Cross-language Question Answering System
- In Proceedings of CrossLanguage Evaluation Forum (CLEF 2003
"... Abstract. We describe a method for modifying a monolingual English question answering system to allow it to accept French questions. Our method relies on a statistical translation engine to translate keywords, and a set of manually written rules for analyzing French questions. The additional steps p ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
Abstract. We describe a method for modifying a monolingual English question answering system to allow it to accept French questions. Our method relies on a statistical translation engine to translate keywords, and a set of manually written rules for analyzing French questions. The additional steps performed by the cross-language system lower its performance by 28 % compared to the original system. 1
A WaCky Introduction
"... We use the Web today for a myriad purposes, from buying a plane ticket to browsing an ancient manuscript, from looking up a recipe ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
We use the Web today for a myriad purposes, from buying a plane ticket to browsing an ancient manuscript, from looking up a recipe
INFO4990 Research Methods Literature Review
, 2008
"... The goal of research into Question Answering is to produce an automated system that enables a user to enter a question in natural language and have the system return the correct answer, along with a document supporting that answer so that the user may be sure of it’s reliability. Thus a Question Ans ..."
Abstract
- Add to MetaCart
The goal of research into Question Answering is to produce an automated system that enables a user to enter a question in natural language and have the system return the correct answer, along with a document supporting that answer so that the user may be sure of it’s reliability. Thus a Question Answering system will take as input a collection of documents and a question,
The WaCky Wide Web: A Collection of Very Large Linguistically Processed
"... Abstract. This article introduces ukWaC, deWaC and itWaC, three very large corpora of English, German, and Italian built by web crawling, and describes the methodology and tools used in their construction. The corpora contain more than a billion words each, and are thus among the largest resources f ..."
Abstract
- Add to MetaCart
Abstract. This article introduces ukWaC, deWaC and itWaC, three very large corpora of English, German, and Italian built by web crawling, and describes the methodology and tools used in their construction. The corpora contain more than a billion words each, and are thus among the largest resources for the respective languages. The paper also provides an evaluation of their suitability for linguistic research, focusing on ukWaC and itWaC. A comparison in terms of lexical coverage with existing resources for the languages of interest produces encouraging results. Qualitative evaluation of ukWaC vs. the British National Corpus was also conducted, so as to highlight differences in corpus composition (text types and subject matters). The article concludes with practical information about format and availability of corpora and tools. Keywords: Annotated corpora – Corpus construction – General-purpose linguistic resources – English – German – Italian – Web as corpus – WaCky! 1.

