Results 1 - 10
of
67
Mining Product Reputations on the Web
, 2002
"... Knowing the reputations of your own and/or competitors products is important for marketing and customer relationship management. It is, however, very costly to collect and analyze survey data manually. This paper presents a new framework for mining product reputations on the Internet. It automatica ..."
Abstract
-
Cited by 95 (1 self)
- Add to MetaCart
(Show Context)
Knowing the reputations of your own and/or competitors products is important for marketing and customer relationship management. It is, however, very costly to collect and analyze survey data manually. This paper presents a new framework for mining product reputations on the Internet. It automatically collects people's opinions about target products from Web pages, and uses text mining techniques to obtain reputations of the products. In advance, we generate, on the basis of human-tested examples, syntactic and linguistic rules to determine whether any given statement is an opinion or not, and the positive/negative nature of that opinion. We first collect statements regarding target products using a general search engine, then, using the rules, extract opinions from them and attach to each of the opinions the labels
BalkaNet: Aims, Methods, Results and Perspectives. A General Overview
- In: D. Tufiş (ed): Special Issue on BalkaNet. Romanian Journal on Science and Technology of Information
"... Abstract. BalkaNet is an EC funded project (IST-2000-29388) that started in September 2001 and will end in August 2004. It aims at developing [109] aligned wordnets for the following Balkan languages: Bulgarian, Greek, Romanian, Serbian, Turkish and to extend the Czech wordnet previously developed i ..."
Abstract
-
Cited by 53 (15 self)
- Add to MetaCart
(Show Context)
Abstract. BalkaNet is an EC funded project (IST-2000-29388) that started in September 2001 and will end in August 2004. It aims at developing [109] aligned wordnets for the following Balkan languages: Bulgarian, Greek, Romanian, Serbian, Turkish and to extend the Czech wordnet previously developed in the EuroWordNet project. BalkaNet project has insofar delivered many useful results in the fields of both Computational Lexicography and Natural Language Processing. However, most of these results have been only partially disseminated in different conferences and journals. This is the first attempt to provide an overall description of the findings, methodologies and results of the project as well as a detailed account on each monolingual wordnet. The paper also presents the freeware multilingual tools designed for the development, maintenance and efficient exploitation of the aligned BalkaNet wordnets. A preliminary approach on BalkaNet’s application towards indexing Web documents and Information Retrieval is described, following the consideration that semantic networks are valuable in the context of real world systems and user communities. Last but not least, a rather thorough analyses of wordnet applications over the last years is intended to put in evidence the hottest themes for further developments based on wordnets. The ultimate objective of this contribution is to spread the knowledge and experience that we have acquired, to the benefit of the research and industrial communities. We also hope that our shared experience will be helpful for other wordnet-builders. 10 D. Tufi¸s, D. Cristea, S. Stamou 1.
Toward Semantics-Based Answer Pinpointing
, 2001
"... SHAPE ADJECTIVE COLOR DISEASE TEXT NARRATIVE* GENERAL-INFO DEFINITION USE EXPRESSION-ORIGIN HISTORY WHY-FAMOUS BIO ANTECEDENT INFLUENCE CONSEQUENT CAUSE-EFFECT METHOD-MEANS CIRCUMSTANCE-MEANS REASON EVALUATION PRO-CON CONTRAST RATING COUNSEL-ADVICE Actual answers Answer templates Lou Vasquez, trac ..."
Abstract
-
Cited by 46 (7 self)
- Add to MetaCart
(Show Context)
SHAPE ADJECTIVE COLOR DISEASE TEXT NARRATIVE* GENERAL-INFO DEFINITION USE EXPRESSION-ORIGIN HISTORY WHY-FAMOUS BIO ANTECEDENT INFLUENCE CONSEQUENT CAUSE-EFFECT METHOD-MEANS CIRCUMSTANCE-MEANS REASON EVALUATION PRO-CON CONTRAST RATING COUNSEL-ADVICE Actual answers Answer templates Lou Vasquez, track coach of...and Johnny Mathis <person>, <role> of <entity> Signed Saparmurad Turkmenbachy [Niyazov], <person> <role-title*> of <entity> president of Turkmenistan ...Turkmenistan's President Saparmurad Niyazov... <entity>'s <role> <person> ...in Tchaikovsky's Eugene Onegin... <person>'s <entity> Mr. Jack Welch, GE chairman... <role-title> <person> ... <entity> <role> ...Chairman John Welch said ...GE's <subject>|<psv object> of related role-verb Figure 3. Portion of QA Typology node annotations for Proper-Person. At the time of the TREC-9 Q&A evaluation, we had produced approx. 500 patterns by simply crosscombining approx. 20 Question patterns with approx. 25 Answer patterns. To our disappo...
An Intelligent Discussion-Bot for Answering Student Queries in Threaded Discussions
- In Proceedings of Intelligent User Interface (IUI-2006
, 2006
"... This paper describes a discussion-bot, which provides answers to students ’ discussion board questions in an unobtrusive and human-like way. Using information retrieval and natural language processing techniques, the discussion-bot identifies the questioner’s interest, mines suitable answers from an ..."
Abstract
-
Cited by 40 (10 self)
- Add to MetaCart
This paper describes a discussion-bot, which provides answers to students ’ discussion board questions in an unobtrusive and human-like way. Using information retrieval and natural language processing techniques, the discussion-bot identifies the questioner’s interest, mines suitable answers from an annotated corpus of 1236 archived threaded discussions and 279 course documents, and generates a human-like reply. A novel modeling approach was designed for the analysis of archived threaded discussions to facilitate answer extraction. We compare a self-out and an all-in evaluation of the mined answers. The results show that the discussion-bot can begin to meet students ’ learning requests. We discuss directions that might be taken to increase the effectiveness of the question matching and answer extraction algorithms. The research takes place in the context of an undergraduate computer science course.
A Multi-Strategy and Multi-Source Approach to Question Answering
- In Proceedings of Text REtrieval Conference
, 2003
"... this paper, we first describe the architecture on which PIQUANT is based. We then describe the answering agents currently implemented within the PIQUANT system, and how they were configured for our TREC2002 runs. Finally, we show that significant performance improvement was achieved by our multi-age ..."
Abstract
-
Cited by 33 (3 self)
- Add to MetaCart
(Show Context)
this paper, we first describe the architecture on which PIQUANT is based. We then describe the answering agents currently implemented within the PIQUANT system, and how they were configured for our TREC2002 runs. Finally, we show that significant performance improvement was achieved by our multi-agent architecture by comparing our TREC2002 results against individual answering agent performance
Contextual preferences
- In Proceedings of ACL
, 2008
"... The validity of semantic inferences depends on the contexts in which they are applied. We propose a generic framework for handling contextual considerations within applied inference, termed Contextual Preferences. This framework defines the various context-aware components needed for inference and t ..."
Abstract
-
Cited by 33 (7 self)
- Add to MetaCart
(Show Context)
The validity of semantic inferences depends on the contexts in which they are applied. We propose a generic framework for handling contextual considerations within applied inference, termed Contextual Preferences. This framework defines the various context-aware components needed for inference and their relationships. Contextual preferences extend and generalize previous notions, such as selectional preferences, while experiments show that the extended framework allows improving inference quality on real application data. 1
Use of WordNet Hypernyms for Answering What-Is Questions
, 2001
"... We present a preliminary analysis of the use of WordNet hypernyms for answering "What-is" questions. We analyse the approximately 130 definitional questions in the TREC10 corpus with respect to our technique of Virtual Annotation (VA), which has previously been shown to be effective on the ..."
Abstract
-
Cited by 29 (0 self)
- Add to MetaCart
We present a preliminary analysis of the use of WordNet hypernyms for answering "What-is" questions. We analyse the approximately 130 definitional questions in the TREC10 corpus with respect to our technique of Virtual Annotation (VA), which has previously been shown to be effective on the TREC9 definitional question set and other questions. We discover that VA is effective on a subset of the TREC10 definitional questions, but that some of these questions seem to need a user model to generate correct answers, or at least answers that agree with the NIST judges. Furthermore, there remains a large enough subset of definitional questions that cannot benefit at all from the WordNet isa-hierarchy, prompting the need to investigate alternative external resources.
Answer Selection in a Multi-Stream Open Domain Question Answering System
- Proceedings 26th European Conference on Information Retrieval (ECIR’04),, volume 2997 of LNCS
, 2004
"... Question answering systems aim to meet users' information needs by returning exact answers in response to a question. Traditional open domain question answering systems are built around a single pipeline architecture. In an attempt to exploit multiple resources as well as multiple answering ..."
Abstract
-
Cited by 26 (12 self)
- Add to MetaCart
Question answering systems aim to meet users' information needs by returning exact answers in response to a question. Traditional open domain question answering systems are built around a single pipeline architecture. In an attempt to exploit multiple resources as well as multiple answering strategies, systems based on a multi-stream architecture have recently been introduced. Such systems face the challenging problem of having to select a single answer from pools of answers obtained using essentially di#erent techniques. We report on experiments aimed at understanding and evaluating the e#ect of di#erent options for answer selection in a multi-stream question answering system.
Performance Analysis of a Distributed Question Answering System
- IEEE Transactions on Parallel and Distributed Systems
, 2002
"... The problem of question/answering (Q/A) is to find answers to open-domain questions by search-ing large collections of documents. Unlike information retrieval systems, very common today in the form of Internet search engines, Q/A systems do not retrieve documents, but instead provide short, relevant ..."
Abstract
-
Cited by 24 (4 self)
- Add to MetaCart
(Show Context)
The problem of question/answering (Q/A) is to find answers to open-domain questions by search-ing large collections of documents. Unlike information retrieval systems, very common today in the form of Internet search engines, Q/A systems do not retrieve documents, but instead provide short, relevant answers located in small fragments of text. This enhanced functionality comes with a price: Q/A systems are significantly slower and require more hardware resources than informa-tion retrieval systems. This paper proposes a distributed Q/A architecture that: enhances the sys-tem throughput through the exploitation of inter-question parallelism and dynamic load balancing, and reduces the individual question response time through the exploitation of intra-question par-allelism. Inter and intra-question parallelism are both exploited using several scheduling points: one before the Q/A task is started, and two embedded in the Q/A task. An analytical performance model is introduced. The model analyzes both the inter-question parallelism overhead generated by the migration of questions, and the intra-question parallelism overhead generated by the partitioning of the Q/A task. The analytical model indicates that both question migration and partitioning are required for a high-performance system: intra-question
Instance-based question answering: A data driven approach
- In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP-04
, 2004
"... Anticipating the availability of large questionanswer datasets, we propose a principled, datadriven Instance-Based approach to Question Answering. Most question answering systems incorporate three major steps: classify questions according to answer types, formulate queries for document retrieval, an ..."
Abstract
-
Cited by 22 (3 self)
- Add to MetaCart
(Show Context)
Anticipating the availability of large questionanswer datasets, we propose a principled, datadriven Instance-Based approach to Question Answering. Most question answering systems incorporate three major steps: classify questions according to answer types, formulate queries for document retrieval, and extract actual answers. Under our approach, strategies for answering new questions are directly learned from training data. We learn models of answer type, query content, and answer extraction from clusters of similar questions. We view the answer type as a distribution, rather than a class in an ontology. In addition to query expansion, we learn general content features from training data and use them to enhance the queries. Finally, we treat answer extraction as a binary classification problem in which text snippets are labeled as correct or incorrect answers. We present a basic implementation of these concepts that achieves a good performance on TREC test data. 1