Results 1 - 10
of
88
Using Lexical Chains for Text Summarization
, 1997
"... We investigate one technique to produce a summary of an original text without requiring its full semantic interpretation, but instead relying on a model of the topic progression in the text derived from lexical chains. We present a new algorithm to compute lexical chains in a text, merging several r ..."
Abstract
-
Cited by 276 (7 self)
- Add to MetaCart
We investigate one technique to produce a summary of an original text without requiring its full semantic interpretation, but instead relying on a model of the topic progression in the text derived from lexical chains. We present a new algorithm to compute lexical chains in a text, merging several robust knowledge sources: the WordNet thesaurus, a part-of-speech tagger and shallow parser for the ldentification of nominal groups, and a segmentation algorithm derived from (Hearst, 1994) Summarization proceeds in three steps: the original text m first segmented, lexical chains are constructed, strong chains are identified and significant sentences are extracted from the text. We present in this paper empirical results on the identification of strong chain and of significant sentences.
Summarizing Scientific Articles - Experiments with Relevance and Rhetorical Status
- Computational Linguistics
, 2002
"... this paper we argue that scientific articles require a different summarization strategy than, for instance, news articles. We propose a strategy which concentrates on the rhetorical status of statements in the article: Material for summaries is selected in such a way that summaries can highlight the ..."
Abstract
-
Cited by 103 (2 self)
- Add to MetaCart
this paper we argue that scientific articles require a different summarization strategy than, for instance, news articles. We propose a strategy which concentrates on the rhetorical status of statements in the article: Material for summaries is selected in such a way that summaries can highlight the new contribution of the source paper and situate it with respect to earlier work. We provide a gold standard for summaries of this kind consisting of a substantial corpus of conference articles in computational linguistics with human judgements of rhetorical status and relevance. We present several experiments measuring our judges' agreement on these annotations. We also present an algorithm which, on the basis of the annotated training material, selects content and classifies it into a fixed set of seven rhetorical categories. The output of this extraction and classification system can be viewed as a single-document summary in its own right; alternatively, it can be used to generate task-oriented and user-tailored summaries designed to give users an overview of a scientific field.
A Semantics of Contrast and Information Structure for Specifying Intonation in Spoken Language Generation
, 1996
"... ..."
The Effect of Resource Limits and Task Complexity on Collaborative Planning in Dialogue
- Artificial Intelligence Journal
, 1996
"... This paper shows how agents' choice in communicative action can be designed to mitigate the effect of their resource 1/mits in the context of particular features of a collaborative planning task. I first motivate a number of hypotheses about effective language behavior based on a statistical analysi ..."
Abstract
-
Cited by 49 (10 self)
- Add to MetaCart
This paper shows how agents' choice in communicative action can be designed to mitigate the effect of their resource 1/mits in the context of particular features of a collaborative planning task. I first motivate a number of hypotheses about effective language behavior based on a statistical analysis of a corpus of natural collaborative planning dialogues. These hypotheses are then tested in a dialogue testbed whose design is motivated by the corpus analysis. Experiments in the testbed examine the interaction between (1) agents' resource 1/mits in attentional capacity and inferential capacity; (2) agents' choice in communication; and (3) features of communicative tasks that affect task difficulty such as inferential complexity, degree of belief coordination required, and tolerance for errors. The results show that good algorithms for communication must be defined relative to the agents' resource 1/mits and the features of the task. Algorithms that are inefficient for inferentially simple, low coordination or fault-tolerant tasks are effective when tasks require coordination or complex inferences, or are fault-intolerant. The results provide an explanation for the occurrence of utterances in human dialogues that, prima facie, appear inefficient, and provide the basis for the design of effective algorithms for communicative choice for resource limited agents.
Experiments Using Stochastic Search For Text Planning
- IN PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE GENERATION
, 1998
"... Marcu has characterised an important and difficult problem in text planning: given a set of facts' to convey and a set of rhetorical relations that can be used to link them together, how Can one arrange this material so as .to yield the best possible text? We describe experiments with a number of he ..."
Abstract
-
Cited by 47 (11 self)
- Add to MetaCart
Marcu has characterised an important and difficult problem in text planning: given a set of facts' to convey and a set of rhetorical relations that can be used to link them together, how Can one arrange this material so as .to yield the best possible text? We describe experiments with a number of heuristic Search methods for this task.
Centering, Anaphora Resolution, and Discourse Structure
- Centering Theory in Discourse
, 1998
"... Centering was formulated as a model of the relationship between attentional state, the form of referring expressions, and the coherence of an utterance within a discourse segment (Grosz, Joshi and Weinstein, 1986; Grosz, Joshi and Weinstein, 1995). In this chapter, I argue that the restriction of ce ..."
Abstract
-
Cited by 44 (1 self)
- Add to MetaCart
Centering was formulated as a model of the relationship between attentional state, the form of referring expressions, and the coherence of an utterance within a discourse segment (Grosz, Joshi and Weinstein, 1986; Grosz, Joshi and Weinstein, 1995). In this chapter, I argue that the restriction of centering to operating within a discourse segment should be abandoned in order to integrate centering with a model of global discourse structure. The within-segment restriction causes three problems. The first problem is that centers are often continued over discourse segment boundaries with pronominal referring expressions whose form is identical to those that occur within a discourse segment. The second problem is that recent work has shown that listeners perceive segment boundaries at various levels of granularity. If centering models a universal processing phenomenon, it is implausible that each listener is using a different centering algorithm.The third issue is that even for utterances within a discourse segment, there are strong contrasts between utterances whose adjacent utterance within a segment is hierarchically recent and those whose adjacent utterance within a segment is linearly recent. This chapter argues that these problems can be eliminated by replacing Grosz and Sidner's stack model of attentional state with an alternate model, the cache model. I show how the cache model is easily integrated with the centering algorithm, and provide several types of data from naturally occurring discourses that support the proposed integrated model. Future work should provide additional support for these claims with an examination of a larger corpus of naturally occurring discourses.
An Annotation Scheme for Discourse-Level Argumentation in Research Articles
- In Proceedings of the 8th Meeting of the European Chapter of the Association for Computational Linguistics (EACL-99
, 1999
"... In order to build robust automatic ab- stracting systems, there is a need for better training resources than are currently available. In this paper, we introduce an annotation scheme for scientific ar- ticles which can be used to build such a resource in a consistent way. The seven categories ..."
Abstract
-
Cited by 44 (9 self)
- Add to MetaCart
In order to build robust automatic ab- stracting systems, there is a need for better training resources than are currently available. In this paper, we introduce an annotation scheme for scientific ar- ticles which can be used to build such a resource in a consistent way. The seven categories of the scheme are based on rhetorical moves of argumentation.

