Home     Top: Information Retrieval: Extraction    [Classification   Digital Libraries   Extraction   Filtering   Metasearch   Retrieval   Search Engines   World Wide Web]

Change ordering:   Authority   Hubs (tutorials)   Date   Expected authority       Show titles only
Tutorials/surveys/introductory articles (ordered by the degree of citation of authoritative articles)

This directory is created automatically and some papers may be mislabeled. Only document within the CiteSeer database are listed. The directory is intended to provide entry points for browsing the database and is not intended to be authoritative. Papers may not appear in all relevant categories. For example, papers in a sub-category may not appear in higher level categories.

13896.9   The Rhetorical Parsing, Summarization, and Generation of Natural.. - Marcu (1997)   (Correct)
This thesis is an inquiry into the nature of the high-level, rhetorical structure of unrestricted natural language texts, computational means to enable its derivation, and two applications (in automat... / most salient phrases. Information-extraction-based systems

7934.4   A Tutorial on Support Vector Regression - Smola, Schölkopf (1998)   (Correct)
In this tutorial we give an overview of the basic ideas underlying Support Vector (SV) machines for regression and function estimation. Furthermore, we include a summary of currently used algorithms f... /

7831.6   Database Techniques for the World-Wide Web: A Survey - Florescu, Levy, Mendelzon (1998)   (Correct)
The primary goal of this survey is to classify the different tasks to which database concepts have been applied, and to emphasize the technical innovations that are required to do so. We focus on thre... / collection of web sites. Information extraction and integration Certain br Ashish and Craig A. Knoblock. Wrapper generation for semi-structured

7090.0   Machine Learning and Natural Language Processing - Marquez (2000)   (Correct)
In this report, some collaborative work between the fields of Machine Learning (ML) and Natural Language Processing (NLP) is presented. The document is structured in two parts. The first part includes... / and robust parsing information extraction and retrieval automatic

6766.5   Complexity of Lexical Descriptions and its Relevance to Partial.. - Bangalore (1997)   (Correct)
Complexity of Lexical Descriptions and its Relevance to Partial Parsing Srinivas Bangalore Supervisor: Aravind K. Joshi In this dissertation, we have proposed novel methods for robust parsing that int... / part-of-speech tags. In an information extraction task supertags are used

5369.0   Learning And Generalization In The Creation Of Information Extraction .. - Chai (1998)   (Correct)
Computer Science) LEARNING AND GENERALIZATION IN THE CREATION OF INFORMATION EXTRACTION SYSTEMS by Joyce Yue Chai Department of Computer Science Duke University Date: Approved: Dr. Alan W. Bierma... / In The Creation Of Information Extraction Systems By Joyce Yue

5210.3   Relational Learning Techniques for Natural Language Information.. - Califf (1998)   (Correct)
vii Chapter 1 Introduction 1 1.1 Organization of Dissertation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 Chapter 2 Background 5 2.1 Information Extraction . . . . . . . . . .... / for Natural Language Information Extraction Mary Elaine Califf

5068.9   An algorithm toolbox for on-line cursive script recognition - Powalka   (Correct)
This thesis deals with algorithms for on-line cursive script recognition. A novel concept of a recognition toolbox is introduced. Algorithms applicable to various aspects of the recognition are invest... / two approaches to zoning information extraction are applied Guerfali

4898.5   Connectionist, Statistical and Symbolic Approaches to Learning for.. - Wermter, Riloff, Scheler   (Correct)
The purpose of this chapter is to provide an introduction to the field of connectionist, statistical and symbolic approaches to learning for natural language processing, based on the contributions i... / Approaches Learning information extraction patterns from examples

4491.9   Techniques For Automatic Digital Video Composition - Ahanger (1999)   (Correct)
Recent developments in digital technology have enabled a class of video-based applications that were not previously viable. However, digital video production systems face the challenge of accessing th... / . . Information Extraction and Representation .

4358.0   Retrieval of Passages for Information Reduction - Daniels (1996)   (Correct)
Information Retrieval (IR) typically retrieves entire documents in response to a user's information need. However, many times a user would prefer to examine smaller portions of a document. One example... / we could save an automated information extraction system from processing an

4342.0   HERALD: Hybrid Environment for Robust Analysis of Language Data - Ballim, Coray, Pallotta (1999)   (Correct)
This project addresses the problem of performing structural and semantic analysis of data where the syntactic and semantic models of the domain are inadequate, and robust methods must be employed to ... / necessary for advanced information extraction and retrieval from large

4265.3   Generating Natural Language Summaries from Multiple On-Line Sources.. - Radev (1999)   (Correct)
information, highlighting agreements and contradictions among sources on the same topic. We have developed novel techniques and algorithms for combining data from multiple sources at the conceptual l... / coupled with appropriate information extraction technology generates br using linguistic knowledge extraction. Information Processing and

4259.8   Information Extraction for Run-time Formal Analysis - Kim (2001)   (Correct)
The significance of software systems has rapidly increased. The assurance of software systems has become a critical requirement of the information age. Formal verification on the design of a system an... / Information Extraction for Run-time Formal

4134.9   Methods of Category Classification Applied to Word-Sense.. - Wiebe, Bruce (1996)   (Correct)
In this work, we will develop probabilistic classifiers for two challenging and diverse natural language processing (NLP) tasks using a common set of techniques. One classifier will be capable of disa... /

4094.3   Learning to Construct Knowledge Bases from the World Wide Web - Craven, DiPasquo, Freitag, McCallum, .. (1999)   (Correct)
The World Wide Web is a vast source of information accessible to computers, but understandable only to humans. The goal of the research described here is to automatically create a computer understanda... / is to develop a trainable information extraction system that takes two

3978.1   Document Image Compression and Analysis - Kia (1997)   (Correct)
Image compression usually considers the minimization of storage space as its main objective. It is desirable, however, to code images so that we have the ability to process the resulting representatio... /

3930.5   Combining Artificial Intelligence and Databases for Data Integration - Levy (1998)   (Correct)
Data integration is a problem at the intersection of the fields of Artificial Intelligence and Database Systems. The goal of a data integration system is to provide a uniform interface to a multitude ... / Weld. Wrapper induction for information extraction. In Proceedings of the br Ashish and Craig A. Knoblock. Wrapper generation for semi-structured internet

3862.0   A Metrics-Based Approach To The Automated Identification Of.. - Etzkorn (1997)   (Correct)
Software reuse has been a long term goal of software developers. This goal has been rather elusive, but the widespread use of the object-oriented paradigm and other innovations in software development... / Sublanguages . . . Information Extraction Systems . . Semantic

3835.9   Designing An Efficient Distributed Digital Library Database: A Case.. - Annamalai (1997)   (Correct)
xii 1. INTRODUCTION : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : 1 1.1 Digital Libraries : : : : : : : : : : : ... /

3834.4   Learning to Extract Symbolic Knowledge from the World Wide Web - Craven, DiPasquo, Freitag, McCallum, .. (1998)   (Correct)
The World Wide Web is a vast source of information accessible to computers, but understandable only to humans. The goal of the research described here is to automatically create a computer understanda... / is to develop a trainable information extraction system that takes two

3665.5   Fuzzy Finite-state Automata Can Be Deterministically Encoded into.. - Omlin, Thornber, Giles (1998)   (Correct)
There has been an increased interest in combining fuzzy systems with neural networks because fuzzy neural systems merge the advantages of both paradigms. On the one hand, parameters in fuzzy systems h... /

3574.3   Using Dia-MoLE For Unsupervised Learning Of Domain-Specific Dialogue.. - Möller   (Correct)
This report introduces DIA-MOLE, a tool that supports an engineering-oriented approach towards dialogue modelling for a spoken-language interface. Our approach is applied to the domain of appointmen... / reported on discourse-level information extraction Leh . . .

3495.3   Wrapper Induction: Efficiency and Expressiveness - Kushmerick (2000)   (Correct)
The Internet presents numerous sources of useful information---telephone directories, product catalogs, stock quotes, event listings, etc. Recently, many systems have been built that automatically gat... / reserved. Keywords Information extraction Wrapper induction br C. Knoblock Semi-automatic wrapper generation for Internet information

3457.4   Towards Learning Dialogue Structures from Speech Data and Domain.. - Möller   (Correct)
This paper introduces an engineering-oriented approach towards dialogue modelling. While dialogue models in existing dialogue systems usually are manually coded, or at least the data on which they are... / on discourse level information extraction Leh One possible

3353.6   3-D Computer Vision Using Structured Light: Design, Calibration and.. - DePiero, Trivedi   (Correct)
Structured Light (SL) sensing is a well established method of range acquisition for Computer Vision. This chapter provides thorough discussions of design issues, calibration methodologies and implemen... / than one technique for -D information extraction The above

3322.2   Robust Text Analysis: an Overview - Ballim, Pallotta, Lieske (1999)   (Correct)
Short abstract Contents 1 Introduction 2 1.1 Motivations and Goals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 1.2 Empiric evidence for Brittleness . . . . . . . . . . ... /

3235.0   Cyclostationary And Higher-Order Statistical Signal Processing.. - McCormick (1998)   (Correct)
In this thesis the problem of monitoring the condition of machinery is addressed using cyclostationary and higher-order statistical analysis of vibration signals. These have the potential to provide i... / disrupts more traditional information extraction procedures such as power

3224.3   Instructable and Adaptive Web-Agents which Learn to Categorize and.. - Eliassi-Rad (1999)   (Correct)
this paper. Section 3 describes our internal representation of Web pages, the major predicates in our advice language, how advice is mapped into neural networks, the mechanisms for re ning advice base... / . . Information Extraction . br text. . Information Extraction Information extraction IE

3181.4   A Computational Theory Of Vocabulary Expansion - Ehrlich, Rapaport (1997)   (Correct)
This project concerns the development and implementation of a computational theory of how human readers and other natural-language-understanding systems can automatically expand their vocabulary by de... / message-processing or information-extraction systems need to be

3064.3   A Machine Learning Approach to POS Tagging - Màrquez, Padró, Rodríguez (1998)   (Correct)
We have applied the inductive learning of statistical decision trees and relaxation labelling to the Natural Language Processing (nlp) task of morphosyntactic disambiguation (Part Of Speech Tagging)... / to syntactic parsing in information extraction and retrieval e.g.

3061.0   Financial Information Extraction using pre-defined and user-definable .. - Costantino (1997)   (Correct)
Financial operators have today access to an extremely large amount of data, both quantitative and qualitative, real-time or historical and can use this information to support their decision-making pro... / of Durham Financial Information Extraction using pre-defined and

2995.1   Intelligent Internet Systems - Levy, Weld (2000)   (Correct)
this article. This work was funded by Oce of Naval Research Grant N00014-98-1-0147, by National Science Foundation Grants IRI-9303461 and IIS-9978567, by ARPA / Rome Labs grant F30602-95-1-0024, and b... / more ambitious than simple information extraction they seek to br automates the bulk of the wrapper-generation process with a combination

2971.6   Data Visualization, Indexing and Mining Engine - A Parallel Computing .. - Meng, Chen, Fowler, Fox.. (1998)   (Correct)
ion : : : : : : : : : : : 9 3.2 Spatial Representation Of the System's Networks : : : : : : : : : : : : : : : : : : : : : : 10 3.3 Display And Interaction Mechanisms : : : : : : : : : : : : : : : : : ... / resource discovery information extraction and coordination of

2939.7   Context-Dependent Reasoning With Lexical Knowledge Using Default Logic - Anthony Hunter (1998)   (Correct)
Lexical knowledge is increasingly important in language engineering. However, it is a difficult kind of knowledge to represent and reason with. Existing approaches to formalizing lexical knowledge hav... / information filtering and information extraction. To address this we need

2770.2   Harvest: A Scalable, Customizable Discovery and Access System - Bowman (1995)   (Correct)
Rapid growth in data volume, user base, and data diversity render Internet-accessible information increasingly difficult to use effectively. In this paper we introduce Harvest, a system that provides ... / the Essence customized information extraction system the Indie

2752.9   Regularized Principal Manifolds - Smola, Mika, Schölkopf, Williamson (1999)   (Correct)
Many settings of unsupervised learning can be viewed as quantization problems --- the minimization of the expected quantization error subject to some restrictions. This allows the use of tools such ... / reflects our emphasis on information extraction by learning the coding

2704.5   Document Image Retrieval With Improvements In Database Quality - Kauniskangas (1999)   (Correct)
Modern technology has made it possible to produce, process, transmit and store digital images efficiently. Consequently, the amount of visual information is increasing at an accelerating rate in many ... /

2671.9   Recurrent Neural Networks Learn Deterministic Representations of.. - Omlin, Giles   (Correct)
The paradigm of deterministic finite-state automata (DFAs) and their corresponding regular languages have been shown to be very useful for addressing fundamental issues in recurrent neural networks. T... /

2653.7   PEA - a Personal Email Assistant with Evolutionary Adaptation - Winiwarter (1999)   (Correct)
In this paper we present PEA, a Personal Email Assistant, which filters incoming emails and ranks them according to their relevance. We provide tools for the acquisition of individual user models, whi... / analysis Local documents Information extraction Templates Display of

2624.9   Learning Text Analysis Rules For Domain-Specific Natural Language.. - Soderland (1997)   (Correct)
LEARNING TEXT ANALYSIS RULES FOR DOMAIN-SPECIFIC NATURAL LANGUAGE PROCESSING FEBRUARY 1997 STEPHEN G. SODERLAND B.Sc., STANFORD UNIVERSITY M.Sc., UNIVERSITY OF MASSACHUSETTS AMHERST Ph.D., UNIVERSITY ... / who helped define the information extraction task and supervised the

2611.9   Equivalence in Knowledge Representation: Automata, Recurrent Neural.. - Giles, Omlin, Thornber   (Correct)
Neuro-fuzzy systems - the combination of artificial neural networks with fuzzy logic - have become useful in many application domains. However, conventional neuro-fuzzy models usually need enhanced re... / citeblanco tr through information extraction methods where

2571.1   Incremental Concept Learning for Bounded Data Mining - Case, Jain, Lange, Zeugmann (1999)   (Correct)
Important refinements of concept learning in the limit from positive data considerably restricting the accessibility of input data are studied. Let c be any concept; every infinite sequence of element... / data mining knowledge extraction information discovery data pattern

2487.3   Satisfying Constraints on Extraction and Adjunction - Bouma, Malouf, Sag (1997)   (Correct)
This paper is much improved thanks to our interactions with them. In addition, we thank Ann Copestake, Dan Flickinger, and Gertjan van Noord for helpful suggestions. Finally, this research was conduct... / might be sensitive to extraction information e.g. a phenomenon that

2468.0   I don't believe in word senses - Kilgarriff (1997)   (Correct)
Word sense disambiguation assumes word senses. Within the lexicography and linguistics literature, they are known to be very slippery entities. The paper looks at problems with existing accounts of `w... / exercises in information extraction MUC- in

2461.1   Content-Based Book Recommending Using Learning for Text Categorization - Mooney, Roy (2000)   (Correct)
Recommender systems improve access to relevant products and information by making personalized suggestions based on previous examples of a user's likes and dislikes. Most existing recommender systems ... / system that utilizes information extraction and a machine-learning

2458.9   Efficient Dynamic Dispatch without Virtual Function Tables. The.. - Zendra, Colnet, Collin (1997)   (Correct)
SmallEiffel is an Eiffel compiler which uses a fast simple type inference mechanism to remove most late binding calls, replacing them by static bindings. Starting from the system's entry point, it com... / languages is information extraction type checking and

2403.7   Grammars Have Exceptions - Crescenzi, Mecca (1998)   (Correct)
Extending database-like techniques to semi-structured and Web data sources is becoming a prominent research field. These data sources are essentially collections of textual documents. Hence, in this c... / Minerva a formalism for wrapper generation over semi-structured and

2396.3   Reasoning with inconsistency in structured text - Hunter (1999)   (Correct)
Reasoning with inconsistency involves some compromise on classical logic. There is a range of proposals for logics (called paraconsistent logics) for reasoning with inconsistency each with pros and ... / and the output from information extraction systems in the form of

2344.0   Learning for Semantic Interpretation: Scaling Up Without Dumbing Down - Mooney (1999)   (Correct)
Most recent research in learning approaches to natural language have studied fairly "lowlevel " tasks such as morphology, part-of-speech tagging, and syntactic parsing. However, I believe that logical... / discourse processing and information extraction Cardie and

2339.9   Cut and Paste - Mecca, Atzeni (1998)   (Correct)
The paper develops Editor, a language for manipulating semi-structured documents, such as the ones typically available on the Web. Editor programs are based on two simple ideas, taken from text editor... / project as a basis for a wrapper-generation toolkit. Introduction

2295.4   Tabling for Non-monotonic Programming - Swift (1999)   (Correct)
this paper we describe tabling as it is implemented in the XSB system unknown Annals of Mathematics and Artificial Intelligence 0 (1999) ?--? 1 Tabling for Non-monotonic Programming Terrance Swift D... / psychiatric diagnosis information extraction from poorly structured

2277.5   Semantic Matching: Formal Ontological Distinctions for Information.. - Guarino (1997)   (Correct)
The task of information extraction can be seen as a problem of semantic matching between a user-defined template and a piece of information written in natural language. To this purpose, the ontologi... / Summer School on Information Extraction Frascati July -

2268.0   Information Extraction: Beyond Document Retrieval - Gaizauskas, Wilks (1998)   (Correct)
In this paper we give a synoptic view of the growth text processing technology of information extraction (IE) whose function is to extract information about a pre-specified set of entities, relations ... / Society of R.O.C. Information Extraction Beyond Document Retrieval

2258.2   Nonlinear Extensions To The Minimum Average Correlation Energy Filter - Fisher, III (1997)   (Correct)
ix CHAPTERS 1 INTRODUCTION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .... / Mapping as feature extraction. Information content is measured in

2241.4   Enriching the WordNet Taxonomy with Contextual Knowledge Acquired.. - Sanda Harabagiu (1999)   (Correct)
This paper presents a possible solution for the problem of integrating contextual knowledge in the WordNet database. Contextual structures are derived from three sources: (1) minimal contexts - in the... / of domain patterns for information extraction. The Basic Idea

2205.7   Information Extraction from World Wide Web - A Survey - Eikvil (1999)   (Correct)
This report will first, in Chapter 2, give an introduction to the field of information extraction and then, in Chapter 3, look at the development of the field of wrapper generation for Web sources. Ch... / Information Extraction from World Wide Web br Chapter Information Extraction Information extraction IE is

2127.5   Learning to Resolve Natural Language Ambiguities: A Unified Approach - Roth (1998)   (Correct)
We analyze a few of the commonly used statistics based and machine learning algorithms for natural language disambiguation tasks and observe that they can be recast as learning linear separators in th... / machine translation information extraction and intelligent

2093.7   A default logic based framework for context-dependent reasoning with.. - Hunter (1999)   (Correct)
Lexical knowledge is increasingly important in information systems --- for example in indexing documents using keywords, or disambiguating words in a query to an information retrieval system, or a n... /

2081.7   Advantages of Decision Lists and Implicit Negatives in Inductive.. - Califf, Mooney (1996)   (Correct)
This paper demonstrates the capabilities of Foidl, an inductive logic programming (ILP) system whose distinguishing characteristics are the ability to produce first-order decision lists, the use of an... / to learn patterns for information extraction Califf Mooney

2078.9   Machine Learning for Information Extraction from Online Documents - Freitag (1996)   (Correct)
The field of information extraction (IE) is concerned with applying natural language processing (NLP) to extract essential details from text documents automatically. Recent results have demonstrated t... / Machine Learning for Information Extraction from Online Documents

2067.7   BLT: Bi-Layer Tracing of HTTP and TCP/IP - Feldmann (2000)   (Correct)
We describe BLT, a tool for extracting full HTTP level as well as TCP level traces via packet monitoring. This paper presents the software architecture that allows us to collect traces continuously, o... / is easier than layer information extraction because the switch is in

2065.4   A Conceptual Framework for Text Filtering - Oard, Marchionini (1996)   (Correct)
This report develops a conceptual framework for text filtering practice and research, and reviews present practice in the field. Text filtering is an information seeking process in which documents are... / Stable and Structured Information Extraction Specific Unstructured

2054.3   SNePS: A Logic for Natural Language Understanding and Commonsense.. - Shapiro (1999)   (Correct)
The use of logic for knowledge representation and reasoning systems is controversial. There are, indeed, several ways that standard First Order Predicate Logic is inappropriate for modelling natural l... /

2023.2   Intelligent Information Gathering Using Decision Models - Zilberstein, Lesser (1996)   (Correct)
This paper describes an architecture for the next generation of information gathering systems. The paper is based on a research proposal whose goal is to exploit the vast amount of information sources... / retrieval IR and information extraction IE technologies will

1991.3   Automating the Construction of Internet Portals with Machine Learning - McCallum, Nigam, Rennie, Seymore   (Correct)
Internet portals are growing in popularity because they gather content from the Web and organize it for easy access, retrieval and search. For example, www.campsearch.com allows complex queries by a... / in reinforcement learning information extraction and text classification br search. . Information Extraction Information extraction is concerned

1983.4   A Computational Theory of Vocabulary Acquisition - Rapaport, Ehrlich (1998)   (Correct)
As part of an interdisciplinary project to develop a computational cognitive model of a reader of narrative text, we are developing a computational theory of how natural-language-understanding systems... / message-processing and information-extraction systems need to be robust

1981.0   The Indexing and Retrieval of Document Images: A Survey - Doermann (1998)   (Correct)
The economic feasibility of maintaining large databases of document images has created a tremendous demand for robust ways to access and manipulate the information these images contain. In an attempt ... / image databases including information extraction and indexing and

1959.2   Learning Concepts Incrementally With Bounded Data Mining - Case, Jain, Lange, Zeugmann (1997)   (Correct)
Important refinements of incremental concept learning from positive data considerably restricting the accessibility of input data are studied. Let c be any concept; every infinite sequence of elements... / data mining knowledge extraction information discovery data pattern

1952.8   Word Sense Disambiguation And Its Application To Internet Search - Mihalcea (1999)   (Correct)
ambiguation method presented here is that it provides a ranking of possible associations between words senses, rather than a binary yes/no decision for a possible sense combination. This proves to be ... / operators de ned for information extraction improve both the

1935.5   Improving Minority Class Prediction Using Case-Specific Feature.. - Cardie (1997)   (Correct)
This paper addresses the problem of handling skewed class distributions within the case-based learning (CBL) framework. We first present as a baseline an informationgain -weighted CBL algorithm and ap... / and the acquisition of information extraction patterns i.e.concept br semantic class and concept extraction information for each of these content

1925.6   Logic Programs for Intelligent Web Search - Thomas (1999)   (Correct)
We present a general framework for the information extraction from web pages based on a special wrapper language, called token-templates. By using tokentemplates in conjunction with logic programs we... / a general framework for the information extraction from web pages based on a br . N. Ashish and C. Knoblock. Wrapper generation for semistructured internet

1925.6   Intelligent Web Querying with Logic Programs - Bernd Thomas (1998)   (Correct)
We present a general framework for the information extraction from web pages based on a special wrapper language, called token-templates. By using token-templates in conjunction with logic programs we... / a general framework for the information extraction from web pages based on a br N. Ashish and C. Knoblock. Wrapper generation for semistructured internet

1920.4   A Layered Architecture for Querying Dynamic Web Content - Davulcu, Freire, Kifer, Ramakrishnan (1999)   (Correct)
The design of webbases, database systems for supporting Webbased applications, is currently an active area of research. In this paper, we propose a 3-layer architecture for designing and implementing ... / and querying the Web information extraction and integration continues

1904.8   New Directions in Video Information Extraction and Summarization - Wactlar   (Correct)
The Informedia Digital Video Library project provided a technological foundation for full content indexing and retrieval of video and audio media. New directions for this research extend to: (1) searc... / New Directions in Video Information Extraction and Summarization Howard

1900.5   Graph Matching Using a Direct Classification of Node Attendance - DePiero, Trivedi, al. (1996)   (Correct)
An algorithm has been developed that finds isomorphisms between both graphs and subgraphs. The development is introduced in the object recognition problem domain. The method isolates matching subgraph... /

1896.1   Case-Based Reasoning - Survey and Future Directions - Bartsch-Spörl, Lenz, Hübner (1999)   (Correct)
This paper surveys the field of case-based reasoning (CBR) - both in science and in industrial applications. It starts with a short introduction to the essential ideas and concepts CBR is built upon... /

1880.3   SCREEN: Learning a Flat Syntactic and Semantic Spoken Language.. - Wermter, Weber (1997)   (Correct)
Previous approaches of analyzing spontaneously spoken language often have been based on encoding syntactic and semantic knowledge manually and symbolically. While there has been some progress using st... / Dyer tasks like information extraction from spoken language do

1869.0   Can we make Information Extraction more adaptive? - Wilks, Catizone (1999)   (Correct)
It seems widely agreed that IE (Information Extraction) is now a tested language technology that has reached precision+recall values that put it in about the same position as Information Retrieval a... / Can we make Information Extraction more adaptive Yorick

1835.8   COSY-MATS: An Intelligent and Scalable Summarisation Shell - Aretoulaki (1997)   (Correct)
In this paper, an architecture is presented for robust and portable summarisation, cosy-mats. cosy-mats can avoid the superficiality and domain-dependence of ie approaches by means of high-level (pr... / There are Information Extraction ie environments

1790.7   Modeling of Moving Objects in a Video Database - Li, Özsu, Szafron (1997)   (Correct)
Modeling moving objects has become a topic of increasing interest in the area of video databases. Two key aspects of such modeling are spatial and temporal relationships. In this paper we introduce an... / are used. The motion information extraction is then used at an

1787.6   Using default logic for lexical knowledge - Hunter (1997)   (Correct)
Lexical knowledge is knowledge about the morphology, grammar, and semantics of words. This knowledge is increasingly important in language engineering, and more generally in information retrieval, i... / information filtering and information extraction. Perhaps the most

1740.6   Some Organizing Principles For A Unified Top-Level Ontology - Guarino (1997)   (Correct)
object (Pythagoras' theorem) Quality (the color of a particular piece of plasticine) Fig. 1. The basic "backbone" in the ontology of particulars. A location is either a region of (absolute) space o... /

1730.5   Intelligent Techniques for the Extraction and Integration of.. - Bergamaschi Castano (1999)   (Correct)
Developing intelligent tools for the integration of information extracted from multiple heterogeneous sources is a challenging issue to effectively exploit the numerous sources available on-line in gl... / techniques to information extraction and integration which

1688.7   IR and AI: traditions of representation and anti-representation in.. - Wilks   (Correct)
The paper is concerned with the role of conceptual representations in access to information, as for example, from the World Wide Web. It contrasts two quite different traditions for doing this: In... / IR and more recently Information Extraction IE a development of

1665.8   XWRAP: An XML-enabled Wrapper Construction System for Web Information .. - Liu, Pu, Han (2000)   (Correct)
The amount of useful semi-structured data on the web continues to grow at a stunning pace. Often interesting web data are not in database systems but in HTML pages, XML pages, or text files. Data in t... / developers as declarative information extraction rules. The second phase br the XML documents. The XWRAP wrapper generation framework has three distinct

1664.6   A Cognitive Bias Approach to Feature Selection and Weighting for.. - Cardie   (Correct)
Research in psychology, psycholinguistics, and cognitive science has discovered and examined numerous psychological constraints on human information processing. Short term memory limitations, a focu... / be critical for the larger information extraction task within which the

1658.9   A Parallel System for Textual Inference - Harabagiu, Moldovan (1999)   (Correct)
This paper presents a possible solution for the text inference problem - extracting information unstated in a text, but implied. Text inference is central to natural language applications such as info... / applications such as information extraction and dissemination text

1647.3   Information Processing by a Perceptron in an Unsupervised Learning.. - Nadal (1993)   (Correct)
We study the ability of a simple neural network (a perceptron architecture, no hidden units, binary outputs) to process information in the context of an unsupervised learning task. The network is aske... / Data analysis is a form of information extraction. To give an example for

1624.3   Information Extraction & Database techniques: a user-oriented.. - Lacroix, Sahuguet, Chandrasekar   (Correct)
We propose a novel approach to querying the Web with a system named AKIRA (Agentive Knowledge-based Information Retrieval Architecture) which combines advanced technologies from Information Retrieva... / Information Extraction Database techniques a

1602.4   Building Domain-Specific Search Engines with Machine Learning.. - McCallum, Nigam, Rennie, Seymore (1999)   (Correct)
Domain-specific search engines are becoming increasingly popular because they offer increased accuracy and extra features not possible with the general, Web-wide search engines. For example, www.camps... / text classification and information extraction that automates efficient br Information Extraction Information extraction is concerned

1601.1   Learning to Extract Keyphrases from Text - Turney (1999)   (Correct)
Many academic journals ask their authors to provide a list of about five to fifteen key words, to appear on the first page of each article. Since these key words are often phrases of two or more words... / is also distinct from information extraction the task that has been

1600.5   Retrieval and Reasoning in Distributed Case Bases - Nagendra Prasad (1995)   (Correct)
The proliferation of electronically available networked information has led researchers to examine the issues involved in developing automated methods for gathering information in response to a query ... / Much of the work in information extraction and text summarization

1598.4   Learning to Parse Natural Language Database Queries into Logical Form - Thompson (1997)   (Correct)
For most natural language processing tasks, a parser that maps sentences into a semantic representation is significantly more useful than a grammar or automata that simply recognizes syntactically wel... / such as question answering information extraction summarizing or

1580.4   A Parallel System for Text Inference Using Marker Propagations - Harabagiu, Moldovan (1998)   (Correct)
This paper presents a possible solution for the text inference problem---extracting information unstated in a text, but implied. Text inference is central to natural language applications such as info... / applications such as information extraction and dissemination text

1577.5   Describing Abstraction in Rendered Images through Figure Captions - Hartmann, Preim, Strothotte   (Correct)
ion in Rendered Images through Figure Captions Knut Hartmann, 1 Bernhard Preim, 2 Thomas Strothotte 2 1 Institute for Knowledge and Language Engineering 2 Department of Simulation and Graphic... / adapt visualizations to the information extraction task of the user. For

1576.6   Ontology-Based Extraction and Structuring of Information from.. - Embley, Campbell, Liddle, Smith (1998)   (Correct)
We can extract and structure information from documents if we can match attributes with document data values and associate these matched attribute-value pairs as tuples in relations. In this paper we ... / data semistructured data information extraction information structuring br data information extraction information structuring ontology

1552.7   Foreground and Background Lexicons and Word Sense Disambiguation for.. - Kilgarriff (1997)   (Correct)
this paper I look at this issue in relation to one particular NLP task, Information Extraction (hereafter IE), and one subtask for which both lexical and general knowledge are required, Word Sense Dis... / Sense Disambiguation for Information Extraction Adam Kilgarriff July

1532.9   An Overview of Document Mining Technology - Dixon (1997)   (Correct)
Living through the Information Revolution is becoming a difficult task - humans were not designed to process massive quantities of information. The computer first found it's use in speeding our number... / mining text mining information extraction information retrieval br text mining information extraction information retrieval data mining

1530.0   MILK: a Hybrid system for Multilingual Indexing and Information.. - Bolioli, Dini, Di Tomaso, Goy.. (1997)   (Correct)
Substance Countable Countable Substance Human Animal Inanimate Figure 2: LKML hierarchy: top level. tic typing and recursive typing (identifying larger semantic tags) over a text is a "semi-structure... / Multilingual Indexing and Information Extraction A. Bolioli and L. br between information extraction and information retrieval in a web

1525.5   A Next Generation Information Gathering Agent - Lesser, Horling, Klassner, Raja.. (1998)   (Correct)
The World Wide Web has become an invaluable information resource but the explosion of information available via the web has made web search a time consuming and complex process. Index-based search eng... / planning text processing information extraction IE and

1517.1   Real Time Classification of Rotating Shaft Loading Conditions using.. - McCormick, Nandi (1996)   (Correct)
Vibration analysis can give an indication of the condition of a rotating shaft highlighting potential faults such as unbalance and rubbing. Faults may however only occur intermittently and consequentl... /

1513.7   Indexing and Retrieval of Scientific Literature - Lawrence, Bollacker, Giles (1999)   (Correct)
The web has greatly improved access to scientific literature. However, scientific articles on the web are largely disorganized, with research articles being spread across archive sites, institution si... / citation indexing information extraction display of

1509.6   Designing Intelligent Interfaces For Users With Memory And Language.. - Singh (2000)   (Correct)
The main contribution of this paper is to discuss in depth the issues related to the design of computer interfaces for users with language limitations. Language limitations are found to various degree... / information retrieval and information extraction databases and software

1507.3   Dynamic Information Filtering - Baudisch (2001)   (Correct)
The goal of information filtering systems (IF systems) is to support users in finding relevant information from a dynamic base of data objects. IF systems base their relevance computations on so-calle... /

1498.6   From Lexical Cohesion to Textual Coherence: -- A Data Driven.. - Sanda Harabagiu Department   (Correct)
This paper presents research that connects the cohesion structure of a text to the derivation of its coherence structure. Two different algorithms that derive the cohesion structure in the form of lex... / of the language technology information extraction and textual

1483.0   Reasoning about Textual Similarity in a Web-Based Information Access.. - Cohen   (Correct)
The degree to which information sources are pre-processed by Webbased information systems varies greatly. In search engines like Altavista, little pre-processing is done, while in "knowledge integra... / retrieval similarity information extraction . Introduction There br data is to partially automate wrapper generation Wrapper automation

1475.3   Semi-automatic Wrapper Generation for Internet Information Sources - Ashish (1997)   (Correct)
To simplify the task of obtaining information from the vast number of informationsources that are available on the WorldWide Web (WWW),we are buildinginformationmediators for extracting and integratin... / Wrapper induction for information extraction. In International Joint br Semi-automatic Wrapper Generation for Internet Information

1466.4   Processing Natural Language Software Requirement Specifications - Osborne, MacNish (1996)   (Correct)
Ambiguity in requirement specifications causes numerous problems; for example in defining customer/supplier contracts, ensuring the integrity of safetycritical systems, and analysing the implications ... / machine translation information extraction and natural language

1460.9   Wrapper Generation for Web Accessible Data Sources - Jean-Robert Gruser (1998)   (Correct)
There is an increase in the number of data sources that can be queried across the WWW. Such sources typically support HTML forms-based interfaces and search engines query collections of suitably index... / Wrapper Generation for Web Accessible Data

1458.0   Image Fusion - Maître, Bloch (1997)   (Correct)
We present the main lines along which information fusion has evolved from the first days of data fusion up to image fusion, then we discuss some of the reasons why image fusion cannot benefit from man... / are they combined . Information extraction from images It may

1453.0   Text-Based Approaches for the Categorization of Images - Sable, Hatzivassiloglou (1999)   (Correct)
The rapid expansion of multimedia digital collections brings to the fore the need for classifying not only text documents but their embedded non-textual parts as well. We propose a model for basin... / by using for example information extraction methods Wacholder et

1452.9   LE PROJECT No 2110 - Extraction Of   (Correct)
Description of a system for the automatic acquisition of verbal case frames from corpora. The key target is to acquire domain-specific relations rather than the standard relationships found in general... / useful in lexically driven information extraction. As for information

1451.4   Information Extraction as a Stepping Stone toward Story Understanding - Riloff (1999)   (Correct)
this article, we will refer to extraction patterns and case frames interchangeably, with the understanding that case frames for other tasks may be significantly more complex. unknown In Understanding ... / from MIT Press. Information Extraction as a Stepping Stone toward br . What is information extraction Information extraction is a

1432.3   From IR to IE through GL - Bolioli, Dini, Di Tomaso, Sestero (1997)   (Correct)
This paper describes the project MILK (Multilingual Indexing based on Lexical Knowledge), a cooperation between University of Brandeis and CELI (Centro per l'Elaborazione del Linguaggio e dell'Informa... / on the interaction between information extraction and information retrieval br between information extraction and information retrieval in a web

1426.2   Merging potentially inconsistent items of structured text - Hunter (2000)   (Correct)
Structured text is a general concept that is implicit in a variety of approaches to handling information. Syntactically, an item of structured text is a number of grammatically simple phrases togeth... / and the output from information extraction systems in the form of

1423.3   Incremental Interpretation: Applications, Theory, And Relationship To .. - Milward, Cooper (1994)   (Correct)
Why should computers interpret language incrementally ? In recent years psycholinguistic evidence for incremental interpretation has become more and more compelling, suggesting that humans perform sem... / of dialogues include information extraction from conversational

1413.9   Visual Motion Estimation based on Motion Blur Interpretation - Ioannis Rekleitis (1995)   (Correct)
When the relative velocity between the different objects in a scene and the camera is relative large -- compared with the camera's exposure time -- in the resulting image we have a distortion called m... / . . Information extraction from the Cepstrum

1408.0   Toward Team-Oriented Programming - Pynadath, Tambe, Chauvat, Cavedon (1999)   (Correct)
The promise of agent-based systems is leading towards the development of autonomous, heterogeneous agents, designed by a variety of research/industrial groups and distributed over a variety of platf... / environments and information extraction on the Internet

1404.9   Information Retrieval: Still Butting Heads with Natural Language.. - Smeaton (1997)   (Correct)
Information retrieval (IR) is about finding documents which may be of relevance to a user's query, from within a corpus or collection of texts. While apparently a simple task at first glance, IR is ... / on the whole IR task. Information extraction is also fundamentally br task. . Information Extraction Information extraction IE is a

1385.8   Constructing Biological Knowledge Bases by Extracting Information.. - Craven, Kumlien (1999)   (Correct)
Recently, there has been much effort in making databases for molecular biology more accessible and interoperable. However, information in text form, such as MEDLINE records, remains a greatly underuti... / in learning such information-extraction routines. We also present br as one of information extraction. Information extraction IE involves

1384.9   From local to global coherence: A bottom-up approach to text planning - Marcu (1997)   (Correct)
We present a new, data-driven approach to text planning, which can be used not only to map full knowledge pools into natural language texts, but also to generate texts that satisfy multiple, high-leve... / by systems developed for information extraction tasks one can use the

1381.1   BIG: A Resource-Bounded Information Gathering and Decision Support.. - Lesser, Horling, Klassner, Raja.. (1998)   (Correct)
The World Wide Web has become an invaluable information resource but the explosion of available information has made web search a time consuming and complex process. The large number of information so... / retrieval IR and information extraction IE technologies

1377.6   TextNet - A Text-Based Intelligent System - Harabagiu (1996)   (Correct)
A large collection of texts may be reached through the Internet and this provides a powerful platform from which common-sense knowledge may be gathered. This paper presents a system that contains a co... / abductive inference information extraction on Internet coherence

1373.6   Text Categorization Through Probabilistic Learning: Applications to.. - Bennett (1998)   (Correct)
Author: Paul N. Bennett Title: Text Categorization Through Probabilistic Learning: Applications to Recommender Systems Supervising Professor: Raymond J. Mooney, Ph.D. With the growth of the World Wide... / . . Information Extraction . br . Information Extraction Information Extraction attempts to

1370.9   Automatic Template Creation for Information Extraction, an Overview - Collier (1996)   (Correct)
Information Extraction (IE) approaches currently assume that a template exists which sufficiently defines the requirements of the task. Substantial human effort is required to generate these basic ... / Template Creation for Information Extraction an Overview Robin

1370.4   Term Sense Disambiguation Using a Domain-Specific Thesaurus - Maynard, Ananiadou (1998)   (Correct)
Term extraction is important for many information systems applications. Although terms should be monoreferential, in reality they exhibit a high degree of ambiguity. This paper describes a method for ... / applications such as information extraction and retrieval dictionary

1369.6   Applying Machine Learning for High Performance Named-Entity Extraction - Baluja, Mittal, Sukthankar (2000)   (Correct)
This paper describes a machine learning approach to build an efficient, accurate and fast name spotting system. Finding names in free text is an important task in addressing real-world text-based appl... / approaches. Keywords information extraction machine learning

1368.4   Using HTML Formatting to Aid in Natural Language Processing on the.. - DiPasquo (1998)   (Correct)
Because of its magnitude and the fact that it is not computer understandable, the World Wide Web has become a prime candidate for automatic natural language tasks. This thesis argues that there is inf... / . Learning Rules for Information Extraction . . The Data Set br problems of information extraction and information retrieval over over a

1366.5   CLELIA Computational Logics Environment for natural Language.. - Pallotta (1998)   (Correct)
Intelligent information exchanging seems to be one of the most challenging task among those involving hybrid human-computer interactions. A central issues is how to model various types of interaction ... /

1344.1   Distributed Knowledge Networks - Vasant Honavar (1998)   (Correct)
Distributed Knowledge Networks (DKN) provide some of the key enabling technologies for translating recent advances in automated data acquisition, digital storage, computers and communications into fun... / for information retrieval information extraction assimilation and

1339.2   Extraction of Keyphrases from Text: Evaluation of Four Algorithms - Turney (1997)   (Correct)
This report presents an empirical evaluation of four algorithms for automatically extracting keywords and keyphrases from documents. The four algorithms are compared using five different collections o... / work addresses the task of information extraction. An information

1337.8   Evaluation of a Semantic Data Model for Chest Radiology: Application.. - Roberto Rocha Stanley   (Correct)
Syntax Notation One Tool (AsnTool), release 2.0, developed by the National Center for Biotechnology Information (NCBI) [50]. Appendix B contains the encoded version of one of the reports in the studys... / sublanguage. For instance information extraction processes and

1333.3   A sequential model for attentive object selection - Fellenz (1994)   (Correct)
A biologically motivated model for object selection is proposed which combines strategies for preattentive segmentation and attentive object selection to extract consistent descriptions of objects in ... / nonselective unidirectional information extraction There are some major

1330.7   Using Object-Grammars for Internet Data Warehousing - Faulstich, Spiliopoulou, Linnemann (1997)   (Correct)
The increasing amount of information available in the web demands sophisticated querying methods and knowledge discovery techniques. In this study, we introduce our model WIND for a data warehouse ove... /

1323.5   Generating Finite-State Transducers For Semi-Structured Data.. - Hsu, Dung (1998)   (Correct)
Integrating a large number of Web information sources may significantly increase the utility of the World-Wide Web. A promising solution to the integration is through the use of a Web Information medi... / Data Wrapper Induction Information Extraction World Wide Web. . br C.A. Knoblock. Semi-automatic wrapper generation for internet information

1311.2   Towards a Hybrid - Br Id (1997)   (Correct)
Generation System Maria ARETOULAKI Dept. of Pattern Recognition (Informatik 5), University of Erlangen-Nuremberg, Martensstrasse 3, 91058 Erlangen, Germany. Tel: +49 9131 857824 Fax: +49 9131 30381... / methodology is the DIDEROT information extraction system as presented in

1309.2   Classifying Software Process Models Based on Natural Language.. - Ernst Ellmer, Dieter Merkl (1996)   (Correct)
Reuse of the valuable knowledge gained through the realization of software projects is an important step in order to overcome the well-known problems of software industry as for example wrong schedule... /

1307.8   A Statistical Information Extraction System for Turkish - Tür (1999)   (Correct)
Information Extraction (IE) is the process of analyzing natural language text or speech, and collecting information about specified types of entities, relationships, or events, such as marking perso... / A Statistical Information Extraction System for Turkish

1298.7   Learning Information Extraction Rules for Semi-structured and Free.. - Soderland (1999)   (Correct)
A wealth of on-line text information can be made available to automatic processing by information extraction (IE) systems. Each IE application needs a separate set of rules tuned to the domain and w... / The Netherlands. Learning Information Extraction Rules for Semi-structured br N. and Knoblock C. Wrapper generation for semi-structured Internet

1293.4   Algebraic Video for Composition and Content-Based Access - Weiss, Duda, Gifford   (Correct)
We introduce a new data model called algebraic video that provides operations for the composition, search, navigation and playback of digital video presentations. Video presentations are composed usin... / feasible other forms of information extraction can be employed. Text

1289.8   Proposal for A Framework for the High-Precision Identification of.. - Cardie, Mardis (1997)   (Correct)
Current research in Information Retrieval and Information Extraction demands high-precision syntactic and semantic information from natural language text. We propose a plan for developing a framework ... / Information Retrieval and Information Extraction demands high-precision

1284.1   A Wrapper Generation toolkit to specify and construct Wrappers for.. - Bright, Gruser, Raschid, Vidal (1999)   (Correct)
this paper, is generating a new paradigm of JDBC compliant wrappers [19] for WebSources. In addition to answering queries, the wrapper will provide information on the capability of the WebSource, i.e.... / pp - Freitag D.Information Extraction from HTML Application of br A Wrapper Generation toolkit to specify and

1273.8   Intelligent Information Access in the Web: ML based User Modeling for .. - Müller (1999)   (Correct)
It is a well known fact that high precision search for documents concerning a certain topic in the World Wide Web (Www) is a tough problem. Index based search engines vary in recall (with a coverage o... / monolithic architecture for information extraction and integration of

1257.7   Recent Advances in Motion Understanding - Beauchemin, Bajcsy, Barron   (Correct)
Probably the most ambitious goal of Computer Vision is to build the universal vision machine, capable of guiding itself through arbitrary environments, recognizing objects along its path and reachin... / agent the amount of information extraction from the spatiotemporal

1250.7   Layout Appropriateness: A metric for evaluating user interface widget .. - Sears (1993)   (Correct)
Numerous methods to evaluate user interfaces have been investigated. These methods vary greatly in the attention paid to the users' tasks. Some methods require detailed task descriptions while others ... / graphical displays for information extraction tasks This system

1249.5   Extracting and Converting Data from Semistructured Biological.. - Coupaye, Etzold   (Correct)
One fundamental property underlies most biological databanks: their availability in text format. We propose an approach to retrieve and convert biological data stored in textual flat files into inform... / The complete process of information extraction and conversion is referred br . Data Extraction Information or data extraction is

1248.6   Classification of Rotating Machine Condition using Artificial Neural.. - Mccormick Beng (1997)   (Correct)
This paper describes the use of neural networks as a method for automatically classifying the machine condition from the vibration time series. Several methods for the extraction of features to use as... /

1245.7   Information Gathering as a Resource Bounded Interpretation Task - Lesser, Horling, Klassner, Raja.. (1997)   (Correct)
This paper describes the rationale, architecture, and preliminary implementation of a next generation information gathering system. The goal of this funded research is to exploit the vast amount of in... / retrieval IR and information extraction IE technologies will

1239.5   Getting Only What You Want: Data Mining and Event Detection Using.. - Unruh, Martin, Perry (1998)   (Correct)
InfoSleuth 1 is an agent-based system for information gathering and analysis tasks, performed over networks of autonomous information sources. A key motivation of the InfoSleuth system is that real ... / of Information Sources Information Extraction Agents Information

1237.4   Rich Schemata for Semistructured Data: Thesis proposal - Bergholz (1999)   (Correct)
Semistructured data is one of the new challenging research areas in the database community. We believe that the underlying problem is that of moving from content-based to structure-based querying. For... / arises in the context of information extraction from the WWW. A main

1236.9   Memory-Based Shallow Parsing - Daelemans, Buchholz, Veenstra (1999)   (Correct)
We present a memory-based learning (MBL) approach to shallow parsing in which POS tagging, chunking, and identification of syntactic relations are formulated as memory-based modules. The experiments r... / in applications such as information extraction and summary generation.

1236.6   Information Extraction Using Hidden Markov Models - Leek (1997)   (Correct)
This thesis shows how to design and tune a hidden Markov model to extract factual information from a corpus of machine-readable English prose. In particular, the thesis presents a HMM that classifies ... / Of California San Diego Information Extraction Using Hidden Markov

1232.9   An Approach to Visual Interaction in Mixed-Initiative Planning - David Pegram Robert (1999)   (Correct)
Researchers in mixed-initiative problem-solving have generally viewed interaction between the user and the system as a form of dialog, which provides an effective unifying framework for multimodal ... / calls design for information extraction.This paper is

1231.4   Automatic Digital Video Production Concepts - Ahanger, Little (1998)   (Correct)
Video production involves conceiving a story, shooting raw video footage, and editing the final piece. Editing involves manually cutting frames and frame sequences from the raw video and composing the... / efficient. The process of information extraction and related issues are

1220.7   Instructable and Adaptive Web Agents that Learn to Retrieve and.. - Eliassi-Rad, Shavlik (2000)   (Correct)
We present a system for rapidly and easily building instructable and selfadaptive Web agents for information-retrieval and information-extraction tasks. Our Wisconsin Adaptive Web Assistant (Wawa) c... / information-retrieval and information-extraction tasks. Our Wisconsin

1220.7   Knowledge Discovery and Data Mining: Towards a Unifying Framework - Fayyad, Piatetsky-Shapiro, Smyth (1996)   (Correct)
This paper presents a first step towards a unifying framework for Knowledge Discovery in Databases. We describe links between data mining, knowledge discovery, and other related fields. We then define... / data mining knowledge extraction information discovery information

1219.4   Using Machine Learning for Assigning Indices to Textual Cases - Brüninghaus, Ashley   (Correct)
This paper reports preliminary work on developing methods automatically to index cases described in text so that a case-based reasoning system can reason with them. The goal is to classify the text of... / in many fields within AI. Information Extraction IE Cowie Lehnert

1218.2   Relational Learning of Pattern-Match Rules for Information Extraction - Califf, Mooney (1997)   (Correct)
Information extraction systems process natural language documents and locate a specific set of relevant items. Given the recent success of empirical or corpusbased approaches in other areas of natura... / of Pattern-Match Rules for Information Extraction Mary Elaine Califf and

1213.4   ILP: Just Do It - Page (2000)   (Correct)
Inductive logic programming (ILP) is built on a foundation laid by research in other areas of computational logic. But in spite of this strong foundation, at 10 years of age ILP now faces a number... / information retrieval and information extraction. Arguably natural

1211.3   Adaptive Information Extraction from Online Messages - Höfferer, Knaus, Winiwarter (1994)   (Correct)
An information filtering system is described which extracts e-mail messages from on-line resources. The proposed solution applies (1) linguistic analysis to obtain consistent representations of the co... / Adaptive Information Extraction from Online Messages

1210.8   Assessing Software Libraries by Browsing Similar Classes, Functions.. - Michail, Notkin (1999)   (Correct)
Comparing and contrasting a set of software libraries is useful for reuse related activities such as selecting a library from among several candidates or porting an application from one library to ano... / Measure In Section . Information Extraction In This Section We

1207.6   The MATE Workbench - an annotation tool for XML coded speech corpora - McKelvie, Isard, Mengel.. (2000)   (Correct)
This paper describes the design and implementation of the MATE workbench, a program which provides support for the annotation of speech and text. It provides facilities for exible display and editing ... / of training corpora for information extraction. Dierent annotation

1202.2   Information Extraction from the Web - May, Lausen (2000)   (Correct)
The goal of information extraction from the Web is to provide an integrated view on data from autonomous, heterogeneous information sources. The main problem with current wrapper /mediator approache... / Information Extraction from the Web Wolfgang br automatical matching-based wrapper generation for data-rich and

1195.4   Information extraction for semi-structured documents - Smith, Lopez (1997)   (Correct)
this paper constitutes a suitable basis for building an effective solution to extracting information from semi-structured documents for two principal reasons. First, it provides an extensible architec... / Information extraction for semi-structured

1191.1   Steerable Filters and Cepstral Analysis for Optical Flow Calculation.. - Ioannis Rekleitis (1996)   (Correct)
This paper considers the explicit use of motion blur to compute the Optical Flow. In the past, many algorithms have been proposed for estimating the relative velocity from one or more images. The moti... /

1187.6   Knowledge-Lean Coreference Resolution and its Relation to Textual.. - Harabagiu, Maiorano (1999)   (Correct)
In this paper we present a new empirical method for coreference resolution, implemented in the COCKTAIL system. The results of COCKTAIL are used for lightweight abduction of cohesion and coherence str... / not only for supporting Information Extraction IE the central task of

1186.9   Adaptation To The User's Tasks - Höök (1995)   (Correct)
Adapting explanations to users with varying background knowledge and abilities is a difficult task: the explanation content, style, amount of details, terms used, etc. may be affected in various ways.... / both with navigation and information extraction. Apart from the learning

1186.4   A Machine Learning Approach to Building Domain-Specific Search Engines - McCallum, Nigam, Rennie, Seymore (1999)   (Correct)
Domain-specific search engines are becoming increasingly popular because they offer increased accuracy and extra features not possible with general, Web-wide search engines. Unfortunately, they are al... / text classification and information extraction that enables efficient br data. Information Extraction Information extraction is concerned

1186.0   Morphological Cues for Lexical Semantics - Light (1996)   (Correct)
Most natural language processing tasks require lexical semantic information. Automated acquisition of this information would thus increase the robustness and portability of NLP systems. This paper des... / front end to a database information extraction machine translation and

1184.9   Process Model Reuse to Promote Organizational Learning in Software.. - Ellmer, Merkl, Quirchmayr, Tjoa (1996)   (Correct)
Software development often suffers from well-known problems as for example wrong schedules and cost estimations, low productivity, and low product quality. In order to overcome these problems we sugge... / the various models. Such an information extraction has to produce a mutually

1176.0   Template-based Information Extraction from Tree-structured HTML.. - Yih (1997)   (Correct)
iii List of Figures vii List of Tables ix Chapter 1 Introduction 1 1.1 Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 1.2 The Importance of Information Extraction on Agent Technol... / Template-based Information Extraction from Tree-structured HTML

1170.1   Finding Factors - Learning to Classify Case Opinions under Abstract.. - Brüninghaus, Ashley   (Correct)
Fact Categories Stefanie Bruninghaus and Kevin D. Ashley University of Pittsburgh Learning Research and Development Center, Intelligent Systems Program and School of Law Pittsburgh, PA 15260 steffi+... / ontology. Similarly Information Extraction Cowie Lehnert a

1166.5   Using Decision Trees for Coreference Resolution - Mccarthy (1995)   (Correct)
This paper describes resolve, a system that uses decision trees to learn how to classify coreferent phrases in the domain of business joint ventures. An experiment is presented in which the performanc... / The goal of an Information Extraction IE system is to

1162.7   Combining Error-Driven Pruning and Classification for Partial Parsing - Cardie, Mardis, Pierce (1999)   (Correct)
We present a new approach to partial parsing of natural language texts that relies on machine learning methods. The approach combines corpus-based grammar induction with a very simple pattern-matching... / applications including information extraction phrase identification in

1155.3   A methodology for building information agents - Gao, Sterling (1998)   (Correct)
Information agents are increasingly being used for efficient and precise information retrieval from the Internet. Most of them are handcrafted from scratch and can not easily be adapted to other searc... / different from traditional Information Extraction IE that works on natural br agents and semi-automatic wrapper generation Research has also

1147.3   Informedia - Search and Summarization in the Video Medium - Wactlar (2000)   (Correct)
The Informedia system provides "full-content" search and retrieval of current and past TV and radio news and documentary broadcasts. The system implements a fully automatic intelligent process to enab... / goals fully automated information extraction and full-content

1145.9   A Learning Approach to Shallow Parsing - Muñoz, Punyakanok, Roth, Zimak (1999)   (Correct)
A SNoW based learning approach to shallow parsing tasks is presented and studied experimentally. The shallow parsing method suggested learns to identify syntactic patterns by combining simple predic... / applications including information extraction and text summarization

CiteSeer - citeseer.org - Terms of Service - Privacy Policy - Copyright © 1997-2002 NEC Research Institute