Abstract. The paper is concerned with the role of conceptual representations in access to information, as for example, from the World Wide Web. It contrasts two quite different traditions for doing this: Information Retrieval (IR) and more recently Information Extraction (IE), a development of the natural language processing tradition within Artificial Intelligence (AI). The former has been statistical in nature and largely representation-free (though we discuss exceptions), while the latter has been based on representations making use of ontologies and lexicons in semantics and grammars in syntax. However, this distinction has been eroded by the growth in recent years of machine learning methods in IE, which have attempted to match IE performance but with methods less committed to representations: some have no representations, and some seek to learn them automatically from cases of their assignment. We discuss ways of resolving this division of approaches, a deep and historical issue about the ultimate role of representations in information access. We suggest that modes of use of the Web (e.g. the use of short questions by real users rather than the long artificial `queries ' that statistical methods require) will tend to favour representational methods. We then discuss the crucial example of question-answering in a web environment of information access, as exemplified in the recent TREC competition track on question answering, and suggest that, although indecisive at the moment, this is an ideal forum in which the old issue of conceptual representations may be settled. 1
|
484
|
Understanding Computers and Cognition: A New Foundation for Design, Ablex Publishing
– Winograd, Flores
- 1986
|
|
394
|
A statistical approach to machine translation
– Brown, Cocke, et al.
- 1990
|
|
231
|
Unsupervised Word Sense Disambiguation Rivaling Supervised Methods
– Yarowsky
- 1995
|
|
230
|
Understanding Natural Language
– Winograd
- 1972
|
|
228
|
Word Sense Disambiguation Using Statistical Models of Roget's Categories Trained on Large Corpora
– Yarowsky
- 1992
|
|
210
|
Some advances in transformationbased part of speech tagging
– Brill
- 1994
|
|
209
|
Nymble: A high performance learning name-finder
– Bikel, M, et al.
- 1997
|
|
94
|
Information extraction: Techniques and challenges
– Grishman
- 1997
|
|
91
|
Empirical methods in information extraction
– Cardie
- 1997
|
|
80
|
A statistical approach to mechanized encoding and searching of literary information
– Luhn
- 1957
|
|
77
|
FOUL-UP: A Program that Figures Out Meanings of Words from Context
– Granger
- 1977
|
|
76
|
Deterministic part-of-speech tagging with finite-state transducers
– Roche, Schabes
- 1997
|
|
60
|
CYC: Using common sense knowledge to overcome brittleness and knowledge acquisition bottlenecks
– Lenat, Prakash, et al.
- 1986
|
|
58
|
Generic Information Extraction System
– HOBBS
- 1999
|
|
44
|
Information Extraction: Beyond Document Retrieval
– Gaizauskas, Wilks
- 1998
|
|
42
|
Retrieval Performance in FERRET: A Conceptual Information Retrieval System
– Mauldin
- 1991
|
|
40
|
University of Massachusetts: Description of the CIRCUS system as used for MUC-4
– Lehnert, Cardie, et al.
- 1992
|
|
29
|
Synonymy and Semantic Classification
– Jones, Karen
- 1986
|
|
29
|
BASEBALL: An automatic question answerer
– Green, Chomsky, et al.
- 1961
|
|
27
|
Automated dictionary construction for information extraction from text
– Riloff, Lehnert
- 1993
|
|
23
|
What is the role of NLP in text retrieval
– Jones, K
- 1999
|
|
22
|
A new comparison between conventional indexing (MEDLARS) and automatic text processing (SMART
– Salton
|
|
20
|
Experiments on incorporating syntactic processing of user queries into a document retrieval strategy
– Smeaton, Rijsbergen
- 1988
|
|
18
|
Automatic template creation for information extraction
– Collier
- 1998
|
|
18
|
A retrieval model based on an extended modal logic and its application to the RIME experimental approach
– Chiaramella, Nie
- 1990
|
|
17
|
Combining weak knowledge sources for sense disambiguation
– Stevenson, Wilks
- 1999
|
|
14
|
Jape: a java annotations patterns engine
– Cunningham, Maynard, et al.
- 2000
|
|
14
|
Using coreference chains for text summarization
– Azzam, Humphreys, et al.
- 1999
|
|
14
|
A conceptual theory of question answering
– Lehnert
- 1986
|
|
10
|
Frames, Semantics and Novelty
– Wilks
- 1979
|
|
10
|
Using inductive logic programming for natural language processing
– Cussens
- 1997
|
|
10
|
Description of the LOLITA System as used for MUC-6
– Morgan, Garigliano, et al.
- 1995
|
|
10
|
More than one sense per discourse
– Krovetz
- 1998
|
|
8
|
A method for refining automatically-discovered lexical relations: Combining weak techniques for stronger results
– Hearst, Grefenstette
- 1992
|
|
8
|
On the equivalence of models of language used in the fields of mechanical translation and information retrieval
– Gross
- 1964
|
|
8
|
Procedures d'analyse semantique appliquees a la documentation scientifique. Gauthier-Villars
– Bely, Borillo, et al.
- 1970
|
|
7
|
Pathfinder Networks: Theory and Applications
– Schvaneveldt
- 1990
|
|
7
|
CRL/Brandeis: The Diderot System
– Cowie, Guthrie, et al.
- 1993
|
|
7
|
Making information extraction more adaptive
– Wilks, Catizone
- 1999
|
|
7
|
Validation of terminological inference in an information extraction task
– Vilain
- 1993
|
|
6
|
Bayesian Networks: A Model of Self-Activated Memory for Evidential Reasoning
– Pearl
- 1985
|
|
6
|
Generalizing automatically generated patterns
– Grishman, Sterling
- 1992
|
|
6
|
Recent advances in inductive logic programming
– Muggleton
- 1994
|
|
6
|
Linguistic processes in the indexing and retrieval of documents. Linguistics 61
– Hutchins
- 1970
|
|
5
|
The application of CLRU's method of semantic analysis to information retrieval. Cambridge Language Research Unit Memo
– Wilks
- 1965
|
|
4
|
der Sloot and A. van den Bosch, TiMBL: Tilburg memory based learner version 1.0
– Daelemans, Zavrel, et al.
- 1998
|
|
3
|
Automatically aquiring conceptual patterns without an annotated corpus
– Riloff, Shoen
- 1995
|
|
3
|
Extracting Information for Business Needs
– Pietrosanti, Graziadio
- 1997
|
|
2
|
Description of the named entity system as used in MUC-7
– Borthwick, Sterling
- 1998
|
|
2
|
Text Searching with Templates
– Wilks
- 1964
|