MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  IR and AI: traditions of representation and anti-representation in information processing Yorick WILKS

Download:
Download as a PDF | Download as a PS
unknown authors
http://www.dcs.shef.ac.uk/~yorick/papers/irandai.ps
Add To MetaCart

Abstract:

Abstract. The paper is concerned with the role of conceptual representations in access to information, as for example, from the World Wide Web. It contrasts two quite different traditions for doing this: Information Retrieval (IR) and more recently Information Extraction (IE), a development of the natural language processing tradition within Artificial Intelligence (AI). The former has been statistical in nature and largely representation-free (though we discuss exceptions), while the latter has been based on representations making use of ontologies and lexicons in semantics and grammars in syntax. However, this distinction has been eroded by the growth in recent years of machine learning methods in IE, which have attempted to match IE performance but with methods less committed to representations: some have no representations, and some seek to learn them automatically from cases of their assignment. We discuss ways of resolving this division of approaches, a deep and historical issue about the ultimate role of representations in information access. We suggest that modes of use of the Web (e.g. the use of short questions by real users rather than the long artificial `queries ' that statistical methods require) will tend to favour representational methods. We then discuss the crucial example of question-answering in a web environment of information access, as exemplified in the recent TREC competition track on question answering, and suggest that, although indecisive at the moment, this is an ideal forum in which the old issue of conceptual representations may be settled. 1

Citations

484 Understanding Computers and Cognition: A New Foundation for Design, Ablex Publishing – Winograd, Flores - 1986
394 A statistical approach to machine translation – Brown, Cocke, et al. - 1990
231 Unsupervised Word Sense Disambiguation Rivaling Supervised Methods – Yarowsky - 1995
230 Understanding Natural Language – Winograd - 1972
228 Word Sense Disambiguation Using Statistical Models of Roget's Categories Trained on Large Corpora – Yarowsky - 1992
210 Some advances in transformationbased part of speech tagging – Brill - 1994
209 Nymble: A high performance learning name-finder – Bikel, M, et al. - 1997
94 Information extraction: Techniques and challenges – Grishman - 1997
91 Empirical methods in information extraction – Cardie - 1997
80 A statistical approach to mechanized encoding and searching of literary information – Luhn - 1957
77 FOUL-UP: A Program that Figures Out Meanings of Words from Context – Granger - 1977
76 Deterministic part-of-speech tagging with finite-state transducers – Roche, Schabes - 1997
60 CYC: Using common sense knowledge to overcome brittleness and knowledge acquisition bottlenecks – Lenat, Prakash, et al. - 1986
58 Generic Information Extraction System – HOBBS - 1999
44 Information Extraction: Beyond Document Retrieval – Gaizauskas, Wilks - 1998
42 Retrieval Performance in FERRET: A Conceptual Information Retrieval System – Mauldin - 1991
40 University of Massachusetts: Description of the CIRCUS system as used for MUC-4 – Lehnert, Cardie, et al. - 1992
29 Synonymy and Semantic Classification – Jones, Karen - 1986
29 BASEBALL: An automatic question answerer – Green, Chomsky, et al. - 1961
27 Automated dictionary construction for information extraction from text – Riloff, Lehnert - 1993
23 What is the role of NLP in text retrieval – Jones, K - 1999
22 A new comparison between conventional indexing (MEDLARS) and automatic text processing (SMART – Salton
20 Experiments on incorporating syntactic processing of user queries into a document retrieval strategy – Smeaton, Rijsbergen - 1988
18 Automatic template creation for information extraction – Collier - 1998
18 A retrieval model based on an extended modal logic and its application to the RIME experimental approach – Chiaramella, Nie - 1990
17 Combining weak knowledge sources for sense disambiguation – Stevenson, Wilks - 1999
14 Jape: a java annotations patterns engine – Cunningham, Maynard, et al. - 2000
14 Using coreference chains for text summarization – Azzam, Humphreys, et al. - 1999
14 A conceptual theory of question answering – Lehnert - 1986
10 Frames, Semantics and Novelty – Wilks - 1979
10 Using inductive logic programming for natural language processing – Cussens - 1997
10 Description of the LOLITA System as used for MUC-6 – Morgan, Garigliano, et al. - 1995
10 More than one sense per discourse – Krovetz - 1998
8 A method for refining automatically-discovered lexical relations: Combining weak techniques for stronger results – Hearst, Grefenstette - 1992
8 On the equivalence of models of language used in the fields of mechanical translation and information retrieval – Gross - 1964
8 Procedures d'analyse semantique appliquees a la documentation scientifique. Gauthier-Villars – Bely, Borillo, et al. - 1970
7 Pathfinder Networks: Theory and Applications – Schvaneveldt - 1990
7 CRL/Brandeis: The Diderot System – Cowie, Guthrie, et al. - 1993
7 Making information extraction more adaptive – Wilks, Catizone - 1999
7 Validation of terminological inference in an information extraction task – Vilain - 1993
6 Bayesian Networks: A Model of Self-Activated Memory for Evidential Reasoning – Pearl - 1985
6 Generalizing automatically generated patterns – Grishman, Sterling - 1992
6 Recent advances in inductive logic programming – Muggleton - 1994
6 Linguistic processes in the indexing and retrieval of documents. Linguistics 61 – Hutchins - 1970
5 The application of CLRU's method of semantic analysis to information retrieval. Cambridge Language Research Unit Memo – Wilks - 1965
4 der Sloot and A. van den Bosch, TiMBL: Tilburg memory based learner version 1.0 – Daelemans, Zavrel, et al. - 1998
3 Automatically aquiring conceptual patterns without an annotated corpus – Riloff, Shoen - 1995
3 Extracting Information for Business Needs – Pietrosanti, Graziadio - 1997
2 Description of the named entity system as used in MUC-7 – Borthwick, Sterling - 1998
2 Text Searching with Templates – Wilks - 1964