See this document in CiteSeerX!

A Search Engine for Natural Language Applications (2005)  (Make Corrections)  
Michael Cafarella, Oren Etzioni



  Home/Search   Context   Related

 
View or download:
www2005.org/cdrom/docs/p442.pdf
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  www2005.org/cdrom/contents (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Many modern natural language-processing applications utilize search engines to locate large numbers of Web documents or to compute statistics over the Web corpus. Yet Web search engines are designed and optimized for simple human queries---they are not well suited to support such applications. As a result, these applications are forced to issue millions of successive queries resulting in unnecessary search engine load and in slow applications with limited scalability. In response, this paper... (Update)

Active bibliography (related documents):   More   All
2.6:   A Search Engine for Natural Language Applications - Cafarella, Etzioni (2005)   (Correct)
0.6:   Efficient Phrase Querying with an Auxiliary Index - Bahle, Williams, Zobel (2002)   (Correct)
0.3:   A Document-Centric Approach to Static Index Pruning in Text.. - Büttcher, Clarke (2006)   (Correct)

Similar documents based on text:   More   All
0.1:   Scaling Question Answering to the Web - Kwok, Etzioni, Weld (2001)   (Correct)
0.1:   Category Translation: Learning to understand information on .. - Perkowitz, Etzioni (1995)   (Correct)
0.1:   Methods for Domain-Independent Information Extraction - From The Web   (Correct)

BibTeX entry:   (Update)

@misc{ cafarella-search,
  author = "Michael Cafarella and Oren Etzioni",
  title = "A Search Engine for Natural Language Applications",
  url = "citeseer.ist.psu.edu/cafarella05search.html" }
Citations (may not include all citations):
372   Modern Information Retrieval - Baeza-Yates, Ribeiro-Neto - 1999
328   Foundations of Statistical Natural Language Processing - Manning, Schutze - 1999
244   Querying the World Wide Web - Mendelzon, Mihalia et al. - 1996
97   Automatic Acquisition of Hyponyms from Large Text Corpora - Hearst - 1992
79   Some Advances in Rule-Based Part of Speech Tagging (context) - Brill - 1994
40   Scaling Question Answering to the Web - Kwok, Etzioni et al. - 2001
27   Data-Intensive Question Answering - Brill, Lin et al. - 2001
23   Lightweight Structured Text Processing - Miller, Myers - 1999
19   Question-Answering by Predictive Annotation (context) - Prager, Brown et al. - 2000
13   Web-scale Information Extraction in KnowItAll - Etzioni, Cafarella et al. - 2004
10   Corpus-based Schema Matching - Madhavan, Bernstein et al. - 2005
10   Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL - Turney - 2001
9   An Analysis of the AskMSR Question-Answering System - Brill, Dumais et al. - 2002
9   Squeal: A Structured Query Language for the Web - Spertus, Stein - 2000
8   Unsupervised Named-Entity Extraction from the Web: An Experi.. (context) - Etzioni, Cafarella et al. - 2005
6   Thumbs Up or Thumbs Down (context) - Turney - 2002
5   A Fast Regular Expression Indexing Engine - Cho, Rajagopalan - 2002
5   Searching with numbers - Agrawal, Srikant - 2002
4   Corpus-Based Knowledge Representation - Halevy, Madhavan - 2003
3   cient Phrase Querying (context) - Williams, Zobel et al. - 1999
3   cient Phrase Querying with an Auxiliary Index (context) - Bahle, Williams et al. - 2002
3   Optimised Phrase Querying and Browsing in Text Databases (context) - Bahle, Williams et al. - 2001

Documents on the same site (http://www.www2005.org/cdrom/contents.htm):   More
Sampling Search-Engine Results - Anagnostopoulos, Broder, Carmel (2005)   (Correct)
Incremental Maintenance for Materialized XPath/XSLT Views - Onizuka, Chan, Michigami, .. (2005)   (Correct)
PageRank as a Function of the Damping Factor - Boldi, Santini, Vigna (2005)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC