4 citations found. Retrieving documents...
Ricardo A. Baeza-Yates. Introduction to data structures and algorithms related to information retrieval. In William B. Frakes and Ricardo A. Baeza-Yates, editors, Information Retrieval Data Structures & Algorithms, pages 13-27, Prentice-Hall, 1992.

 Home/Search   Document Not in Database   Summary   Related Articles   Check  

This paper is cited in the following contexts:
World Wide Web Search Technologies - Hu   (1 citation)  (Correct)

....useful for crawler 3 construction. To construct an efficient and practical crawler, some other networking tools have to be used. Indexing Software Automatic indexing is the process of algorithmically examining information items to build a data structure that can be quickly searched. Filtering [4] is one of the most important pre processes for indexing. Filtering is a typical transformation in information retrieval, for example to reduce the size of a document, and or standardize it to simplify searching. Traditional search engines utilize the following information, provided by HTML files, ....

....a query. 7 Data Clustering Data clustering is used to improve the search results by dividing the whole data set into data clusters. Each data cluster contains objects of high similarity, and clusters are produced that group documents relevant to the user s query separately from irrelevant ones [4]. Clustering should not be based on the whole Web resource, but on smaller separate query results. In [57] a Suffix Tree Clustering (STC) algorithm based on phrases shared between documents is used to create clusters. Beside clustering the search results, a proposed similarity function has been ....

Ricardo A. Baeza-Yates. Introduction to data structures and algorithms related to information retrieval. In William B. Frakes and Ricardo A. Baeza-Yates, editors, Information Retrieval Data Structures & Algorithms, pages 13-27, Prentice-Hall, 1992.


An Overview of World Wide Web Search Technologies - Hu, Chen, Schmalz, Ritter (2001)   (2 citations)  (Correct)

....The objects could be HTTPs (Hypertext Transfer Protocols) FTPs (File Transfer Protocols) mailto (e mail) news, telnet, etc. Indexing Software Automatic indexing is the process of algorithmically examining information items to build a data structure that can be quickly searched. Filtering [3] is one of the most important pre processes for indexing. Filtering is a typical transformation in information retrieval, for example to reduce the size of a text, and or standardize it to simplify searching. Traditional search engines utilize the following information, provided by HTML files, to ....

....objects of high similarity, and clusters are produced that group documents relevant to the user s query separately from irrelevant ones. Clustering should not be based on the whole Web resource, but on smaller separate query results. Various document clustering algorithms have been proposed in [3, 8, 31]. 5. METASEARCHES None of the current search engines is able to cover the Web comprehensively. Using an individual search engine may miss some critical information provided by other engines. Metasearch engines [9, 14, 19, 29] search several other search engines simultaneously, and present ....

Ricardo A. Baeza-Yates. Introduction to data structures and algorithms related to information retrieval. In William B. Frakes and Richardo A. Baeza-Yates, editors, Information Retrieval Data Structures & Algorithms, pages 13-27, PrenticeHall, 1992.


Information Retrieval on the Web - Kobayashi, Takeda (2000)   (22 citations)  (Correct)

....in depth will be given whenever possible. This having being said, we begin with references to several excellent books which cover a variety of topics in information management and retrieval. 2 They include: Information Retrieval and Hypertext [Agosti, Smeaton 1996] Modern Information Retreival [Baeza Yates, Ribeiro Neto 1999], Text Retreival and Filtering: Analytic Models of Performance [Losee 1998] Natural Language Information Retreival [Strzalkowski 1999] and Managing Gigabytes [Witten, Mo at, Bell 1994] Some older, classical texts which are slightly outdated include: Information Retreival [Frakes, Baeza Yates ....

....Wide Web FAQ 2 , Hamline University 3 , Kuhn s pages 4 (in German) Maire s pages (in French) 5 , Princeton University 6 , U.C. Berkeley 7 , and Yahoo s pages on search engines 8 . The historical development of information retrieval is documented in a number of sources, such as [Baeza Yates, Ribeiro Neto 1999], Cleverdon 1970] Faloutsos, Oard 1995] Salton 1970] and [van Rijsbergen 1979] Historical accounts of the Web and Web search technologies are given in [Berners Lee et al. 1994] Schatz 1997] This paper is organized as follows. In the remainder of this section, we discuss and point to ....

[Article contains additional citation context not shown here]

Baeza-Yates, R., \Introduction to data structures and algorithms related to information retrieval", in Baeza-Yates, R., Ribeiro-Neto, B. (eds.), Modern Information Retreival, ACM Press, New York (1999) 13-27. 29


AbstFinder, A Prototype Natural Language Text Abstraction.. - Goldin, Berry (1994)   (4 citations)  (Correct)

.... Language Processing Work There is a wealth of material representing years of work in automatic indexing, abstraction, and thesaurus generation, all under the rubric of document processing and information retrieval (Rau, 1989; Cavazza, 1992; Damerau, 1993; Salton, 1986; Salton, 1989; Frakes, 1992; Baeza Yates, 1992; Srinivasan, 1992; Fox, 1992) It is interesting to compare our approach, of this paper, to that of this older work. Among this older work, the closest to ours in terms of goals is automatic abstraction, but, all of it deals with automatic identification of key concepts in large collections of ....

Baeza-Yates, R.A. 1992. Introduction to Data Structures and Algorithms Related to Information Retrieval.

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC