| Mauldin, M.L, Leavitt, J.R.R.: Web agent related research at the Center for Machine Translation. Proc. of the ACM Special Interest Group on Networked Information Discovery and Retrieval (ACM-SIGNIDR-V). McLean, VA, USA (1994) |
....as the following: starting at an index, any page reached by following a single outward link is likely to be on the same topic. In other work, referential text has been used as a means of summarizing a document when presenting query results to a user of an information retrieval system. In [10], the authors describe using the text surrounding hyperlinks to Web pages as descriptions of those pages when presenting search results to users of Lycos. Similarly, Citeseer [6] a research paper indexing and retrieval system presents results to queries with a list of citations describing each ....
Mauldin, M. and Leavitt, J. R. R. Web Agent Related Research at the Center for Machine Translation. Proceedings of the ACM Special Interest Group on Networked Information Discovery and Retrieval. 1994.
.... in a single platform and use a well known retrieval model such as the vector space model based on TFIDF algorithm [9] These approaches are however often troubled by massive bottlenecks and unacceptable access delays under highly competitive access situations as in search tools such as Lycos [6] and WebCrawler [7] These troubles become more severe as the amount of information stored in the form of database indices increases. This problem has led to distributed information agents, called meta search services such as IBM InfoMarket 1 and MetaCrawler 2 which distribute the information ....
Mauldin, M., and Leavitt, J. Web agent related research at the Center for Machine Translation, in Proceedings of the ACM SIGNIDR '94, 1994.
....is to use an index to store information about web documents. This approach involves periodic automatic off line searching of the web and gathering of information about all the documents found in these searches. AltaVista [Digital Equipment Corporation, 1997] WebCrawler [Pinkerton, 1994] Lycos [Mauldin and Leavitt, 1994], Infoseek [Randall, 1995; InfoSeek Corporation, 1997] and Inktomi [Brewer and Gauthier, 1997] are the most important examples of applications which use this indexing approach. Most will retrieve the entire contents of a document and index a portion of it. This centralization introduces a ....
....itself. Instead, it relies on a number of indices. MetaCrawler sends a user s request to several search engines in parallel. The search engines it relies on are AltaVista [Digital Equipment Corporation, 1997] Excite [Excite Inc. 1997] Infoseek [Randall, 1995; InfoSeek Corporation, 1997] Lycos [Mauldin and Leavitt, 1994], WebCrawler [Pinkerton, 1994] and Yahoo [Yahoo Inc. 1997] When it receives all the results, MetaCrawler organizes them, ranks them, and returns them to the user. MetaCrawler has several advantages; first, by sending a query to several search engines, there are more opportunities to answer the ....
[Article contains additional citation context not shown here]
Michael Mauldin and John Leavitt. Web-agent related research at the center for machine translation. In Proceedings of the ACM Special Interest Group on Networked Information Discovery and Retrieval, McLean, VA, August 1994.
....search (querying) in some manner (e.g. Christophides and Rizk, 1994, Amann, et al. 1994, Lucarella, et al. 1993, etc. The recent emphasis on index servers on the World Wide Web (WWW) exemplifies the incorporation of powerful search mechanisms to facilitate navigation of large hypertexts (Mauldin and Leavitt, 1994). These trends in IR and hypertext systems are illustrated in Figure 1. The figure suggests that the evolution of database and interface technologies is pushing traditionally disparate application domains closer together in terms of the user s interaction with the system. There is some evidence ....
....onto the source language. 111 8.2.3 The Web Finding information on the web is difficult by using static links alone; this difficulty is due to volume of information available, and the lack of well defined organizing structure. Thus in the last few years, Web index servers have been introduced (Mauldin and Leavitt, 1994). These index servers act as search engines that retrieve Uniform Resource Locators (URLs) for Web pages that match the query. Thus index servers may be used to facilitate navigation through the Web, but only at the cognitive cost of moving to an Index page, typing a query, and then sifting ....
Mauldin, M.L. and Leavitt, J.R.R. (1994) Web Agent Related Research at the Center for Machine Translation. In Proceedings of SIGNIDR '94, McLean, VA. ACM Press.
....a keyword given by a user, and it can also find documents that are conceptually connected to this keyword. Schematically, our system is shown in Figure 4: User MUSAG USAG SAG AG [1] keyword) 3 [3] keywords #Musag [2] usr s keywords [4] #Musag [6] Search Weights [5] Search Weights [7] url 0.5 [8] Learning Weights and url Figure 4: Musag, USAg, SAg, and Ag The user can submit at most three keywords to USAg to run a Musag agent on this concept. Whenever Musag is started for this concept, also a SAg and Ag agents are initialized to cooperate with this specific Musag agent. ....
....assistance to the user who performs the search and do not handle any queries the user might have. Queries are handled by all the indexes and search engines that can be found today on the internet. These indexes include agents which build and update a data base such as Webcrawler [8] and Lycos [7]. None of those agents base their database on semantic methods. Other index based search engines as Yahoo [10] bypassed the semantic issue by presenting the user with a list of predefined domains. These domains enable the user to focus his search, although the user might be interested in a ....
Michael L. Mauldin and John R.R. Leavitt. Web agent related research at the center for machine translation, 1994. Presented at SIGNIDR.
....use this information in Web tools to propagate rewards from interesting pages to those that point to them (also done by LaMacchia [21] and to more heavily weight words in titles, headers, etc. when considering document keyword similarity, a technique used earlier in Lycos by Mauldin and Leavitt [22]. Mauldin has made use of link information by naming a page with the anchor text of hyperlinks to it. Iwazume et al. have preferentially expanded hyperlinks containing keywords relevant to the user s query [16] although O Leary observes that anchor text information can be unreliable [28] ....
....[5] They use this information to propagate rewards from interesting pages to those that point to them (also done by LaMacchia [21] and to more heavily weight words in titles, headers, etc. when considering document keyword similarity, a technique used earlier in Lycos by Mauldin and Leavitt [22]. Mauldin has made use of link information by naming a page with the anchor text of hyperlinks to it. Iwazume et al. have preferentially expanded hyperlinks containing keywords relevant to the user s query [16] although O Leary observes that anchor text information can be unreliable [28] ....
Michael Mauldin and John R. R. Leavitt. Web agent related research at the center for machine translation. In Proceedings of the ACM Special Interest Group on Networked Information Discovery and Retrieval, 1994. http://fuzine.mt.cs.cmu.edu/mlm/signidr94.html.
.... the user can explore the Web, and each page that is displayed is augmented with additional controls that can be selected to collect user ratings on the current page, and to instruct Syskill Webert to learn a profile, suggest which links on the current page are interesting, or to consult LYCOS (Mauldin Leavitt, 1994), a Web search engine, to help find interesting pages. Figure 1 shows an example of a Web page annotated with such controls. The user may rate a page as either hot (two thumbs up) or cold (two thumbs down) and the ratings are saved by Syskill Webert so that the pages may be used as positive or ....
Mauldin, M., & Leavitt, J. (1994). Web agent related research at the center for machine translation. Proceedings of the ACM Special Interest Group on Networked Information Discovery and Retrieval. The MITRE Corporation, McLean, Virgiana.
....locator is stored in four types of search databases (1) citation hypertext, 2) citation URL, 3) HTML titles, and (4) HTML address databases. The search interface makes use of the Unix egrep program to query the catalog, and provides the option for searching each of these databases. Lycos ( ML94] developed at the Center for Machine Translation, Carnegie Mellon University is currently the most popular Web search tool ( Poi95] The Lycos resource locator (called, web explorer) searches the Web every day, building a database of all the Web pages it finds. The index of the catalog is ....
Mauldin, M.L. and Leavitt, J.R. Web agent related research at the center for machine translation, August 1994. SIGNIDR Meeting, McLean, Virginia.
....our desire for an information management system which was personal, ubiquitous, automated, and networked. In many ways we already have IR systems which are ubiquitous for a specific domain, automated, and networked Web search engines. Systems such as Lycos and AltaVista automatically spider[31] the Web, attempting to find every document on the Web and index it. Furthermore, systems such as SavvySearch[9] and the MetaCrawler[40] send queries to multiple Web search engines and aggregate the results to produce a better result set than any individual engine does. So, what is it we mean by ....
M. Mauldin and J. Leavett. Web Agent Related Research at the Center for Machine Translation. Presented at SIGNIDR meeting, August 1994.
....or by retrieving a flat file that the DBMS Administrator has created manually. 7.4. 2 Wrapper to Semi Structured Information Sources The complexity and large scale of the world wide web (WWW) has fueled the development of search tools such as Yahoo [56] the Harvest Web Broker [9] and Lycos [38]. These tools are sufficient for simple keyword based searching but the results contain simple information and are represented as HTML documents which impedes further processing. There is a need for tools which can assist consumers in the merging, the combining, and the sorting of retrieved ....
M. L. Mauldin and J. R. R. Leavitt. Web agent related research at the center for machine translation. Center for Machine Translation, Carnegie Mellon University, August 1994.
....Almaden Research Center, 650 Harry Road, San Jose, CA, U.S.A. 95120 1. The Complexity of Graph Traversal Graph traversal is a fundamental problem in computing, since it is the natural abstraction of many search processes, with applications as diverse as Internet searching (Mauldin and Leavitt [40], Selberg and Etzioni [46] and computer aided verification (Dill, et al. 25] Kurshan [38] In computational complexity theory, graph traversal (or more precisely, st connectivity) is a fundamental problem for an additional reason: understanding the complexity of directed versus undirected ....
M. L. Mauldin and J. R. R. Leavitt. Web-agent related research at the Center for Machine Translation. In Proceedings of the ACM Special Interest Group on Networked Information Discovery and Retrieval (SIGNIDR-94), Aug. 1994.
....producing wrappers to these types of data will require the most effort. 4. 5 Wrappers to Semi Structured Information Sources: An Example The complexity and the large scale of the world wide web (WWW) has fueled the development of search tools such as Yahoo, the Harvest Web Broker [6] and Lycos [22]. These tools are sufficient for simple keyword based searching but the results contain simple information and are represented as HTML documents which impedes further processing. There is a need for tools which can assist consumers in the merging, the combining, and the sorting of retrieved ....
M. L. Mauldin and J. R. R. Leavitt. Web agent related research at the center for machine translation. Center for Machine Translation, Carnegie Mellon University, August 1994.
....e.g. archie. The database itself is usually centrally held, although methods of distributing the data have been investigated, e.g. in the Harvest system. Examples of the systems which implement the searching paradigm for resource discovery include archie[11, 12] the World Wide Web Worm[13] Lycos[14], and ALIWEB[15] These systems can be characterised as follows: ffl Nature of meta information This is typically either manually generated, e.g. IAFA templates automatically generated, e.g. recursive directory listings for archie ffl Data quality The automatically maintained data is ....
M.L Mauldin and J.R.R. Leavitt. Web Agent Related Research at the Center for Machine Translation. Presented at the August 1994 SIGNIDR meeting, August 1994. !URL: http://fuzine.mt.cs.cmu.edu/mlm/signidr94.html ?.
....documents. This approach involves periodic automatic searching of the web and gathering of information about all the documents found in these searches. The information gathered is used to make an index which users can later query for desired information. The WebCrawler [Pinkerton, 1994] Lycos [Mauldin and Leavitt, 1994] , OpenText [Open Text Corporation, 1995] Infoseek [Randall, 1995] and Inktomi [Brewer and Gauthier, 1995] are the most important examples of applications which use this indexing approach. Unfortunately, many of the indexing based tools are inadequate now for resource discovery and the others ....
Michael Mauldin and John Leavitt. Web-agent related research at the center for machine translation. In Proceedings of the ACM Special Interest Group on Networked Information Discovery and Retrieval, McLean, VA, August 1994.
....that determines which of the matches are best. Upon finding these best matches, the search engine usually returns a title, a summary and the URL of the source documents. Many search systems have been implemented, including robot based search engines (WebCrawler (Pinkerton 1994) Lycos (Mauldin Leavitt 1994), AltaVista (AltaVista 1996) etc: list based systems (ALIWEB (ALIWEB 1996) CUI W3 (CUIW3 1996) and proxy search systems that forward a user query to multiple search engines (MetaCrawler (MetaCrawler 1996; Selberg Etzioni 1995) SavvySearch (SavvySearch 1996) concurrently. Users ....
Mauldin, M. L., and Leavitt, J. R. R. 1994. WebAgent Related Research at the Center for Machine Translation. In Proceedings of ACM Special Interest Group on Networked Information Discovery and Retrieval, SIGNIDR.
....of expertise. Each search tool is limited by the sub corpus of documents known to it and by its ability to find relevant documents within that corpus. It has been suggested that search engines should not attempt to thoroughly index the entire Web space, as that will lead to much repeated effort [16]. A user with a specific information need will often need to query several search engines before finding relevant documents. To address the problem of navigating the search engines, we have developed a metasearch agent, called SavvySearch, that directs queries to search engines judged to be most ....
Michael L. Mauldin and John R. R. Leavitt. Web agent related research at the center for machine translation. Presented at the SIGNIDR meeting, August 4, 1994 in McLean, Virginia, August 1994.
....locator is stored in four types of search databases (1) citation hypertext, 2) citation URL, 3) HTML titles, and (4) HTML address databases. The search interface makes use of the Unix egrep program to query the catalog, and provides the option for searching each of these databases. Lycos ([13]) developed at the Center for Machine Translation, Carnegie Mellon University is currently the most popular Web search tool ( 15] The Lycos resource locator (called, web explorer) searches the Web every day, building a database of all the Web pages it finds. The index of the catalog is updated ....
M.L. Mauldin and J.R. Leavitt "Web agent related research at the Center for Machine Translation, " August 1994. SIGNIDR Meeting, McLean, Virginia.
....of a modified Mosaic that spawns a spreading activation of URL retrievals similar in behavior to schooling fish (hence the name) Fish search results are transient, available only to the specific Mosaic user and only for the duration of that execution of Mosaic. 4. 2 Lycos The Lycos system [28] employs a Gnu DBM file to store the information discovered during its exploration. The information stored for a given document includes: the title; headings; the 100 most weighty words; the first 20 lines of the document; and . the size of the document, both in bytes and in words. The ....
Mauldin, M. L. and J. R. R. Leavitt, "Web Agent Related Research at the Center for Machine Translation," Proceedings of the ACM Special Interest Group on Networked Information Discovery and Retrieval (SIGNIDR-94), August 1994
.... networking itself is actually unnecessary when the data and meta information are both to be found on the same server. The acquisition of meta information is currently typically done either by having a gathering process, which periodically visits each of the servers which it tracks [34, 35], or by having the tracked server notify the tracker when the information it is interested in has changed [24, 36] This would seem to be worth investigating as an application of IP multicast. The same issues apply when distribution of the indexing information is considered. Perhaps the most ....
M.L Mauldin and J.R.R. Leavitt. Web Agent Related Research at the Center for Machine Translation. Presented at the August 1994 SIGNIDR meeting, August 1994. !URL: http://fuzine.mt.cs.cmu.edu/mlm/signidr94.html ?.
....references came from the other five services. Skeptical readers may argue that service providers could invest in more resources and provide more comprehensive indices to the web. However, recent studies indicate the rate of Web expansion and change makes a complete index virtually impossible[16]. followed followed Jul. 7 Sept. 30 Sept. 8 30 Lycos 35.43 42.17 WebCrawler 30.76 25.74 InfoSeek 18.55 15.70 Galaxy 17.10 15.60 Open Text n a 14.70 Yahoo 10.67 6.59 Table 2: Market Shares of Followed References This table shows the percentage each service has of the total followed ....
Michael L. Mauldin and John R. R. Leavitt. Web Agent Related Research at the Center for Machine Translation. In Proceedings of SIGNIDR V, McLean, Virginia, August 1994.
....the web is to use an index to store information about web documents. This approach involves periodic automatic searching of the web and gathering of information about all the documents found in these searches. AltaVista [Digital Equipment Corporation, 1996] WebCrawler [Pinkerton, 1994] Lycos [Mauldin and Leavitt, 1994] , Infoseek [Randall, 1995] and Inktomi [Brewer and Gauthier, 1995] are the most important examples of applications which use this indexing approach. WebCrawler can also use its own index to suggest starting points for online searches. The main alternative to resource discovery based on offline ....
Michael Mauldin and John Leavitt. Web-agent related research at the center for machine translation. In Proceedings of the ACM Special Interest Group on Networked Information Discovery and Retrieval, McLean, VA, August 1994.
....it is this weighting information that determines which of the matches are best. Upon finding these, the search engine usually returns a title, a summary and the URL of the source documents. Many search systems have been implemented, including robot based search engines (WebCrawler [18] Lycos [15], AltaVista [2] etc. list based systems (ALIWEB [1] CUI W3 [5] and proxy search systems that forward a user query to multiple search engines (MetaCrawler [16, 23] SavvySearch [22] concurrently. Users retrieving information with WWW search systems must contend with four major issues. ....
Michael L. Mauldin and John R. R. Leavitt. Web-Agent Related Research at the Center for Machine Translation. In Proceedings of ACM Special Interest Group on Networked Information Discovery and Retrieval, SIGNIDR, August 1994.
No context found.
Mauldin, M.L, Leavitt, J.R.R.: Web agent related research at the Center for Machine Translation. Proc. of the ACM Special Interest Group on Networked Information Discovery and Retrieval (ACM-SIGNIDR-V). McLean, VA, USA (1994)
No context found.
M.L. Mauldin and J.R. Leavitt "Web agent re- lated research at the Center for Machine Transla- tion," August 1994. SIGNIDR Meeting, McLean, Virginia.
No context found.
Mauldin, M. & Leavitt, J. (1994). Web Agent Related Research at the Center for Machine Translation Proceedings of the ACM Special Interest Group on Networked Information Discovery and Retrieval.
First 50 documents
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC