G. Attardi, A. Gulli, and F. Sebastiani. Theseus: categorization by context. In WWW8, 1999.
.... have highlighted how mediation and referral can lead to emergent communities in peer networks [37]. The use of referential text to identify semantic similarity between pages is also not new [12, 2, 17]. In addition, the idea has also been used, for example, to find Web sites [16], categorize pages [4], crawl pages [28], and rank crawled pages [23]. Despite the active use of referential text for a variety of information retrieval tasks, no one has yet demonstrated the effectiveness of this technique for general-purpose search. There is a large and growing body of work on topical or focused ....
G Attardi, A Gulli, and F Sebastiani. Theseus: Categorization by context. In Proc. 8th International World Wide Web Conference, 1999.
....ones) are in fact orthogonal to each other. The third approach, which relies on text near anchors, referred to as the anchor window [9], appears most useful for the Web similarity search task. Indeed, the use of anchor windows has been previously considered for a variety of other Web IR tasks [2, 1, 9, 11]. The anchor window often constitutes a hand-built summary of the target document [1], collecting both explicit hand summarization and implicit hand classification present in referring documents. We expect that when aggregating over all inlinks, the frequency of relevant terms will dominate the ....
G. Attardi, A. Gulli, and F. Sebastiani. Theseus: Categorization by context. Proceedings of WWW8, 1999.
....sessions, it is necessarily of limited coverage. The database in [9], for example, contains very few of the queries seen in the Lucent logs. Many techniques exist for automatically determining the category of a document based on its content (e.g. [18] and its references, [10] and its references, [1]) and the in and out links of the document (e.g. [7, 12]). We are currently investigating techniques to include content in our clustering algorithms, with the advantage that by working with the proxy cache we do not require extra spidering. Another approach to document categorization is ....
G. Attardi, A. Gulli, and F. Sebastiani. Theseus: categorization by context. In Proceedings of the Eighth International World Wide Web Conference (WWW8), Toronto, Canada, May 1999. Presented in the poster session.
No context found.
G. Attardi, A. Gulli, and F. Sebastiani. Theseus: categorization by context. In WWW8, 1999.
.... (as exemplified by e.g. Yahoo™) and keyword-based document querying (as exemplified by e.g. AltaVista™). In the framework of the Eurosearch Project, we are addressing the problem of the automatic categorisation of Web documents and sites within Yahoo-like hierarchies of categories [1,4]. The tools resulting from this project allow to overcome a major bottleneck in today's Web information organisation, i.e. the need for manual categorisation of Web documents and sites; this latter modality is inadequate, in view of the ever-increasing size of the Web and of its ever-evolving ....
G. Attardi, A. Gulli, and F. Sebastiani. Theseus: categorization by context. In C. Hutchison and G. Lanzarone, editors, Proceedings of THAI'99, European Symposium on Telematics, Hypermedia and Artificial Intelligence, Varese, IT, 1999. Forthcoming.
No context found.
Giuseppe Attardi, Antonio Gulli, and Fabrizio Sebastiani. Theseus: categorization by context. In Chris Hutchison and Gaetano Lanzarone, editors, Proceedings of THAI'99, European Symposium on Telematics, Hypermedia and Artificial Intelligence, Varese, IT, 1999. Forthcoming.
No context found.
Giuseppe Attardi, Antonio Gulli, and Fabrizio Sebastiani. Theseus: categorization by context. In Proceedings of WWW'99, 8th International Conference on the World Wide Web, Toronto, CA, 1999. Poster Presentation. Forthcoming.
....for humans, to deal with collections typically very large. Automatic Categorization involves knowledge at two levels: knowledge of the meaning of the category, to understand what pertains to it, and knowledge of the meaning of a document, to decide whether or not it matches a category [3]. An effective and robust representation for meanings is hence desirable, an achievement of which would grant the chance to build a highly trustworthy and effective automatic classifier. Studies on Automatic Categorization have typically exploited statistical and probabilistic peculiarities of ....
....found at the beginning of scientific articles, would be as effective, when it comes to classify a document by the topics it treats. If not all information given by the actual document is present, the main topic is for sure. Similar representations are used by Attardi et al. in the Theseus system [3], in which categorization is done by considering the context a link to a document appears in, rather than the document itself. The assumption made by Attardi et al. is that the information around the link is sufficient for the user to determine the relevance of the linked document, and can then be ....
[Article contains additional citation context not shown here]
Giuseppe Attardi, Antonio Gulli, and Fabrizio Sebastiani. Theseus: Categorization by Context. 8th World Wide Web Conference, Toronto, Canada, 1999.
No context found.
Giuseppe Attardi, Antonio Gulli, and Fabrizio Sebastiani. Theseus: categorization by context. In Poster Proceedings of WWW'99, 8th International Conference on the World Wide Web, pages 136--137, Toronto, CA, 1999.
No context found.
G. Attardi, A. Gulli, and F. Sebastiani. Theseus: Categorization by context. Proceedings of WWW8, 1999.
No context found.
G. Attardi, A. Gulli, and F. Sebastiani. Theseus: categorization by context. In WWW'99, 1999.
No context found.
G Attardi, A Gulli, and F Sebastiani. Theseus: Categorization by context. In Proc. 8th International World Wide Web Conference, 1999.
No context found.
G. Attardi, A. Gulli, and F. Sebastiani. Theseus: Categorization by context. Proceedings of WWW8, 1999.
No context found.
Giuseppe Attardi, Antonio Gulli, and Fabrizio Sebastiani. Theseus: categorization by context. In Poster Proceedings of WWW'99, 8th International Conference on the World Wide Web, pages 136-137, Toronto, CA, 1999.
No context found.
Giuseppe Attardi, Antonio Gulli, and Fabrizio Sebastiani. Theseus: categorization by context. In Poster Proceedings of WWW'99, 8th International Conference on the World Wide Web, pages 136--137, Toronto, CA, 1999.