Taher H. Haveliwala, Aristides Gionis, Dan Klein, and Piotr Indyk. Evaluating strategies for similarity search on the web. In Proc. of WWW, pages 432--442, Honolulu, HI, 2002.
....lead to emergent communities in peer networks [37]. The use of referential text to identify semantic similarity between pages is also not new [12, 2, 17]. In addition, the idea has also been used, for example, to find Web sites [16], categorize pages [4], crawl pages [28], and rank crawled pages [23]. Despite the active use of referential text for a variety of information retrieval tasks, no one has yet demonstrated the effectiveness of this technique for general-purpose search. There is a large and growing body of work on topical or focused crawlers. Starting with the early breadth first [34] ....
TH Haveliwala, A Gionis, D Klein, and P Indyk. Evaluating strategies for similarity search on the Web. In David Lassner, Dave De Roure, and Arun Iyengar, editors, Proc. 11th International World Wide Web Conference. ACM Press, 2002.
....become a useful tool in comparing and evaluating search engine performance, even if search engines are viewed as black boxes. The issue of semantic similarity and its relationship with link and lexical proximity has also been explored recently by Chakrabarti et al. [5, 7] and by Haveliwala et al. [15]. The latter in particular describe an automatic evaluation process closely related to the one proposed here. The final section of this paper discusses the differences and complementarity between the two approaches. 2. SEMANTIC MAPPING Consider two objects p and q. An object could be a page or a ....
....a query (as in query by example retrieval systems) and ranking all other pages, then using semantic similarity data to assess the ranking. Our data supports this approach, assuming the above procedure is repeated with every page used as an example (this is very similar to the method proposed in [15]). Let us define projected precision and recall as follows: P (s) q : #(p, q) s (4) R(s) p,q #s (p, q) (5) where # = #c to evaluate content based ranking and # = #l to evaluate link based ranking. The precision recall plots in Figure 1 show that ranking by link similarity ....
[Article contains additional citation context not shown here]
T. Haveliwala, A. Gionis, D. Klein, and P. Indyk. Evaluating strategies for similarity search on the Web. In D. Lassner, D. De Roure, and A. Iyengar, editors, Proc. 11th International World Wide Web Conference. ACM Press, 2002.
....Otherwise, the optimal camera rectangle would always tend to contain the whole accessible region, with little consideration of user requests. Another similarity function that can be adopted as $S_i(C)$ is the Intersection Over Union (IOU) measure, also known as the Jaccard measure [Bro97, HGKI02, VH00]: $\mathrm{IOU}(C; R_i) = \mathrm{Area}(C \cap R_i) / \mathrm{Area}(C \cup R_i)$. Intersection Over Union is directly related to the Symmetric Difference (SD) measure: $\mathrm{SD}(C; R_i) = (\mathrm{Area}(C \cup R_i) - \mathrm{Area}(C \cap R_i)) / \mathrm{Area}(C \cup R_i) = 1 - \mathrm{IOU}(C; R_i)$. The unnormalized version of the SD measure was used by de Berg et al. [dBCD 98] to assess the dissimilarity of two ....
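The relationship between IOU and the normalized symmetric difference mentioned in the excerpt above can be checked numerically. The following is a minimal sketch, assuming axis-aligned rectangles; the `Rect` type and the function names are hypothetical and are not taken from the citing paper.

```python
from typing import NamedTuple


class Rect(NamedTuple):
    """Axis-aligned rectangle (hypothetical helper type for this sketch)."""
    x_min: float
    y_min: float
    x_max: float
    y_max: float


def area(r: Rect) -> float:
    # Degenerate or inverted rectangles count as empty.
    return max(0.0, r.x_max - r.x_min) * max(0.0, r.y_max - r.y_min)


def intersection_area(a: Rect, b: Rect) -> float:
    # Overlap along each axis, clamped at zero when the rectangles are disjoint.
    w = min(a.x_max, b.x_max) - max(a.x_min, b.x_min)
    h = min(a.y_max, b.y_max) - max(a.y_min, b.y_min)
    return max(0.0, w) * max(0.0, h)


def iou(c: Rect, r: Rect) -> float:
    # Jaccard measure: Area(C ∩ R) / Area(C ∪ R).
    inter = intersection_area(c, r)
    union = area(c) + area(r) - inter
    return inter / union if union > 0 else 0.0


def symmetric_difference(c: Rect, r: Rect) -> float:
    # Normalized SD: (Area(C ∪ R) - Area(C ∩ R)) / Area(C ∪ R) = 1 - IOU.
    return 1.0 - iou(c, r)
```

For two 2×2 squares offset by one unit in each direction, the intersection has area 1 and the union has area 7, so `iou` gives 1/7 and `symmetric_difference` gives 6/7.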
T. Haveliwala, A. Gionis, D. Klein, and P. Indyk. Evaluating strategies for similarity search on the web. In WWW, 2002.
....two windowed data streams at time t is defined as σ_t = [...]. The problem is to estimate the similarity σ_t at any time t. This is the classical notion of similarity between two sets. It has been useful in estimating transitive closures [11], web page duplicate detection [4], and data mining [12, 35], among other things. Rarity ρ finds many data mining applications. For example, consider the data stream of IP addresses that access any online service like a search engine, an online store like Amazon, online newspapers, etc. The set of rare IP addresses (for the appropriate value of α) in a ....
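The excerpt above elides the definition itself and only calls it the classical notion of similarity between two sets. Assuming that this means the Jaccard coefficient, the quantity can be computed exactly over sliding windows as in the sketch below. This is a brute-force illustration, not the estimation technique of the citing paper, and the names `jaccard` and `windowed_similarity` are hypothetical.

```python
from collections import deque
from typing import Hashable, Iterable, Iterator


def jaccard(a: set, b: set) -> float:
    # |A ∩ B| / |A ∪ B|, taken as 0 when both sets are empty.
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def windowed_similarity(
    stream_a: Iterable[Hashable],
    stream_b: Iterable[Hashable],
    window: int,
) -> Iterator[float]:
    # Keep only the last `window` items of each stream (assumed window model).
    win_a: deque = deque(maxlen=window)
    win_b: deque = deque(maxlen=window)
    for x, y in zip(stream_a, stream_b):
        win_a.append(x)
        win_b.append(y)
        # Exact similarity of the distinct items currently in each window.
        yield jaccard(set(win_a), set(win_b))
```

This version rebuilds both sets at every time step, so it costs time and memory proportional to the window size. It is useful only as a reference against which an approximate estimator could be compared.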
T. Haveliwala, A. Gionis, D. Klein, and P. Indyk. Evaluating Strategies for Similarity Search on the Web. To appear in Proc. of the Eleventh International World Wide Web Conference, 2002.
T. H. Haveliwala, A. Gionis, D. Klein, and P. Indyk. Evaluating strategies for similarity search on the web. Proc. of WWW11, 2002.
T. Haveliwala, A. Gionis, D. Klein, and P. Indyk. Evaluating strategies for similarity search on the web. In WWW Conference, pages 432--442, 2002.
T. Haveliwala, A. Gionis, D. Klein, and P. Indyk. Evaluating strategies for similarity search on the web. In Proc. of WWW, 2002.
TH Haveliwala, A Gionis, D Klein, and P Indyk. Evaluating strategies for similarity search on the Web. In David Lassner, Dave De Roure, and Arun Iyengar, editors, Proc. 11th International World Wide Web Conference, New York, NY, 2002. ACM Press.
T. Haveliwala, A. Gionis, D. Klein, and P. Indyk. Evaluating strategies for similarity search on the web. In Proceedings of WWW, 2002.
T. Haveliwala, A. Gionis, D. Klein, and P. Indyk. Evaluating strategies for similarity search on the web. Proceedings of WWW, Hawaii, USA, May 2002.
T.H. Haveliwala, A. Gionis, D. Klein, P. Indyk, Evaluating Strategies for Similarity Search on the Web, in Proc. of WWW11, Hawaii, USA, May 2002.
T. Haveliwala, A. Gionis, D. Klein, and P. Indyk. Evaluating strategies for similarity search on the web. In WWW11, 2002.
Haveliwala, T., Gionis, A., Klein, D., and Indyk, P. Evaluating Strategies for Similarity Search on the Web. In Proceedings of WWW'02 (Honolulu, HI, May, 2002), ACM Press.
T. H. Haveliwala, A. Gionis, D. Klein, and P. Indyk. Evaluating strategies for similarity search on the web. In Proceedings of the 11th International World Wide Web Conference, 2002.