21 citations found.
Taher H. Haveliwala, Aristides Gionis, Dan Klein, and Piotr Indyk. Evaluating strategies for similarity search on the web. In Proc. of WWW, pages 432--442, Honolulu, HI, 2002.

This paper is cited in the following contexts:
Search Engine-Crawler Symbiosis - Pant, Bradshaw, Menczer (2003)

.... lead to emergent communities in peer networks [37]. The use of referential text to identify semantic similarity between pages is also not new [12, 2, 17]. In addition, the idea has also been used, for example, to find Web sites [16], categorize pages [4], crawl pages [28], and rank crawled pages [23]. Despite the active use of referential text for a variety of information retrieval tasks, no one has yet demonstrated the effectiveness of this technique for general-purpose search. There is a large and growing body of work on topical or focused crawlers. Starting with the early breadth first [34] ....

TH Haveliwala, A Gionis, D Klein, and P Indyk. Evaluating strategies for similarity search on the Web. In David Lassner, Dave De Roure, and Arun Iyengar, editors, Proc. 11th International World Wide Web Conference. ACM Press, 2002.


Semi-Supervised Evaluation of Search Engines via Semantic Mapping - Menczer (2003)   (1 citation)

....become a useful tool in comparing and evaluating search engine performance, even if search engines are viewed as black boxes. The issue of semantic similarity and its relationship with link and lexical proximity has also been explored recently by Chakrabarti et al. [5, 7] and by Haveliwala et al. [15]. The latter in particular describe an automatic evaluation process closely related to the one proposed here. The final section of this paper discusses the differences and complementarity between the two approaches. 2. SEMANTIC MAPPING Consider two objects p and q. An object could be a page or a ....

....a query (as in query by example retrieval systems) and ranking all other pages, then using semantic similarity data to assess the ranking. Our data supports this approach, assuming the above procedure is repeated with every page used as an example (this is very similar to the method proposed in [15]). Let us define projected precision and recall as follows: $P(s) = \frac{\sum_{p,q:\, \sigma(p,q) \ge s} \sigma_s(p,q)}{|\{p,q : \sigma(p,q) \ge s\}|}$ (4) and $R(s) = \frac{\sum_{p,q:\, \sigma(p,q) \ge s} \sigma_s(p,q)}{\sum_{p,q} \sigma_s(p,q)}$ (5), where $\sigma = \sigma_c$ to evaluate content based ranking and $\sigma = \sigma_l$ to evaluate link based ranking. The precision recall plots in Figure 1 show that ranking by link similarity ....

[Article contains additional citation context not shown here]

T. Haveliwala, A. Gionis, D. Klein, and P. Indyk. Evaluating strategies for similarity search on the Web. In D. Lassner, D. De Roure, and A. Iyengar, editors, Proc. 11th International World Wide Web Conference. ACM Press, 2002.


Efficient Algorithms for Shared Camera Control - Har-Peled, Koltun, Song.. (2002)

....Otherwise, the optimal camera rectangle would always tend to contain the whole accessible region, with little consideration of user requests. Another similarity function that can be adopted as $S_i(C)$ is the Intersection Over Union (IOU) measure, also known as the Jaccard measure [Bro97, HGKI02, VH00]: $\mathrm{IOU}(C; R_i) = \mathrm{Area}(C \cap R_i) / \mathrm{Area}(C \cup R_i)$. Intersection Over Union is directly related to the Symmetric Difference (SD) measure: $\mathrm{SD}(C; R_i) = (\mathrm{Area}(C \cup R_i) - \mathrm{Area}(C \cap R_i)) / \mathrm{Area}(C \cup R_i) = 1 - \mathrm{IOU}(C; R_i)$. The unnormalized version of the SD measure was used by de Berg et al. [dBCD 98] to assess the dissimilarity of two ....
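
The relation between the two measures quoted above can be illustrated with a minimal sketch. It assumes axis-aligned rectangles, which the excerpt does not state explicitly; the names `Rect`, `iou`, and `symmetric_difference` are hypothetical and are not taken from the cited paper.

```python
from typing import NamedTuple


class Rect(NamedTuple):
    """Axis-aligned rectangle (an assumption made for this sketch)."""
    x_min: float
    y_min: float
    x_max: float
    y_max: float


def area(r: Rect) -> float:
    # Degenerate or inverted rectangles are treated as having zero area.
    return max(0.0, r.x_max - r.x_min) * max(0.0, r.y_max - r.y_min)


def intersection_area(a: Rect, b: Rect) -> float:
    # Overlap along each axis; a negative overlap means the rectangles are disjoint.
    w = min(a.x_max, b.x_max) - max(a.x_min, b.x_min)
    h = min(a.y_max, b.y_max) - max(a.y_min, b.y_min)
    return max(0.0, w) * max(0.0, h)


def iou(a: Rect, b: Rect) -> float:
    # Jaccard measure: Area(a ∩ b) / Area(a ∪ b).
    inter = intersection_area(a, b)
    union = area(a) + area(b) - inter
    # Returning 0.0 for an empty union is a convention chosen for this sketch.
    return inter / union if union > 0 else 0.0


def symmetric_difference(a: Rect, b: Rect) -> float:
    # Normalized symmetric difference: 1 - IOU.
    return 1.0 - iou(a, b)


camera = Rect(0, 0, 2, 2)
request = Rect(1, 1, 3, 3)
print(iou(camera, request), symmetric_difference(camera, request))
```

Here the overlap has area 1 and the union has area 7, so the two printed values sum to 1, as the identity SD = 1 - IOU requires.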

T. Haveliwala, A. Gionis, D. Klein, and P. Indyk. Evaluating strategies for similarity search on the web. In WWW, 2002.


Estimating Rarity and Similarity over Data Stream Windows - Datar, Muthukrishnan (2002)   (5 citations)

....two windowed data streams at time t is defined as $\sigma_t$ = [...]. The problem is to estimate the similarity $\sigma_t$ at any time t. This is the classical notion of similarity between two sets. It has been useful in estimating transitive closures [11], web page duplicate detection [4], and data mining [12, 35], among other things. Rarity $\rho$ finds many data mining applications. For example, consider the data stream of IP addresses that access any online service like a search engine, online store like Amazon, online newspapers etc. The set of rare IP addresses (for the appropriate value of $\alpha$) in a ....
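
As a rough illustration of the quantity described in this excerpt, the sketch below computes set similarity exactly over the last `window` items of two streams. It assumes the "classical notion of similarity between two sets" is the Jaccard coefficient, |A ∩ B| / |A ∪ B|. The function names and the `window` parameter are hypothetical. The cited paper concerns estimating this value, so this brute-force version only shows what is being estimated, not how the paper does it.

```python
from collections import deque
from typing import Hashable, Iterable, List


def jaccard(a: Iterable[Hashable], b: Iterable[Hashable]) -> float:
    # Jaccard coefficient of the distinct elements of a and b.
    set_a, set_b = set(a), set(b)
    union = set_a | set_b
    # Treating two empty sets as identical (1.0) is a convention of this sketch.
    return len(set_a & set_b) / len(union) if union else 1.0


def windowed_similarity(stream_x: Iterable[Hashable],
                        stream_y: Iterable[Hashable],
                        window: int) -> List[float]:
    # Keep only the most recent `window` items of each stream.
    recent_x: deque = deque(maxlen=window)
    recent_y: deque = deque(maxlen=window)
    similarities = []
    for x, y in zip(stream_x, stream_y):
        recent_x.append(x)
        recent_y.append(y)
        # Exact similarity of the two current windows at this time step.
        similarities.append(jaccard(recent_x, recent_y))
    return similarities


print(windowed_similarity([1, 2, 3], [1, 3, 2], window=2))
```

This recomputes both sets at every step, so it costs time and memory proportional to the window size, which is what a sketch-based estimator would try to avoid.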

T. Haveliwala, A. Gionis, D. Klein, and P. Indyk. Evaluating Strategies for Similarity Search on the Web. To appear in Proc. of the Eleventh International World Wide Web Conference, 2002.


A Sketch-based Sampling Algorithm on Sparse Data - Ping Li Pingli

No context found.

Taher H. Haveliwala, Aristides Gionis, Dan Klein, and Piotr Indyk. Evaluating strategies for similarity search on the web. In Proc. of WWW, pages 432--442, Honolulu, HI, 2002.


Scaling Link-Based Similarity Search - Aniel Fogaras Budapest (2005)

No context found.

T. H. Haveliwala, A. Gionis, D. Klein, and P. Indyk. Evaluating strategies for similarity search on the web. Proc. of WWW11, 2002.


A Multilingual Usage Consultation Tool - Based On Internet (2005)

No context found.

T. Haveliwala, A. Gionis, D. Klein, and P. Indyk. Evaluating strategies for similarity search on the web. In WWW Conference, pages 432--442, 2002.


LSH Forest: Self-Tuning Indexes for Similarity Search - Mayank Bawa Bawa (2005)

No context found.

T. Haveliwala, A. Gionis, D. Klein, and P. Indyk. Evaluating strategies for similarity search on the web. In Proc. of WWW, 2002.


LSH Forest: Self-Tuning Indexes for Similarity Search - Bawa, Condie, Ganesan (2005)

No context found.

T. Haveliwala, A. Gionis, D. Klein, and P. Indyk. Evaluating strategies for similarity search on the web. In Proc. of WWW, 2002.


Scaling Link-Based Similarity Search - Fogaras, Racz (2005)

No context found.

T. H. Haveliwala, A. Gionis, D. Klein, and P. Indyk. Evaluating strategies for similarity search on the web. Proc. of WWW11, 2002.


A Multilingual Usage Consultation Tool - Based On Internet (2005)

No context found.

T. Haveliwala, A. Gionis, D. Klein, and P. Indyk. Evaluating strategies for similarity search on the web. In WWW Conference, pages 432--442, 2002.


Finding semantic needles in haystacks of Web text and links - Menczer

No context found.

TH Haveliwala, A Gionis, D Klein, and P Indyk. Evaluating strategies for similarity search on the Web. In David Lassner, Dave De Roure, and Arun Iyengar, editors, Proc. 11th International World Wide Web Conference, New York, NY, 2002. ACM Press.


Efficient Algorithms for Shared Camera Control - Sariel Har-Peled Vladlen (2003)

No context found.

T. Haveliwala, A. Gionis, D. Klein, and P. Indyk. Evaluating strategies for similarity search on the web. In Proceedings of WWW, 2002.


Providing Ranked Relevant Results for Web Database Queries - Nambiar, Kambhampati (2004)

No context found.

T. Haveliwala, A. Gionis, D. Klein, and P. Indyk. Evaluating strategies for similarity search on the web. Proceedings of WWW, Hawaii, USA, May 2002.


Providing Ranked Relevant Results for Web Database Queries - Nambiar, Kambhampati (2004)

No context found.

T. Haveliwala, A. Gionis, D. Klein, and P. Indyk. Evaluating strategies for similarity search on the web. Proceedings of WWW, Hawaii, USA, May 2002.


Answering Imprecise Database Queries: A Novel Approach - Nambiar, Kambhampati (2003)

No context found.

T. Haveliwala, A. Gionis, D. Klein, and P. Indyk. Evaluating strategies for similarity search on the web. Proceedings of WWW, Hawaii, USA, May 2002.


Mining Approximate Functional Dependencies and Concept.. - Nambiar, Kambhampati (2004)

No context found.

T. Haveliwala, A. Gionis, D. Klein, and P. Indyk. Evaluating strategies for similarity search on the web. Proceedings of WWW, Hawaii, USA, May 2002.


SEWeP: Using Site Semantics and a Taxonomy to.. - Eirinaki, Lampos, .. (2003)

No context found.

T.H. Haveliwala, A. Gionis, D. Klein, P. Indyk, Evaluating Strategies for Similarity Search on the Web, in Proc. of WWW11, Hawaii, USA, May 2002.


The Anatomy of a Clustering Engine for Web-page Snippets - Ferragina, Gulli (2004)

No context found.

T. Haveliwala, A. Gionis, D. Klein, and P. Indyk. Evaluating strategies for similarity search on the web. In WWW11, 2002.


Using Titles and Category Names from Editor-driven .. - Beitzel, Jensen.. (2003)

No context found.

Haveliwala, T., Gionis, A., Klein, D., and Indyk, P. Evaluating Strategies for Similarity Search on the Web. In Proceedings of WWW'02 (Honolulu, HI, May, 2002), ACM Press.


Comparing and Aggregating Rankings with Ties - Fagin, Kumar, Mahdian.. (2003)   (2 citations)

No context found.

T. H. Haveliwala, A. Gionis, D. Klein, and P. Indyk. Evaluating strategies for similarity search on the web. In Proceedings of the 11th International World Wide Web Conference, 2002.

CiteSeer.IST - Copyright Penn State and NEC