Wessel Kraaij, Thijs Westerveld and Djoerd Hiemstra. The importance of prior probabilities for entry page search. In Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, pages 27--34. ACM Press, 2002.
....task. In the final version of this paper we will include a reference to recent work [3] on TREC-style experiments on a different approach to mixing the effects of anchor text, content, titles, and other ranking schemes. We believe that this remains a fruitful area of research for the future. In [8], they suggested a method by which it is possible to combine a language model for anchor text with a language model for content in a system based on statistical language models. The reasoning is that anchor texts and the body texts (content only) provide two very different textual ....
.... $(1-\lambda)\,P(T_i) + \lambda\,P(T_i \mid D)$. The quantity $P(T_i)$ is estimated using the distribution of terms in the collection and the observation of queries made on the collection. The crucial difficulty in this approach is to estimate $P(T_i \mid D)$, namely the relevance of a term to a given document. In [8] it was suggested to mix two models, one for content and one for anchor text, using $P(T_i \mid D) = \mu\,P_{\mathrm{content}}(T_i \mid D) + (1-\mu)\,P_{\mathrm{anchor}}(T_i \mid D)$. If a term appears in the anchor text of a document, then this term may be a likely candidate for inclusion in the model of similar documents, and this ....
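The mixture described in this context can be illustrated with a minimal sketch. It is not the implementation from [8] or from the citing paper: the weight names `lam` and `mu`, the maximum-likelihood estimates from raw term counts, and the product over query terms are all assumptions made for illustration.

```python
import math


def ml_estimate(term, counts):
    """Maximum-likelihood estimate of a term from a bag of counts (assumed estimator)."""
    total = sum(counts.values())
    return counts.get(term, 0) / total if total else 0.0


def mixed_term_prob(term, content_counts, anchor_counts, mu=0.5):
    """Mix a content model and an anchor-text model of the same document.

    mu is a hypothetical mixing weight: mu * P_content + (1 - mu) * P_anchor.
    """
    return (mu * ml_estimate(term, content_counts)
            + (1.0 - mu) * ml_estimate(term, anchor_counts))


def query_log_likelihood(query_terms, content_counts, anchor_counts,
                         collection_probs, lam=0.5, mu=0.5):
    """Sum over query terms of log[(1 - lam) * P(T_i) + lam * P(T_i | D)].

    collection_probs maps each term to its collection-wide probability P(T_i).
    """
    score = 0.0
    for term in query_terms:
        p_doc = mixed_term_prob(term, content_counts, anchor_counts, mu)
        p = (1.0 - lam) * collection_probs.get(term, 0.0) + lam * p_doc
        if p <= 0.0:
            # Term unseen everywhere: the query cannot be generated.
            return float("-inf")
        score += math.log(p)
    return score
```

With this form, a query term that occurs only in the anchor text pointing at a page still contributes a nonzero document probability, which is the motivation given in the context above.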
W. Kraaij, T. Westerveld, and D. Hiemstra. The importance of prior probabilities for entry page search. In Proc. of the 25th annual international ACM SIGIR conference on research and development in information retrieval, pages 27--34. Association for Computing Machinery, 2002.
....in TREC10 that did not use a form of document prior [3]. The best performing single document representation, document in-link text, had an MRR of 0.515, so combining the document representations significantly improves performance. Additionally, we tried using the document URL priors described in [4] as a re-ranking strategy for the top 1000 documents. The use of document priors did improve performance for all evaluation measures used for the task.

Configuration | MRR | Top 10 | Fail
Equal lambdas | .676 | 83.4 | 5.5
Equal lambdas URL length prior | .799 | 91.7 | 3.4

Table 5: Results of the homepage ....
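The re-ranking step and the MRR measure mentioned in this context can be sketched as follows. This is only an illustration under stated assumptions: `toy_url_prior` is a hypothetical stand-in and not the URL prior of [4], and adding a log prior to a log-likelihood score is one common way to apply a document prior, not necessarily what the citing paper did.

```python
import math


def toy_url_prior(url):
    """Hypothetical prior that favours shallow URLs (NOT the prior from [4])."""
    depth = url.rstrip("/").count("/")
    return 1.0 / (1.0 + depth)


def rerank_with_prior(scored_docs, prior, top_k=1000):
    """Re-rank the top_k (url, log_score) pairs by log_score + log prior(url).

    Documents below top_k keep their original order.
    """
    head, tail = scored_docs[:top_k], scored_docs[top_k:]
    head = sorted(head, key=lambda d: d[1] + math.log(prior(d[0])), reverse=True)
    return [url for url, _ in head] + [url for url, _ in tail]


def reciprocal_rank(ranking, relevant):
    """1 / rank of the first relevant document, or 0.0 if none is retrieved."""
    for rank, url in enumerate(ranking, start=1):
        if url in relevant:
            return 1.0 / rank
    return 0.0


def mean_reciprocal_rank(runs):
    """Average reciprocal rank over (ranking, relevant_set) pairs."""
    return sum(reciprocal_rank(r, rel) for r, rel in runs) / len(runs)
```

In this sketch a deep page that narrowly outscores a site root on content alone drops below the root once the prior is added, which raises the reciprocal rank for a query whose target is the entry page.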
W. Kraaij, T. Westerveld, and D. Hiemstra. The Importance of Prior Probabilities for Entry Page Search. In Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2002), pages 27-34.
....factors in web search remains an active area of research. In [13] and [20] the authors present the problem of combining different ranking functions in the context of a Bayesian probabilistic model for information retrieval. This approach was used with a naive Bayesian independence assumption in [25] and [30] to combine ranking functions on document length, indegree and URL depth as prior probabilities on the documents in a collection. They further suggested a technique to combine different content models for anchor text and content. Due to limitations in their software system, their model was ....
W. Kraaij, T. Westerveld, and D. Hiemstra. The importance of prior probabilities for entry page search. In Proc. 25th SIGIR, pages 27--34, 2002.
....132 of the topics, for 16 topics there are two relevant pages, and there are three relevant pages for the remaining 2 topics. The precursor of this task was TREC 2001's home page finding task [9]. For entry page finding, non-content features such as URLs and links provided valuable information [11]. We did not see a straightforward way to use non-content features for this year's task. An alternative is to use the anchor texts in the collection [5]. For the named page finding task, we experimented with plain text runs, anchor text runs, and their combinations. Table 7: Overview of the named ....
W. Kraaij, T. Westerveld, and D. Hiemstra. The importance of prior probabilities for entry page search. In K. Järvelin, M. Beaulieu, R. Baeza-Yates, and S.H. Myaeng, editors, Proceedings of the 25th Annual International ACM SIGIR Conference on Research and development in information retrieval, pages 27--34, 2002.
W. Kraaij, T. Westerveld, and D. Hiemstra. The importance of prior probabilities for entry page search. In Proceedings of the 25th ACM Conference on Research and Development in Information Retrieval (SIGIR'02), pages 27--34, 2002.
W. Kraaij, T. Westerveld, and D. Hiemstra. The importance of prior probabilities for entry page search. In Proceedings of the 25th ACM Conference on Research and Development in Information Retrieval (SIGIR'02), 2002. (in this volume)
W. Kraaij, T. Westerveld, and D. Hiemstra. The importance of prior probabilities for entry page search. In Proc. of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, pages 27--34. ACM Press, 2002.
W. Kraaij, T. Westerveld, and D. Hiemstra. The importance of prior probabilities for entry page search. Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2002.
....previously unavailable way to reason about ranking of retrieval results. Language models have been developed at the University of Twente for cross-language information retrieval, relevance feedback and manually formulated Boolean structured queries [28, 29], adaptive filtering [27], and web search [35]. As suggested in [41, 1, 35], language models also open exciting new possibilities for combining different content representations of semistructured data. In theory, that is, because none of these publications actually reported being able to implement nontrivial versions of such a ....
....way to reason about ranking of retrieval results. Language models have been developed at the University of Twente for cross-language information retrieval, relevance feedback and manually formulated Boolean structured queries [28, 29], adaptive filtering [27], and web search [35]. As suggested in [41, 1, 35], language models also open exciting new possibilities for combining different content representations of semistructured data. In theory, that is, because none of these publications actually reported being able to implement nontrivial versions of such a system. Implementing general ....
W. Kraaij, T. Westerveld, and D. Hiemstra. The importance of prior probabilities for entry page search. In Proceedings of the 25th ACM Conference on Research and Development in Information Retrieval (SIGIR'02), pages 27--34, 2002.
Wessel Kraaij, Thijs Westerveld and Djoerd Hiemstra. The importance of prior probabilities for entry page search. In Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, pages 27--34. ACM Press, 2002.
KRAAIJ, W., WESTERVELD, T., AND HIEMSTRA, D. 2002. The importance of prior probabilities for entry page search. In Proceedings of SIGIR'02. 27--34.
Wessel Kraaij, Thijs Westerveld, and Djoerd Hiemstra. The importance of prior probabilities for entry page search. In Proceedings of SIGIR 2002.
Kraaij, W., Westerveld, T. and Hiemstra, D. (2002), The importance of prior probabilities for entry page search, in M. Beaulieu, R. Baeza-Yates, S. H. Myaeng and K. Jarvelin, eds, "Proc. ACM-SIGIR Int. Conf. on Research and Development in Information Retrieval", ACM Press, New York, Tampere, Finland, pp. 27--34.
W. Kraaij, T. Westerveld, and D. Hiemstra. The importance of prior probabilities for entry page search. In Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 27--34. ACM, 2002.
W. Kraaij, T. Westerveld, and D. Hiemstra. The importance of prior probabilities for entry page search. In K. Järvelin, M. Beaulieu, R. Baeza-Yates, and S. H. Myaeng, editors, Proceedings of the 25th Annual International ACM SIGIR Conference on Research 2002.
W. Kraaij, T. Westerveld, and D. Hiemstra. The importance of prior probabilities for entry page search. In Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 27--34. ACM, 2002.
KRAAIJ, W., WESTERVELD, T., AND HIEMSTRA, D. 2002. The importance of prior probabilities for entry page search. In Proceedings of SIGIR'02. 27--34.
W. Kraaij, T. Westerveld, and D. Hiemstra. The importance of prior probabilities for entry page search. In K. Järvelin, M. Beaulieu, R. Baeza-Yates, and S. H. Myaeng, editors, Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 27--34. ACM Press, New York NY, USA, 2002.
Wessel Kraaij, Thijs Westerveld and Djoerd Hiemstra. The importance of prior probabilities for entry page search. In Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, pages 27--34. ACM Press, 2002.
Kraaij, W., Westerveld T. and Hiemstra, D., The importance of prior probabilities for entry page search, SIGIR, pages 27-34, 2002.
W. Kraaij, T. Westerveld, and D. Hiemstra. The importance of prior probabilities for entry page search. pages 27--34, 2002.
W. Kraaij, T. Westerveld, and D. Hiemstra. The Importance of Prior Probabilities for Entry Page Search. In Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2002), pages 27-34.
W. Kraaij, T. Westerveld, D. Hiemstra. The importance of prior probabilities for entry page search. In Proceedings of the 25th Annual International ACM SIGIR Conference on Research and