• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

DMCA

Broad Expertise Retrieval in Sparse Data Environments (2007)

Cached

  • Download as a PDF

Download Links

  • [ilk.uvt.nl]
  • [www.dcs.gla.ac.uk]
  • [krisztianbalog.com]
  • [ilk.uvt.nl]
  • [ir.dcs.gla.ac.uk]
  • [ilk.uvt.nl]
  • [www.dcs.gla.ac.uk]
  • [www.science.uva.nl]
  • [staff.science.uva.nl]
  • [eprints.gla.ac.uk]
  • [eprints.gla.ac.uk]

  • Save to List
  • Add to Collection
  • Correct Errors
  • Monitor Changes
by Krisztian Balog , Maarten de Rijke, et al.
Citations:23 - 6 self
  • Summary
  • Citations
  • Active Bibliography
  • Co-citation
  • Clustered Documents
  • Version History

BibTeX

@MISC{Balog07broadexpertise,
    author = {Krisztian Balog and Maarten de Rijke and et al.},
    title = {Broad Expertise Retrieval in Sparse Data Environments },
    year = {2007}
}

Share

Facebook Twitter Reddit Bibsonomy

OpenURL

 

Abstract

Expertise retrieval has been largely unexplored on data other than the W3C collection. At the same time, many intranets of universities and other knowledge-intensive organisations offer examples of relatively small but clean multilingual expertise data, covering broad ranges of expertise areas. We first present two main expertise retrieval tasks, along with a set of baseline approaches based on generative language modeling, aimed at finding expertise relations between topics and people. For our experimental evaluation, we introduce (and release) a new test set based on a crawl of a university site. Using this test set, we conduct two series of experiments. The first is aimed at determining the effectiveness of baseline expertise retrieval methods applied to the new test set. The second is aimed at assessing refined models that exploit characteristic features of the new test set, such as the organizational structure of the university, and the hierarchical structure of the topics in the test set. Expertise retrieval models are shown to be robust with respect to environments smaller than the W3C collection, and current techniques appear to be generalizable to other settings.

Keyphrases

sparse data environment    broad expertise retrieval    new test    test set    w3c collection    broad range    baseline approach    university site    baseline expertise retrieval method    characteristic feature    experimental evaluation    organizational structure    new test set    expertise retrieval model    generative language modeling    expertise relation    knowledge-intensive organisation    refined model    expertise retrieval    clean multilingual expertise data    expertise area    main expertise retrieval task    hierarchical structure    many intranet    current technique   

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University