by Lyle H. Ungar, Dean P. Foster, Ellen Andre, Star Wars, Fred Star Wars, Dean Star Wars, Jason Hiver Whispers
http://www.cis.upenn.edu/datamining/Publications/clust.pdf
Add To MetaCart
Abstract:
Grouping people into clusters based on the items they have purchased allows accurate recommendations of new items for purchase: if you and I have liked many of the same movies, then I will probably enjoy other movies that you like. Recommending items based on similarity of interest (a.k.a. collaborative ltering) is attractive for many domains: books, CDs, movies, etc., but does not always work well. Because data are always sparse { any given person has seen only a small fraction of all movies { much more accurate predictions can be made by grouping people into clusters with similar movies and grouping movies into clusters which tend to be liked by the same people. Finding optimal clusters is tricky because the movie groups should be used to help determine the people groups and visa versa. We present a formal statistical model of collaborative ltering, and compare di erent algorithms for estimating the model parameters including variations of K-means clustering and Gibbs Sampling. This formal model is easily extended to handle clustering of objects with multiple attributes.
Citations
|
4345
|
Maximum likelihood from incomplete data via the EM algorithm
– Dempster, Laird, et al.
- 1977
|
|
604
|
Social information filtering: Algorithms for automating “word of mouth
– Shardanand, Maes
- 1995
|
|
505
|
The EM Algorithm and Extensions
– McLachlan, Krishnan
- 1996
|
|
413
|
Using collaborative filtering to weave an information tapestry
– Goldberg, Nichols, et al.
|
|
410
|
A New View of the EM Algorithm that Justifies Incremental and Other Variants“, Learning in Graphical Models
– Neal, Hinton
- 1993
|
|
105
|
Evolving agents for personalized information filtering
– Sheth, Maes
- 1993
|
|
65
|
Cluster Analysis
– Aldenderfer, Blashfield
- 1984
|
|
25
|
A new view of the EM algorithm that justi es incremental and other variants
– Neal, Hinton
- 1993
|
|
12
|
et al. Grouplens: applying collaborative filtering to usenet news
– Konstan
- 1997
|
|
7
|
The Interpretation of Analytical Chemical Data by the Use of Cluster Analysis
– Massart, Kaufman
- 1983
|
|
2
|
A Collaborative Filtering System for the Analysis of Consumer Data, unpublished
– Herz, Ungar, et al.
- 1998
|
|
2
|
et al., GroupLens: Applying collaborative ltering to Usenet news
– Konstan, Miller, et al.
- 1997
|