sac treck 2008

the effect of correlation coefficients on

communities of recommenders

neal lathia, stephen hailes, licia capradepartment of computer science

university college london

n.lathia@cs.ucl.ac.uk

ACM SAC TRECK, Fortaleza, Brazil: March 2008Trust, Recommendations, Evidence and other Collaboration

Know-how

recommender systems:

built on collaboration between users

collaborative filtering research design

methodsto solve problems

1. accuracy, coverage

2. data sparsity, cold-start

3. incorporating tag knowledge

for example,

… a method to classify content correctly

data predictedratingsintelligent

process

our focus: k-nearest neighbours (kNN)

how do we model kNN collaborative filtering?

a graph of cooperating users

nodes = userslinks = weighted according to similarity

accuracy, coverage

to answer this question, we need to find the optimal weighting:

the best similarity measure for the dataset, from the many available:

bibaia

bibaiaba

bibaia

bibaiaba

and there are more still…

5.25.2

ibiaba

concordance: proportion of agreement

DCw ba

+0.5 +3.0

-1.5+1.5

+1.5 +/-?

concordant

discordant

Somers’ d}

community view of the graph:

-0.430.57

(a very small example)

me-0.50

0.010.57

0.840.220.99

0.41 0.01

or, put another way:

-0.430.57

goodgood

nonegood

what is the best way of generating the graph?

like this?

-0.430.57

badbad

goodgood

nonebad

or like this?

-0.430.57

megood

nonebad

similarity values depend on the method used:

there is no agreement between measures

[2][3][1][5][3]

[4][1][3][2][3]

my profile neighbour profile

pearson -0.50weighted- pearson -0.05cosine angle0.76co-rated proportion1.00concordance -0.06

badnear zero

goodvery goodnear zero

nodes = userslinks = weighted according to similarity

each method will change the distribution of similarity across the graph

… the pearson distribution

intelligent process

Pearson Distribution

… the modified pearson distributionsweighted-PCC, constrained-PCC

Modified Pearson Distributions

Weighted-PCC Constrained-PCC

… and other measures

intelligent process

Other Distributions

Co-Rated Somers VSS

somers’ d, co-rated, cosine angle

an experiment withrandom numbers

what happens if we do this?

java.util.Random r = new java.util.Random()

for all neighbours i {

similarity(i) = (r.nextDouble()*2.0)-1.0);

Neighborhood Co Rated Somers’ d PCC wPCC R(0.5, 1.0) Constant(1.0) R(-1.0, 1.0)

1 0.9449 0.9492 1.1150 0.9596 1.0665 1.0406 1.0341

10 0.8498 0.8355 1.0455 0.8277 0.9595 0.9495 0.9689

30 0.7979 0.7931 0.9464 0.7847 0.8903 0.9108 0.8848

50 0.7852 0.7817 0.9007 0.7733 0.8584 0.8922 0.8498

100 0.7759 0.7728 0.8136 0.7647 0.8222 0.8511 0.8153

153 0.7726 0.7727 0.7817 0.7638 0.8053 0.8243 0.8024

229 0.7717 0.7771 0.7716 0.7679 0.7919 0.7992 0.8058

459 0.7718 0.7992 0.8073 0.8025 0.7773 0.7769 0.7811

iaia ,,accuracy

…cross-validation results in paper

movielens u1 subset…

sprediction#

sprediction uncovered#Coveragecoverage

…cross-validation results in paper

movielens u1 subset…

Neighborhood Co Rated Somers’ d PCC wPCC Oracle

1 0.67795 0.57165 0.96725 0.61375 0.00495

10 0.15455 0.0999 0.80515 0.1114 0.00495

30 0.0512 0.0407 0.57225 0.04135 0.00495

50 0.03065 0.0266 0.3641 0.0251 0.00495

100 0.01515 0.01645 0.08345 0.01485 0.00495

153 0.00945 0.0122 0.0273 0.01135 0.00495

229 0.00715 0.00965 0.01165 0.00915 0.00495

459 0.00495 0.0054 0.00495 0.00495 0.00495

(best coverage when all of community used)

why do we get these results?

a) our error measures are not good

enough?

iaia ,,

sprediction#

sprediction uncovered#Coverage

J. Herlocker, J. Konstan, L. Terveen, and J. Riedl. Evaluating collaborative filtering recommender systems. In ACM Transactions on Information Systems, volume 22, pages 5–53. ACM Press, 2004.

S.M. McNee, J. Riedl, and J.A. Konstan. Being accurate is not enough: How accuracy metrics have hurt recommender systems. In Extended Abstracts of the 2006 ACM Conference on Human Factors in Computing Systems. ACM Press, 2006.

prRMSE iaia

b) is there something wrong with the dataset?

c) is user-similarity not strong enough to capture the best recommender relationships in

the graph?

one proposal…

N. Lathia, S. Hailes, L. Capra. Trust-Based Collaborative Filtering. To appear In IFIPTM 2008: Joint iTrust and PST Conferences on Privacy, Trust management and Security. Trondheim, Norway. June 2008.

is modelling filtering as a trust-management problem a potential solution?

once we do that, more questions arise…

what other graph properties emerge from kNN collaborative filtering?

how does the graph evolve over time?

current work

N. Lathia, S. Hailes, L. Capra. Evolving Communities of Recommenders: A Temporal Evaluation. Research Note RN/08/01, Department of Computer Science, University College London. Under Submission.

N. Lathia, S. Hailes, L. Capra. kNN User Filtering: A Temporal Implicit Social Network. Current Work.

read more: http://mobblog.cs.ucl.ac.uktrust, recommendations, …

neal lathia, stephen hailes, licia capradepartment of computer science

university college london

n.lathia@cs.ucl.ac.uk

ACM SAC TRECK, Fortaleza, Brazil: March 2008Trust, Recommendations, Evidence and other Collaboration Know-how

questions?

sac treck 2008

knn collaborative filtering

graph nodes

small example

knn user filtering

acm press

graph of cooperating

weighted pearson

graph properties

Technology

indoavis nusantara vfr-sac chart bulletin (sectional...

iep-sac journal 2008-2009

lintec uv e-2001rc - treck hall

geoperu sac

austintown girls softball league €¦ · no player name...

introduction to sac-ci...

on the treck west to leadville they wore at a loss for

techfest.org · bmx jam stratazenith—a game theory...

spot cooler factory fan / large factory fan...

english sac

plasto-sac product presentation. plasto-sac security...

5fusion absorcion red sac absorve a black sac

sac manual

sac bylaws

indoavis nusantara vfr-sac chart bulletin...

sac - revised

glamour sac

brochure_smart_access_peru sac

madison, may 20, 2009 tom gaisser1 icecube collaboration...

sac technical assistance (ta) page: ...