a hybrid recommender system using link analysis and genetic tuning in the bipartite network of...

26
A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of BoardGameGeek.com Brett Boge CS 765 University of Nevada, Reno

Upload: leslie-copeland

Post on 16-Dec-2015

216 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of BoardGameGeek.com Brett Boge CS 765 University of Nevada,

A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of

BoardGameGeek.com

Brett BogeCS 765University of Nevada, Reno

Page 2: A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of BoardGameGeek.com Brett Boge CS 765 University of Nevada,

Recap

Data

General Approach

Step 1: Link-analysis

Step 2: Content-based Cascade

Step 2: Genetic tuning

Page 3: A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of BoardGameGeek.com Brett Boge CS 765 University of Nevada,

Recap

Data

General Approach

Step 1: Link-analysis

Step 2: Content-based Cascade

Step 2: Genetic tuning

Page 4: A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of BoardGameGeek.com Brett Boge CS 765 University of Nevada,

Data (Overview)

Users

400,000 +

Games

55,000 +

Ratings

0–3000

/ea

Page 5: A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of BoardGameGeek.com Brett Boge CS 765 University of Nevada,

Data (Scope)

• Starting with the top 5,000 games

• List of users == those which have rated at least one of the top 5,000 games

• Users with no ratings cannot be connected to anycomponent of the graph, and can only be evaluatedin the most general sense

Page 6: A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of BoardGameGeek.com Brett Boge CS 765 University of Nevada,

Data (Retrieval)

• Data will be obtained through the BGG XML API2

• Game|Small World, id 40692http://boardgamegeek.com/xmlapi2/

thing?id=40692&ratingcomments=1

• User|Licinianhttp://boardgamegeek.com/xmlapi2/

user?name=Licinian

http://boardgamegeek.com/xmlapi2/collection?name=Licinian&own/played/trade/want/wishlist/etc

Page 7: A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of BoardGameGeek.com Brett Boge CS 765 University of Nevada,
Page 8: A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of BoardGameGeek.com Brett Boge CS 765 University of Nevada,

Data (Sets)

Ratings/Ownership Data

TeachingSet70%

TestingSet30%

(hopefully most recent)

Page 9: A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of BoardGameGeek.com Brett Boge CS 765 University of Nevada,

Recap

Data

General Approach

Step 1: Link-analysis

Step 2: Content-based Cascade

Step 2: Genetic tuning

Page 10: A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of BoardGameGeek.com Brett Boge CS 765 University of Nevada,

• User & Item profiles• Based on content specific to that object

(properties)ContentBased

• Users & Items similar to those liked/owned in the past

• More abstract, only links matterCollaborative

Based

General Approach

Page 11: A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of BoardGameGeek.com Brett Boge CS 765 University of Nevada,

• Weighted• Switched• Mixed• Feature combination• Cascade

Methods of Hybrid Filtering

R. Burke, "Hybrid recommender systems: Survey and experiments,"

ApproachesGeneral Approach

Page 12: A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of BoardGameGeek.com Brett Boge CS 765 University of Nevada,

Our Method

ApproachesGeneral Approach

Link-

analysis

•As described by Huang et al. in A Link analysis approach to recommendation under sparse data

•A PageRank style analysis of hubs and authorities

Content-based

•Refines the previous results

•Uses information about the items themselves to adjust ranking

•Will need tuning

Page 13: A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of BoardGameGeek.com Brett Boge CS 765 University of Nevada,

Recap

Data

General Approach

Step 1: Link-analysis

Step 2: Content-based Cascade

Step 2: Genetic tuning

Page 14: A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of BoardGameGeek.com Brett Boge CS 765 University of Nevada,

Overview

From Z. Huang, et al., "A Link analysis approach torecommendation under sparse data," 2004.

ApproachesLink Analysis Step

LinkAnalysis

Consumer - Product

Matrix

ConsumerRepresentativeness

Matrix

ProductRepresentativeness

Matrix

Page 15: A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of BoardGameGeek.com Brett Boge CS 765 University of Nevada,

Matrix Definitions

From Z. Huang, et al., "A Link analysis approach torecommendation under sparse data," 2004.

ApproachesLink Analysis Step

ProductRepresentativeness

Matrix

ConsumerRepresentativeness

Matrix

Page 16: A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of BoardGameGeek.com Brett Boge CS 765 University of Nevada,

Initialization

From Z. Huang, et al., "A Link analysis approach torecommendation under sparse data," 2004.

ApproachesLink Analysis Step

ConsumerRepresentativeness

Matrix

ProductRepresentativeness

Matrix

Page 17: A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of BoardGameGeek.com Brett Boge CS 765 University of Nevada,

Update Phase

From Z. Huang, et al., "A Link analysis approach torecommendation under sparse data," 2004.

ApproachesLink Analysis Step

Update Phase

ConsumerRepresentativeness

Matrix

ProductRepresentativeness

Matrix

Page 18: A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of BoardGameGeek.com Brett Boge CS 765 University of Nevada,

Recap

Data

General Approach

Step 1: Link-analysis

Step 2: Content-based Cascade

Step 2: Genetic tuning

Page 19: A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of BoardGameGeek.com Brett Boge CS 765 University of Nevada,

Product Representativeness Result

ApproachesContent-based Cascade

ProductRepresentativeness

Matrix

Game1

Game2

Game3

UserA

x x x

UserB

PR21 PR22 PR23

UserC

x x x

PRi

Page 20: A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of BoardGameGeek.com Brett Boge CS 765 University of Nevada,

Additional Data

ApproachesContent-based Cascade

Property Description

Subdomain (S) General type of game (Strategy,Family, Party)

Category (C) Genre/specific type of game(Civilization, Territory Building)

Playing Time (P) Publisher provided, in minutes

Mechanic (M) Game mechanics used (Dice Rolling,Variable Powers)

Suggested best Number of players (N)

User voted best number of players toplay the game

Page 21: A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of BoardGameGeek.com Brett Boge CS 765 University of Nevada,

Similarity Measures

ApproachesContent-based Cascade

Property Similarity

Subdomain (S) Cosine

Category (C) Cosine

Playing Time (P) Error

Mechanic (M) Cosine

Suggested best Number of players (N)

Error

These will need to be normalized on the same scale (0.00 - 1.00)

Page 22: A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of BoardGameGeek.com Brett Boge CS 765 University of Nevada,

Product Similarity Matrix

ApproachesContent-based Cascade

S C P M N

Game 1 .12 .2 .6 .1 .5

Page 23: A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of BoardGameGeek.com Brett Boge CS 765 University of Nevada,

Refining the Product Ranking

ApproachesContent-based Cascade

• Create PRfinal by refining PR:

• W is a vector of weights which determine how much a givenproperty should effect the original score

Page 24: A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of BoardGameGeek.com Brett Boge CS 765 University of Nevada,

Recap

Data

General Approach

Step 1: Link-analysis

Step 2: Content-based Cascade

Step 2: Genetic tuning

Page 25: A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of BoardGameGeek.com Brett Boge CS 765 University of Nevada,

Determining an Optimal W

ApproachesGenetic Tuning

• W needs to be defined optimally for this given domain

• A genetic algorithm will be used to tune W

• Chromosome = sequential binary representation of W

• Fitness based on Rank Score (from Huang et al.)

• 8 bits per weight, ranging from 0 - .25 to start

• Rates of crossover/mutation TBD

Page 26: A Hybrid Recommender System Using Link Analysis and Genetic Tuning in the Bipartite Network of BoardGameGeek.com Brett Boge CS 765 University of Nevada,

Conclusion / Questions