cstalks-quaternary semantics recomandation system-24 aug

44
1 A Unified Framework for Recommendations Based on Quaternary Semantic Analysis Wei Chen *, Wynne Hsu*, Mong Li Lee* *School of Computing, National University of Singapore

Upload: cstalks

Post on 09-May-2015

407 views

Category:

Education


0 download

TRANSCRIPT

Page 1: CSTalks-Quaternary Semantics Recomandation System-24 Aug

1

A Unified Framework for

Recommendations Based on

Quaternary Semantic Analysis

Wei Chen*, Wynne Hsu*, Mong Li Lee*

*School of Computing, National University of Singapore

Page 2: CSTalks-Quaternary Semantics Recomandation System-24 Aug

Introduction

The amount of information on the web is increasing

at a lightning pace. E.g products in Amazon, videos

in Youtube, movies in Netflix

Recommendation is necessary.

Page 3: CSTalks-Quaternary Semantics Recomandation System-24 Aug

Introduction

Recommendation systems are typically classified

according to four types :

User recommendation

Item recommendation

Tag recommendation

Item rating prediction

Page 4: CSTalks-Quaternary Semantics Recomandation System-24 Aug

Related Work

Most of the work in recommendation systems utilize only ternary relationships in generating recommendations.

The collaborative filtering-based recommendation systems use <user ,rating, items >

[B. Sarwar,WWW’01,SIGIR’09]

Tag-based recommendation systems utilize the <users, tags, items >.

Page 5: CSTalks-Quaternary Semantics Recomandation System-24 Aug

Motivation

We argue that recommendations based on ternary

relationships are not accurate as they would have

missed out important associations

Page 6: CSTalks-Quaternary Semantics Recomandation System-24 Aug

Motivation Example

Page 7: CSTalks-Quaternary Semantics Recomandation System-24 Aug

Motivation Example

Beautiful Mind and Groundhog day will be recommended to U3

Page 8: CSTalks-Quaternary Semantics Recomandation System-24 Aug

Motivation example

Page 9: CSTalks-Quaternary Semantics Recomandation System-24 Aug

Motivation example

Groundhog Day and Toy story will be recommended to

U3

Page 10: CSTalks-Quaternary Semantics Recomandation System-24 Aug

Motivation example

Page 11: CSTalks-Quaternary Semantics Recomandation System-24 Aug

Motivation example

Groundhog day is recommended to U3

Page 12: CSTalks-Quaternary Semantics Recomandation System-24 Aug

Motivation

The need of quaternary relationship is necessary. This is reinforced by the following observations:

Users may use the same tag for an item but have different ratings for it.

Items may have multiple tags indicating their different facets.

Some tags may carry implicit semantics that can reveal the users’ preferences.

Page 13: CSTalks-Quaternary Semantics Recomandation System-24 Aug

Overview of the paper

We propose a model: using tensor to model the

quaternary relationship.

Higher-Order Singular Value Decomposition

(HOSVD) is applied in the 4-order tensor to reveal

the latent semantic associations among users,

items, tags and ratings.

Page 14: CSTalks-Quaternary Semantics Recomandation System-24 Aug

BACKGROUND - Tensor

A tensor is a multidimensional array. An N-order

tensor is denoted as

Page 15: CSTalks-Quaternary Semantics Recomandation System-24 Aug

BACKGROUND – Tensor unfolding

The matrix unfolding of an N-order tensor

along the dimension i are vectors

obtained by keeping the index fixed while varying

the other indices.

Page 16: CSTalks-Quaternary Semantics Recomandation System-24 Aug

BACKGROUND – n-mode product

Page 17: CSTalks-Quaternary Semantics Recomandation System-24 Aug

BACKGROUND – HOSVD

HOSVD is a generalization of Singular Value

Decomposition (SVD) to higher-order tensors and

can be written as n-mode product

Where U(n) contain the orthonormal vectors (n-

mode singular vector) spanning the column space

of the A (n) , is the core tensor

Page 18: CSTalks-Quaternary Semantics Recomandation System-24 Aug

BACKGROUND – HOSVD

Page 19: CSTalks-Quaternary Semantics Recomandation System-24 Aug

BACKGROUND – HOSVD

With this, the core tensor can be

constructed as described in [L. D.,SIAM 2000], that is

and we can get:

Page 20: CSTalks-Quaternary Semantics Recomandation System-24 Aug

BACKGROUND- Rank, Low Rank

Approximation

Page 21: CSTalks-Quaternary Semantics Recomandation System-24 Aug

BACKGROUND

Suppose we want to get the RANK-(2,3,3)

approximation, we first retain the first ci column of

matrix U(i) at mode i as follows:

Page 22: CSTalks-Quaternary Semantics Recomandation System-24 Aug

BACKGROUND –Tensor

Approximation

We can now construct the approximate core tensor

using

Page 23: CSTalks-Quaternary Semantics Recomandation System-24 Aug

BACKGROUND

Finally, we obtain the RANK-(2,3,3) approximation

Page 24: CSTalks-Quaternary Semantics Recomandation System-24 Aug

QUATENARY SEMANTIC

ANALYSIS

The main idea is to capture the underlying

relationships among users-tags-items-ratings by

reducing the rank of the original tensor to minimize

the effect of noise on the underlying population

and reduce spareness.

Page 25: CSTalks-Quaternary Semantics Recomandation System-24 Aug

QUATENARY SEMANTIC

ANALYSIS - Initialization

Input: list of quadruples < users, tags, rating, items>;

Page 26: CSTalks-Quaternary Semantics Recomandation System-24 Aug

QUATENARY SEMANTIC

ANALYSIS - Initialization

constructed tensor

where |U|, |T|, |R| and |V| is the number of user, tags , ratings

and items respectively

Page 27: CSTalks-Quaternary Semantics Recomandation System-24 Aug

QUATENARY SEMANTIC

ANALYSIS

Calculate the matrix unfolding A(1) , A(2) , A(3) and

A(4) from Tensor

Perform SVD on each matrix unfolding and get the

left singular matrix U(1) , U(2) , U(3) and U(4)

Page 28: CSTalks-Quaternary Semantics Recomandation System-24 Aug

QUATENARY SEMANTIC

ANALYSIS Remove the least significant rows |U|-c1; |V |-c2; |T|-c3

and |R|-c4 from U(1);U(2);U(3); and U(4), respectively. We

choose c1= 4; c2 =4; c3 = 4; c4 = 2.

Page 29: CSTalks-Quaternary Semantics Recomandation System-24 Aug

QUATENARY SEMANTIC

ANALYSIS

Calculate the approximate core tensor

Approximate the original tensor by:

Page 30: CSTalks-Quaternary Semantics Recomandation System-24 Aug

QUATENARY SEMANTIC

ANALYSIS

Page 31: CSTalks-Quaternary Semantics Recomandation System-24 Aug

QUATENARY SEMANTIC

ANALYSIS

Latent associations such as the newly added

quadruples in Table 6 may not be found if the

tensor data is sparse

We overcome this problem by applying a

smoothing technique to the tensor in Algorithm.

Page 32: CSTalks-Quaternary Semantics Recomandation System-24 Aug

RECOMMENDATION

GENERATION

Page 33: CSTalks-Quaternary Semantics Recomandation System-24 Aug

RECOMMENDATION

GENERATION

Page 34: CSTalks-Quaternary Semantics Recomandation System-24 Aug

RECOMMENDATION

GENERATION

Page 35: CSTalks-Quaternary Semantics Recomandation System-24 Aug

RECOMMENDATION

GENERATION

Page 36: CSTalks-Quaternary Semantics Recomandation System-24 Aug

Experimental result – dataset

description

Datasets: Movielens Data

The first file contains users’ tags on different movies. The second file contains users’ ratings on different movies on a scale of 1 to 5.

By joining these two files over user and movie, we obtain the quadruples < user; movie; tag; rating >.

After preprocessing, the dataset has 11122 tuples with 201 users, 501 movies, and 404 tags.

Page 37: CSTalks-Quaternary Semantics Recomandation System-24 Aug

Experimental result – Item

Recommendation

Compare method:

UPCC: User based recommendation

IPCC: Item based recommendation

Probabilistic Matrix Factorization (PMF)

Page 38: CSTalks-Quaternary Semantics Recomandation System-24 Aug

Experimental result – Item

recommendation

Page 39: CSTalks-Quaternary Semantics Recomandation System-24 Aug

Experimental result – Rating

Prediction

Page 40: CSTalks-Quaternary Semantics Recomandation System-24 Aug

Experimental result – Tag

Recommendation

Compare method:

TSA [TKDE10]: Ternary Semantic Analysis

RTF [KDD.09]: Optimal ranking using tensor

factorization.

Page 41: CSTalks-Quaternary Semantics Recomandation System-24 Aug

Experimental result – Tag

Recommendation

Page 42: CSTalks-Quaternary Semantics Recomandation System-24 Aug

Experimental result – User

recommendation

Page 43: CSTalks-Quaternary Semantics Recomandation System-24 Aug

Conclusion

We have shown that quaternary semantic analysis

can lead to more accurate recommendation.

We have proposed using a 4-order tensor to model

the four heterogeneous entities: users, items, tags

and ratings.

A unified framework is proposed that utilize

quaternary relation for user recommendation, item

recommendation, tag recommendation and rating

prediction.

Page 44: CSTalks-Quaternary Semantics Recomandation System-24 Aug

Thank you very much!

Q/A

44