phd defense


Recommender Systems for Social Tagging Systems

Leandro Balby Marinho

Machine Learning Lab, University of Hildesheim

PhD Defense

Leandro Balby Marinho 1 / 32 Machine Learning Lab, University of Hildesheim


Outline

1. Motivation

2. Problems and Contributions

3. Tag Recommender Systems

4. Nearest Neighbor-based Tag Recommendation

5. Cross-Tagging

6. Tag Enrichment

7. Conclusions and Future Work



I Web 2.0 sites are used more than e-mail! [Nielsen Online (2009)]

I In Web 2.0, the user plays the main role!



I Tags help users to organize and retrieve content.



I Tags also help other users to organize and retrieve their content.



Folksonomy

I A folksonomy is a structure F := (U, R, T, Y)

I U ... users

I R ... resources

I T ... tags

I Y ⊆ U × R × T ... tag assignments

I X := {(u, r) | ∃t ∈ T : (u, r, t) ∈ Y} ... set of posts
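The folksonomy definition can be made concrete with a small sketch. The data below is made up for illustration, and the helper names (`components`, `posts`, `tags_of`) are hypothetical; the sketch also assumes that every user, resource, and tag appears in at least one tag assignment.

```python
# Toy folksonomy: Y is a set of (user, resource, tag) triples (made-up data).
Y = {
    ("alice", "r1", "python"),
    ("alice", "r1", "tutorial"),
    ("bob", "r1", "python"),
    ("bob", "r2", "music"),
}


def components(Y):
    """Derive U, R, T from the tag assignments (assumes none is unused)."""
    U = {u for u, _, _ in Y}
    R = {r for _, r, _ in Y}
    T = {t for _, _, t in Y}
    return U, R, T


def posts(Y):
    """X := {(u, r) | there is a tag t with (u, r, t) in Y}."""
    return {(u, r) for u, r, _ in Y}


def tags_of(Y, u, r):
    """T(u, r): the tags that user u assigned to resource r."""
    return {t for (v, s, t) in Y if v == u and s == r}


U, R, T = components(Y)
X = posts(Y)
```

Here a post is simply a distinct (user, resource) pair, so Alice's two tags on `r1` form a single post.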







Problems and Contributions

I Tag Sparsity: Users are too lazy to tag!

I $1 - \frac{|Y|}{|U| \times |R| \times |T|} \approx 0.99$ in all datasets used!

I Solution: Tag Recommendation

I Social Network Divide: Compatible social systems are disconnected.

I Solution: Cross-Tagging.

I Tag Idiosyncrasy: Tags bearing unclear semantics.

I Solution: Tag Enrichment.
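
The sparsity figure for Tag Sparsity can be checked against the dataset sizes reported on the evaluation slide of this talk. The function name `sparsity` is a hypothetical helper; the numbers are the |U|, |R|, |T|, |Y| values from that table.

```python
def sparsity(n_users, n_resources, n_tags, n_assignments):
    """1 - |Y| / (|U| * |R| * |T|): share of possible triples that are absent."""
    return 1.0 - n_assignments / (n_users * n_resources * n_tags)


# (|U|, |R|, |T|, |Y|) as listed in the evaluation datasets table.
datasets = {
    "BibSonomy": (116, 361, 412, 10_148),
    "Last.fm": (2_917, 1_853, 2_045, 219_702),
    "Delicious": (37_399, 74_874, 22_170, 7_487_319),
}

for name, sizes in datasets.items():
    print(f"{name}: {sparsity(*sizes):.6f}")
```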







Tag Recommender Systems

I ...change the process from creation to recognition!

I Personalized methods take the user's preferences for tags into consideration.

I Value for the industry, e.g., YouTube, Flickr, Last.fm, Amazon.



Evaluation and Metric

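I $X_{\text{train}} \cup X_{\text{test}} = X$ ... train/test splits based on posts

I For each user, randomly pick one post for test.

I Task: For $(u, r) \in X_{\text{test}}$ compute the recommended tag set $\hat{T}(u, r)$

I Metric: $\text{Recall}((u, r) \in X_{\text{test}}, n) := \frac{|T(u, r) \cap \hat{T}(u, r)|}{|T(u, r)|}$, where $T(u, r)$ is the set of tags the user actually assigned and $\hat{T}(u, r)$ is the top-$n$ recommendation.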
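
A minimal sketch of the recall metric for a single test post. The function name `recall_at_n` and its arguments are illustrative; the sketch assumes the recommender returns a ranked list of tags and that the test post has at least one true tag.

```python
def recall_at_n(true_tags, ranked_tags, n):
    """|T(u,r) ∩ top-n recommended tags| / |T(u,r)| for one test post."""
    top_n = set(ranked_tags[:n])
    return len(set(true_tags) & top_n) / len(true_tags)


# Example: the user really used {"a", "b"}; the recommender ranked a, c, b.
r2 = recall_at_n({"a", "b"}, ["a", "c", "b"], 2)
r3 = recall_at_n({"a", "b"}, ["a", "c", "b"], 3)
```

Averaging this value over all posts in the test set would give one point of a recall curve for a given n.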



Formalization

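I Given $(u, r) \in X_{\text{test}}$, a tag recommender system first computes:

$$\text{Utility} : \{u\} \times \{r\} \times T \to \mathbb{R} \quad (1)$$

I And then presents the top-$n$ tags in descending order of their utility:

$$\hat{T}(u, r) := \operatorname*{argmax}^{n}_{t \in T} \; \text{Utility}(u, r, t) \quad (2)$$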
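
Equation (2) amounts to a top-n selection over the utility scores. The sketch below assumes the utilities for one (u, r) pair are already available as a dictionary; `top_n_tags` and the example scores are hypothetical.

```python
import heapq


def top_n_tags(utility, n):
    """Return the n tags with the highest utility, best first."""
    return heapq.nlargest(n, utility, key=utility.get)


# Made-up utilities for a single (user, resource) pair.
utility = {"python": 0.9, "web": 0.2, "tutorial": 0.5}
best_two = top_n_tags(utility, 2)
```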







Nearest Neighbor-based (NN) Tag Recommenders

I Collaborative Filtering (CF): Similar users tend to like similar things.

I Here: Similar users tend to tag alike.

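I Traditional CF cannot be directly applied to folksonomies unless the ternary relation Y is reduced to two-dimensional projections:

[Figure: the users × resources × tags relation Y and its two projections, π_UR Y (users × resources) and π_UT Y (users × tags).]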
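
A minimal sketch of the two projections π_UR Y and π_UT Y. The slides do not spell out how the projections are defined, so this assumes binary projections: a user is linked to a resource (or tag) if at least one tag assignment connects them. Rows are stored sparsely as sets, and `project` is a hypothetical helper.

```python
from collections import defaultdict


def project(Y):
    """Return sparse binary rows of pi_UR Y and pi_UT Y, keyed by user."""
    pi_ur = defaultdict(set)  # user -> resources the user has tagged
    pi_ut = defaultdict(set)  # user -> tags the user has used
    for u, r, t in Y:
        pi_ur[u].add(r)
        pi_ut[u].add(t)
    return dict(pi_ur), dict(pi_ut)


# Made-up tag assignments.
Y = {("alice", "r1", "python"), ("alice", "r2", "web"), ("bob", "r1", "python")}
pi_ur, pi_ut = project(Y)
```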



Collaborative Filtering for Tag Recommendation

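I Neighborhood Formation:

$$N_u^k := \operatorname*{argmax}^{k}_{v \in U_r \setminus \{u\}} \; \text{sim}(\vec{m}_u, \vec{m}_v)$$

I Recommendation:

$$\hat{T}(u, r) := \operatorname*{argmax}^{n}_{t \in T} \sum_{v \in N_u^k} \text{sim}(\vec{m}_u, \vec{m}_v)\, \delta(v, r, t)$$

where $\delta(v, r, t) := 1$ if $(v, r, t) \in Y$ and $0$ otherwise.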
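
The two steps can be sketched end to end as follows. Several choices here are assumptions rather than the author's setup: cosine similarity is used for sim, user profiles are tag-count vectors, and U_r is read as the set of users who tagged resource r. All function names and the toy data are hypothetical.

```python
import math
from collections import defaultdict


def cosine(a, b):
    """Cosine similarity of two sparse vectors stored as dicts."""
    dot = sum(w * b[k] for k, w in a.items() if k in b)
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def user_tag_profiles(Y):
    """Row m_u for every user: how often u used each tag."""
    prof = defaultdict(lambda: defaultdict(int))
    for u, _, t in Y:
        prof[u][t] += 1
    return prof


def recommend_cf(Y, u, r, k=2, n=5):
    prof = user_tag_profiles(Y)
    # Assumed reading of U_r \ {u}: other users who tagged resource r.
    candidates = {v for (v, s, _) in Y if s == r and v != u}
    sims = {v: cosine(prof[u], prof[v]) for v in candidates}
    neighbours = sorted(sims, key=sims.get, reverse=True)[:k]
    # Sum the similarities of the neighbours that assigned tag t to r.
    scores = defaultdict(float)
    for (v, s, t) in Y:
        if s == r and v in neighbours:
            scores[t] += sims[v]
    return sorted(scores, key=scores.get, reverse=True)[:n]


Y = {
    ("u1", "r1", "python"), ("u1", "r2", "web"),
    ("u2", "r1", "python"), ("u2", "r3", "python"), ("u2", "r3", "code"),
    ("u3", "r3", "music"), ("u3", "r1", "music"),
    ("u4", "r3", "python"), ("u4", "r4", "web"),
}
recommended = recommend_cf(Y, "u1", "r3", k=2, n=5)
```

In this toy data `u4` and `u2` are the two nearest neighbours of `u1` on `r3`, so their tags dominate the ranking.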



Ensembles of CF

I Projections’ Ensemble:

I Similarities’ Ensemble:

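$$\hat{T}(u, r) = \operatorname*{argmax}^{n}_{t \in T} \sum_{v \in N_u} \left( \lambda\, \text{sim}(\vec{m}_u, \vec{m}_v) + (1 - \lambda)\, \text{sim}(\vec{z}_u, \vec{z}_v) \right) \delta(v, r, t)$$

where $\vec{m}_u$ and $\vec{m}_v$ are rows of $\pi_{UT} Y$, and $\vec{z}_u$ and $\vec{z}_v$ are rows of $\pi_{UR} Y$.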
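
A minimal sketch of the blended scoring in the similarities' ensemble, assuming the two similarities per neighbour have already been computed. The function `ensemble_scores`, its arguments, and the numbers are hypothetical.

```python
from collections import defaultdict


def ensemble_scores(sim_ut, sim_ur, neighbour_tags, lam=0.5):
    """Score tags with lam * sim on pi_UT Y + (1 - lam) * sim on pi_UR Y.

    sim_ut[v], sim_ur[v]: similarity of neighbour v to the target user.
    neighbour_tags[v]: tags that v assigned to the target resource.
    """
    scores = defaultdict(float)
    for v, tags in neighbour_tags.items():
        blended = lam * sim_ut[v] + (1 - lam) * sim_ur[v]
        for t in tags:
            scores[t] += blended
    return dict(scores)


scores = ensemble_scores(
    sim_ut={"v1": 0.8, "v2": 0.2},
    sim_ur={"v1": 0.4, "v2": 0.6},
    neighbour_tags={"v1": {"rock"}, "v2": {"rock", "pop"}},
    lam=0.5,
)
```

Setting `lam=1.0` falls back to the user-tag similarity alone, and `lam=0.0` to the user-resource similarity alone.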



A Graph-Based Tag Recommender based on Posts

We represent X as a homogeneous, undirected graph $G := (X, E)$ over the post set. Posts are related to each other if they share the same user:

$$R_{\text{user}} := \{(x, x') \in X \times X \mid \text{user}(x) = \text{user}(x')\}$$

the same resource:

$$R_{\text{res}} := \{(x, x') \in X \times X \mid \text{res}(x) = \text{res}(x')\}$$

or share either the same user or the same resource:

$$R^{\text{res}}_{\text{user}} := R_{\text{user}} \cup R_{\text{res}}$$

where user(x) and res(x) are the user and the resource associated with the post x, respectively.
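
A minimal sketch of the three relations over a toy post set. Posts are (user, resource) pairs; unlike the formal definition, the sketch leaves out self-pairs (x, x), which is an assumption made to keep the graph free of loops. The helper name `post_relations` is hypothetical, and the quadratic enumeration is only meant for small examples.

```python
from itertools import permutations


def post_relations(X):
    """Return (R_user, R_res, R_user ∪ R_res) as sets of ordered post pairs."""
    r_user = {(x, y) for x, y in permutations(X, 2) if x[0] == y[0]}
    r_res = {(x, y) for x, y in permutations(X, 2) if x[1] == y[1]}
    return r_user, r_res, r_user | r_res


X = {("a", "r1"), ("a", "r2"), ("b", "r1")}
r_user, r_res, r_both = post_relations(X)
```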



Relational Graph based on Posts



Weighting Schemes

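For $x \in X_{\text{test}}$ and $(x, x') \in E$:

1. User-Tag Profile:

$$\phi^{\text{user-tag}} := \left( |Y \cap (\{\text{user}(x)\} \times R \times \{t\})| \right)_{t \in T}$$

2. Resource-Tag Profile:

$$\phi^{\text{res-tag}} := \left( |Y \cap (U \times \{\text{res}(x)\} \times \{t\})| \right)_{t \in T}$$

Weight:

$$w(x, x') := \frac{\langle \phi(x), \phi(x') \rangle}{\|\phi(x)\| \, \|\phi(x')\|}$$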
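
The profiles are tag-count vectors and the weight is their cosine. A minimal sketch with made-up data; the helper names are hypothetical.

```python
import math
from collections import Counter


def user_tag_profile(Y, user):
    """Count how often `user` assigned each tag, over all resources."""
    return Counter(t for (u, _, t) in Y if u == user)


def res_tag_profile(Y, res):
    """Count how often each tag was assigned to `res`, over all users."""
    return Counter(t for (_, r, t) in Y if r == res)


def weight(phi_x, phi_y):
    """Cosine of two profiles; 0.0 if either profile is empty."""
    dot = sum(c * phi_y[t] for t, c in phi_x.items())
    norm_x = math.sqrt(sum(c * c for c in phi_x.values()))
    norm_y = math.sqrt(sum(c * c for c in phi_y.values()))
    return dot / (norm_x * norm_y) if norm_x and norm_y else 0.0


Y = {
    ("u1", "r1", "python"), ("u2", "r1", "python"), ("u2", "r1", "web"),
    ("u3", "r2", "python"),
}
w = weight(res_tag_profile(Y, "r1"), res_tag_profile(Y, "r2"))
```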



Relational Classification

Weighted Average (WA) [Marinho et al. (2009)]:

$$P(t \mid x) := \frac{\sum_{x' \in N_x \,\mid\, t \in T(x')} w(x, x')}{\sum_{x' \in N_x} w(x, x')}$$

where:

$$N_x := \{x' \in X \mid (x, x') \in R,\; T(x') \neq \emptyset\}$$

Runtime: $O(|T| \, |N_x|)$
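
A minimal sketch of the weighted average, assuming the neighbourhood of a test post has already been collected as (weight, tags) pairs. The function name and the example values are hypothetical.

```python
from collections import defaultdict


def weighted_average(neighbours):
    """P(t | x) for every tag t seen in the neighbourhood of x.

    neighbours: list of (w(x, x'), T(x')) pairs, one per x' in N_x.
    """
    total = sum(w for w, _ in neighbours)
    if total == 0:
        return {}
    scores = defaultdict(float)
    for w, tags in neighbours:
        for t in tags:
            scores[t] += w
    return {t: s / total for t, s in scores.items()}


probs = weighted_average([
    (0.5, {"rock", "indie"}),
    (0.25, {"rock"}),
    (0.25, {"pop"}),
])
```

Sorting the tags by these values and keeping the first n gives the recommendation list for the post.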



Evaluation

Datasets:

dataset      |U|      |R|      |T|      Triples |Y|   Posts |X|
BibSonomy    116      361      412      10,148        2,522
Last.fm      2,917    1,853    2,045    219,702       75,565
Delicious    37,399   74,874   22,170   7,487,319     3,055,436

Evaluated methods:

I Baselines: (Locally) Constant Models (GCT, LCR, LCU).

I Ensemble of Locally Constant Models (LCE) [Jäschke et al. 2008].

I TopicRank, FolkRank [Jäschke et al. 2007]

I RTF [Rendle et al. 2009]

I PITF [Rendle et al. 2010]

I Our NN-based Recommenders



Results: NN Methods

[Figure: Top-10 Tag Recommendations in Delicious. x-axis: number of recommended tags (0 to 10); y-axis: recall (0 to 1). Curves: WA, CF UT, CF UR, matrixExt, simEns, LCR, GCT.]



Results: WA vs. State-of-the-Art

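[Figure: Top-10 Tag Recommendations in BibSonomy. x-axis: number of recommended tags (0 to 10); y-axis: recall (0 to 1). Curves: WA, RTF, PITF, FolkRank, LCE, TopicRank.]

[Figure: Top-10 Tag Recommendations in Last.fm. x-axis: number of recommended tags (0 to 10); y-axis: recall (0 to 1). Curves: PITF, WA, RTF, FolkRank, LCE, TopicRank.]

[Figure: Top-10 Tag Recommendations in Delicious. x-axis: number of recommended tags (0 to 10); y-axis: recall (0 to 1). Curves: PITF, WA, FolkRank, LCE, TopicRank.]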



Runtime: WA vs. PITF

Method   BibSonomy     Last.fm      Delicious
WA       < 1 second    < 1 minute   ≈ 3 minutes
PITF     ≈ 5 minutes   ≈ 7 hours    ≈ 33 days



ECML/PKDD Discovery Challenge 2009

2nd Place ECML/PKDD Discovery Challenge 2009!

Rank   Method                                        Top-5 F1
1      PITF [Rendle et al. (2009)]                   0.35594
2      Relational Ensemble [Marinho et al. (2009)]¹  0.33185
–      WA (not submitted)                            0.32519
3      Content-based [Lipczak et al. (2009)]         0.32461

¹ With Christine Preisach






Problem

Use the overlap in resources to transfer (cross) tags between systems.



Tag Recommendation for Cross-Tagging

I Cross-Tagging Approaches:

I LCR (locally constant per resource).

I Collaborative Filtering.



Evaluation

Tag-Aware-based Evaluation

I The better the tags, the better a tag-aware recommender that uses those tags performs.

I Tag-Aware based on HOSVD [Symeonidis et al. (2008)]

Datasets

       Blogger.com   Last.fm    Annotated Blog
|U|    6,620         44,143     3,827
|R|    17,372        17,372     1,323
|T|    0             4,903      422
|Y|    0             254,388    32,900



Recall on the top-5 resources of HOSVD

I n - Number of tags used to annotate the test posts of Blogger.com.







Problems

I First we map tags from a folksonomy to concepts C of an ontology

H : T → C

I Then we learn an ontology P such that:

$C_P := T \cup C$

I The better the ontology, the better an ontology-aware recommender that uses this ontology performs.

I Taxonomy driven CF [Ziegler et al. (2004)]

Datasets:

dataset    |U|     |T|     |R|   |Y|
Last.fm    3,532   7,081   982   130,899
musicmoz   -       555     982   -



Results

[Figure: bar chart of recall (y-axis, 0 to 0.4) for three ontologies: Trivial Ontology, Domain Expert Ontology, Learned Ontology.]







Conclusions

I Tag Sparsity: Nearest neighbor method that

I Performs competitively with more sophisticated methods.

I Requires modest computational effort.

I Social Network Divide:

I Cross-tagging as a tag recommendation problem.

I Personalized cross-tagging is better than non-personalized cross-tagging.

I Tag Idiosyncrasy: Tag enrichment

I Well-agreed concepts that match the semantic intention of users.

I Learned ontology better than trivial or domain expert ontology.

I New recommender systems-based evaluation protocols.



Future Work

I Optimized weight learning for WA.

I Bidirectional Cross-Tagging.

I Optimized Cross-Tagging/Ontology learning.



Results NN vs. Baselines

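[Figure: Top-10 Tag Recommendations in BibSonomy. x-axis: number of recommended tags (0 to 10); y-axis: recall (0 to 1). Curves: WA, CF UT, CF UR, matrixExt, simEns, LCR, GCT.]

[Figure: Top-10 Tag Recommendations in Last.fm. x-axis: number of recommended tags (0 to 10); y-axis: recall (0 to 1). Curves: WA, CF UT, CF UR, matrixExt, simEns, LCR, GCT.]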



PageRank for Folksonomies

I Based on PageRank [Hotho et al. 2006]

I Each hyperedge is broken into three undirected edges:

I Now PageRank can be applied:

$$\vec{w}_{t+1} \leftarrow \lambda A^{T} \vec{w}_t + (1 - \lambda)\, \vec{p}$$

I The rank will be dominated by popular nodes (skewed distribution of tag assignments)



FolkRank

1. First compute vector $\vec{w}^{(0)}$ with $\vec{p} = \mathbf{1}$.

2. Next compute vector $\vec{w}^{(1)}$ with $\vec{p}[u] := 1 + |U|$, $\vec{p}[r] := 1 + |R|$, and $\vec{p}[v] := 1$ for $v \neq u, r$.

3. Finally compute $\vec{w} := \vec{w}^{(1)} - \vec{w}^{(0)}$.

4. The recommendation list $\hat{T}(u, r)$ consists of the top-$n$ nodes in the ranking restricted to tags.
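
The four steps can be sketched on a toy folksonomy as follows. This is an illustration under stated assumptions, not the original implementation: the preference vector is normalized to sum to one, A is taken as the row-normalized adjacency matrix, λ = 0.7 and 50 iterations are arbitrary choices, and all function names are hypothetical.

```python
from collections import defaultdict


def build_graph(Y):
    """Break each (u, r, t) into undirected edges u-r, u-t, r-t."""
    adj = defaultdict(lambda: defaultdict(float))
    for u, r, t in Y:
        nu, nr, nt = ("U", u), ("R", r), ("T", t)
        for a, b in ((nu, nr), (nu, nt), (nr, nt)):
            adj[a][b] += 1.0
            adj[b][a] += 1.0
    return adj


def pagerank(adj, pref, lam=0.7, iters=50):
    """Iterate w <- lam * A^T w + (1 - lam) * p (A row-normalized, assumed)."""
    total = sum(pref[v] for v in adj)
    p = {v: pref[v] / total for v in adj}
    w = dict(p)
    out = {v: sum(nb.values()) for v, nb in adj.items()}
    for _ in range(iters):
        nxt = {v: (1 - lam) * p[v] for v in adj}
        for v, nb in adj.items():
            for x, weight in nb.items():
                nxt[x] += lam * w[v] * weight / out[v]
        w = nxt
    return w


def folkrank(Y, user, res, n=5, lam=0.7):
    adj = build_graph(Y)
    n_users = sum(1 for kind, _ in adj if kind == "U")
    n_res = sum(1 for kind, _ in adj if kind == "R")
    base = {v: 1.0 for v in adj}          # step 1: p = 1
    pers = dict(base)                     # step 2: boost u and r
    pers[("U", user)] = 1.0 + n_users
    pers[("R", res)] = 1.0 + n_res
    w0 = pagerank(adj, base, lam)
    w1 = pagerank(adj, pers, lam)
    # Step 3 and 4: differential weight, restricted to tag nodes.
    diff = {v[1]: w1[v] - w0[v] for v in adj if v[0] == "T"}
    return sorted(diff, key=diff.get, reverse=True)[:n]


# Two disconnected triangles: boosting (u1, r1) should favour "python".
Y = {("u1", "r1", "python"), ("u2", "r2", "music")}
ranking = folkrank(Y, "u1", "r1", n=2)
```

Subtracting the unbiased ranking is what counters the dominance of globally popular nodes mentioned on the previous slide.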



RTF: Ranking with Tensor Factorization

I Tag Recommendation as a tensor completion problem.

I Positive tags have higher rank than negative ones [Rendle et al. 2009]:

$$\hat{y}_{u,r,t_1} > \hat{y}_{u,r,t_2} \Leftrightarrow t_1 \in T^{+}_{u,r} \wedge t_2 \in T^{-}_{u,r}$$

$$T^{+}_{u,r} := \{t \mid (u, r) \in X_{\text{train}} \wedge (u, r, t) \in Y\}, \quad T^{-}_{u,r} := \{t \mid (u, r) \in X_{\text{train}} \wedge (u, r, t) \notin Y\}$$



Tucker Decomposition Model

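$$\hat{Y} := \hat{C} \times_u \hat{U} \times_r \hat{R} \times_t \hat{T}$$

or equivalently:

$$\hat{y}_{u,r,t} = \sum_{\tilde{u}} \sum_{\tilde{r}} \sum_{\tilde{t}} \hat{c}_{\tilde{u},\tilde{r},\tilde{t}} \cdot \hat{u}_{u,\tilde{u}} \cdot \hat{r}_{r,\tilde{r}} \cdot \hat{t}_{t,\tilde{t}}$$

where the model parameters are:

$$\hat{C} \in \mathbb{R}^{k_U \times k_R \times k_T}, \quad \hat{U} \in \mathbb{R}^{|U| \times k_U}, \quad \hat{R} \in \mathbb{R}^{|R| \times k_R}, \quad \hat{T} \in \mathbb{R}^{|T| \times k_T}$$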
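
A minimal sketch of how a single score is computed from the Tucker factors, using nested lists instead of a tensor library. The function name and the numbers are hypothetical; learning the parameters is not shown.

```python
def tucker_score(core, u_vec, r_vec, t_vec):
    """Sum of core[a][b][c] * u_vec[a] * r_vec[b] * t_vec[c] over all a, b, c.

    core: k_U x k_R x k_T nested list; the vectors are the factor rows
    of one user, one resource, and one tag.
    """
    return sum(
        core[a][b][c] * u_vec[a] * r_vec[b] * t_vec[c]
        for a in range(len(u_vec))
        for b in range(len(r_vec))
        for c in range(len(t_vec))
    )


# 2 x 2 x 2 core tensor with two non-zero entries (made-up values).
core = [[[1, 0], [0, 0]], [[0, 0], [0, 2]]]
score = tucker_score(core, [1, 2], [3, 4], [5, 6])
```

The triple loop makes the k_U · k_R · k_T cost of one prediction visible.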



PITF: Pairwise Interaction Tensor Factorization

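PITF only models the two-way interactions between users and tags as well as between resources and tags:

$$a_{u,r,t} = \sum_{f=1}^{k} \hat{u}_{u,f} \cdot \hat{t}^{U}_{t,f} + \sum_{f=1}^{k} \hat{r}_{r,f} \cdot \hat{t}^{R}_{t,f}$$

where $\hat{U} \in \mathbb{R}^{|U| \times k}$, $\hat{R} \in \mathbb{R}^{|R| \times k}$, $\hat{T}^{U} \in \mathbb{R}^{|T| \times k}$ and $\hat{T}^{R} \in \mathbb{R}^{|T| \times k}$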
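
A minimal sketch of the PITF score for one (user, resource, tag) triple, given the four factor rows. The function name and the numbers are hypothetical; learning the factors is not shown.

```python
def pitf_score(u_vec, r_vec, t_user_vec, t_res_vec):
    """Two dot products: user-tag interaction plus resource-tag interaction."""
    user_part = sum(a * b for a, b in zip(u_vec, t_user_vec))
    res_part = sum(a * b for a, b in zip(r_vec, t_res_vec))
    return user_part + res_part


# k = 2 factors per entity (made-up values).
score = pitf_score([1, 2], [0, 1], [3, 4], [5, 6])
```

Each prediction costs 2k multiplications, compared with the triple sum over the core tensor in the Tucker model.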



Complexity

Learning Runtime Complexity

Method     Runtime
WA         $O(1)$
FolkRank   $O(1)$
RTF        $O(\text{iter} \cdot |X_{\text{train}}| \, |T|^2 \cdot k_U \cdot k_R \cdot k_T)$
PITF       $O(\text{iter} \cdot |X_{\text{train}}| \, |T|^2 \cdot 2k)$

Prediction Runtime Complexity

Method     Runtime
WA         $O(|T| \, |N_x| + |T| \log(n))$
FolkRank   $O(\text{iter} \cdot (|Y| + |U| + |R| + |T|) + |T| + |T| \log(n))$
RTF        $O(|T| \cdot k_U + k_R \cdot k_T \cdot k_T)$
PITF       $O(|T| \cdot 2k + |T| \log(n))$



Relation Rewarding

We can reward the best relation by a factor $c \in \mathbb{R}$.



Results Cross-Tagging



Tag Enrichment Approach

I Semantic mapping as an ontology matching problem.

I $P(A, B) \approx \frac{|[\![A]\!] \cap [\![B]\!]|}{|R|}$ [Doan et al. (2004)]

I Jaccard coefficient:

$$JS(A, B) := \frac{P(A \cap B)}{P(A \cup B)} = \frac{P(A, B)}{P(A, B) + P(A, \bar{B}) + P(\bar{A}, B)}$$
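
A minimal sketch of the Jaccard coefficient computed from extensions, where the extension of a tag or concept is taken to be the set of resources annotated with it. Because every joint probability is estimated as a count divided by |R|, the |R| cancels and only set sizes remain. Mapping each tag to its most similar concept (`best_concept`) is an assumption about how the similarity is used, and all names and data are hypothetical.

```python
def jaccard(ext_a, ext_b):
    """|A ∩ B| / |A ∪ B| over two sets of resources; 0.0 if both are empty."""
    union = ext_a | ext_b
    return len(ext_a & ext_b) / len(union) if union else 0.0


def best_concept(tag_ext, concept_exts):
    """Assumed mapping rule: pick the concept with the highest Jaccard score."""
    return max(concept_exts, key=lambda c: jaccard(tag_ext, concept_exts[c]))


# Made-up extensions: resources are artist ids.
concepts = {"electronica": {1, 2, 3, 4}, "punk": {7, 8}}
mapped = best_concept({1, 2, 3}, concepts)
```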



Ontology learning

I Frequent itemset mining for ontology learning [Marinho et al. 2008]².

² Algorithm proposed by Krisztian Buza, co-author of [Marinho et al. 2008]


Semantic mapping

tags mapped concepts

electro electronica

hip hop hip hop

chillout rock

old skool dance house

anything else but death heavy metal

post-hardcore emo

california punk

political punk

urban hip hop

60s stuff country-rock

relaxing folk rock

explorer experimental rock

rock en espanol latin pop



An Extract of Domain Expert Ontology

[Figure: extract of the domain expert ontology (Pajek graph) with the nodes heavy_metal, death_metal, doom_metal, black_metal, thrash, rap-metal, hair_metal, speed_metal, grindcore, and metal.]



An Extract of Learned Ontology

[Figure: extract of the learned ontology (Pajek graph). The nodes include user tags such as "technical death metal", "melodic death metal", "progressive metal", "nu metal", "symphonic metal", "bands i have seen live", "great lyrics", and "favs", alongside the ontology concepts heavy_metal, death, progressive_rock, doom_metal, death_metal, thrash, metal, speed_metal, and periods.]

