modeling the interest-forgetting curve for music recommendation · 2020-05-01 · forgetting curve...

1
Music recommendation plays a key role in our daily lives as well as in the multimedia industry. This paper adapts the memory forgetting curve to model the human interest- forgetting curve for music recommendations based on the observation of recency effects in people’s listening to music. Two music recommendation methods are proposed using this model with respect to the sequence-based and the IFC-based transition probabilities, respectively. We also bring forward a learning method to approximate the global optimal or personalized interest-forgetting speed(s). The experimental results show that our methods can significantly improve the accuracy in music recommendations. Meanwhile, the IFC-based method outperforms the sequence-based method when recommendation list is short at each time. Abstract Given : Song set = { 1 ,…, || }, User set = { 1 ,…, || }, Row-normalized song transition probability matrix ( , )≥0, Playlist set of user , = { 1 ,…, } (Kleene closure of ), Current playlist (not expired) of user , . Recommend Top-K songs by ranking: = argmax Pr( | , , ) Pr , , = Pr | ( , ) Candidate set : = { |∃ , >0∧ } Neighborhood-normalized transition probability: , = , , Music Recommendation = max ∈ℝ ≥0 1 | | Pr \{ , , ) = max ∈ℝ ≥0 1 | | ∈\{ } Pr \{ ) ( , ) Maximize to obtain the optimized global , or personalized for each individual user . Learning Interest Forgetting Speed Memory retention as time elapses: = , , strength of memory; , elapsed time. Interest retention as time step increases: = , , (personalized) interest forgetting speed. Interest-Forgetting Curve Prior Probabilities: Pr = −( ( )) −( ( )) [email protected], [email protected], [email protected] Jun CHEN, Chaokun WANG, Jianmin WANG Modeling the Interest-Forgetting Curve for Music Recommendation School of Software, Tsinghua University, Beijing 100084, P.R. China Fig.2 Users’s interest on a certain item loses as time elapses. Transition Probabilities (1) Sequence-based: , = { , } { } , { , } is a subsequence of playlist . is the indicator function, which returns 1 when is satisfied, othewise, returns 0. (2) IFC-based: , = −( −( )) { , } −( −( )) { , } , ( ) represents the position index of in . Modeling IFC in Recommendation Fig.1 Recommendation illustration. Music A, F, B, G are in the current playlist. Music C, E, H are candidates due to reachability. Music J and D are out of consideration for recommendation at this time. Evaluations Fig.3 Accuracy comparison with baselines. Neighborhood method (NH), Sequential Pattern method with different supports and window sizes (SP-*). Fig.4 Comparison on the selection of transition probability expression. Fig.5 Distribution of playlist length in the training set and the test set. Fig.6 Accuracy under different groups of playlist length.

Upload: others

Post on 28-Jul-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Modeling the Interest-Forgetting Curve for Music Recommendation · 2020-05-01 · forgetting curve for music recommendations based on the observation of recency effects in people’s

Music recommendation plays a key role in our

daily lives as well as in the multimedia

industry. This paper adapts the memory

forgetting curve to model the human interest-

forgetting curve for music recommendations

based on the observation of recency effects in

people’s listening to music. Two music

recommendation methods are proposed using

this model with respect to the sequence-based

and the IFC-based transition probabilities,

respectively. We also bring forward a learning

method to approximate the global optimal or

personalized interest-forgetting speed(s). The

experimental results show that our methods

can significantly improve the accuracy in music

recommendations. Meanwhile, the IFC-based

method outperforms the sequence-based

method when recommendation list is short at

each time.

Abstract

Given: Song set 𝒮 = {𝑠1, … , 𝑠|𝒮|},

User set 𝒰 = {𝑢1, … , 𝑢|𝒰|},

Row-normalized song transition probability

matrix 𝑀(𝑠𝑖 , 𝑠𝑗) ≥ 0,

Playlist set of user 𝑢, 𝒫𝑢 = {𝑝1𝑢, … , 𝑝𝑙𝑢

𝑢 }

𝑝𝑖𝑢 ∈ 𝒮∗ (Kleene closure of 𝒮),

Current playlist (not expired) of user 𝑢, 𝑝𝑢.

Recommend Top-K songs by ranking:

𝒮𝑟𝑒𝑐 = argmax𝑠𝑖∈𝒮Pr(𝑠𝑖| 𝑝

𝑢,𝒫𝑢, 𝑀)

Pr 𝑠𝑖 𝑝𝑢,𝒫𝑢, 𝑀 =

𝑠𝑗∈ 𝑝𝑢

Pr 𝑠𝑗| 𝑝𝑢 𝑀(𝑠𝑗 , 𝑠𝑖)

Candidate set:

𝒞 𝑝𝑢 = {𝑠𝑥|∃𝑠𝑗 ∈ 𝑝𝑢 ∧ 𝑀 𝑠𝑗 , 𝑠𝑖 > 0 ∧ 𝑠𝑥 ∉ 𝑝

𝑢}

Neighborhood-normalized transition probability:

𝑀𝒞 𝑝𝑢𝑠𝑗 , 𝑠𝑖 =

𝑀 𝑠𝑗 , 𝑠𝑖

𝑠𝑥∈𝒞 𝑝𝑢𝑀 𝑠𝑗 , 𝑠𝑥

Music Recommendation

𝐺 𝛼 = max𝛼∈ℝ≥0

1

|𝒫𝑡𝑟𝑎𝑖𝑛|

𝑝∈𝒫𝑡𝑟𝑎𝑖𝑛

Pr 𝑠𝑝 𝑝\{ 𝑠𝑝 , 𝒫𝑡𝑟𝑎𝑖𝑛, 𝑀𝒞𝑝)

= max𝛼∈ℝ≥0

1

|𝒫𝑡𝑟𝑎𝑖𝑛|

𝑝∈𝒫𝑡𝑟𝑎𝑖𝑛

𝑠𝑗∈𝑝\{̂̂̂̂̂̂̂̂̂ 𝑠𝑝}

Pr 𝑠𝑗 𝑝\{ 𝑠𝑝 )𝑀𝒞𝑝(𝑠𝑗 ,

𝑠𝑝)

Maximize 𝐺 𝛼 to obtain the optimized global 𝛼, or

personalized 𝛼𝑢 for each individual user 𝑢.

Learning Interest Forgetting Speed

Memory retention as time elapses:

𝑅 = 𝑒−𝑡

𝑆,

𝑆, strength of memory; 𝑡, elapsed time.

Interest retention as time step increases:

𝑅 = 𝑒−𝛼𝑡,

𝛼, (personalized) interest forgetting speed.

Interest-Forgetting Curve

Prior Probabilities:

Pr 𝑠𝑗 𝑝𝑢 =

𝑒−𝛼( 𝑝𝑢 − 𝑝𝑢(𝑠𝑗))

𝑠𝑥∈ 𝑝𝑢 𝑒−𝛼( 𝑝𝑢 − 𝑝𝑢(𝑠𝑥))

[email protected], [email protected], [email protected]

Jun CHEN, Chaokun WANG, Jianmin WANG

Modeling the Interest-Forgetting Curve for Music Recommendation

School of Software, Tsinghua University, Beijing 100084, P.R. China

Fig.2 Users’s interest on a certain item loses as time elapses.

Transition Probabilities

(1) Sequence-based:

𝑀𝑆𝐸𝑄 𝑠𝑖 , 𝑠𝑗 = 𝑢∈𝒰 𝑝∈𝒫𝑢 𝕀{𝑠𝑖,𝑠𝑗}𝑠𝑒𝑞⊆𝑝

𝑢∈𝒰 𝑝∈𝒫𝑢 𝕀{𝑠𝑖}𝑠𝑒𝑞⊆𝑝,

{𝑠𝑖 , 𝑠𝑗}𝑠𝑒𝑞 is a subsequence of playlist 𝑝. 𝕀𝑐𝑜𝑛𝑑 is the

indicator function, which returns 1 when 𝑐𝑜𝑛𝑑 is

satisfied, othewise, returns 0.

(2) IFC-based:

𝑀𝐼𝐹𝐶 𝑠𝑖 , 𝑠𝑗 = 𝑢∈𝒰 𝑝∈𝒫𝑢 𝑒

−𝛼(𝑝 𝑠𝑗 −𝑝(𝑠𝑖))𝕀{𝑠𝑖,𝑠𝑗}𝑠𝑒𝑞⊆𝑝

𝑠𝑥 𝑢∈𝒰 𝑝∈𝒫𝑢 𝑒−𝛼(𝑝 𝑠𝑥 −𝑝(𝑠𝑖))𝕀{𝑠𝑖,𝑠𝑥}𝑠𝑒𝑞⊆𝑝

,

𝑝(𝑠𝑖) represents the position index of 𝑠𝑖 in 𝑝.

Modeling IFC in Recommendation

Fig.1 Recommendation illustration. Music A, F, B, G are in the

current playlist. Music C, E, H are candidates due to reachability.

Music J and D are out of consideration for recommendation at

this time.

Evaluations

Fig.3 Accuracy comparison with

baselines. Neighborhood method

(NH), Sequential Pattern

method with different supports

and window sizes (SP-*).

Fig.4 Comparison on the

selection of transition

probability expression.

Fig.5 Distribution of playlist

length in the training set and

the test set.

Fig.6 Accuracy under different

groups of playlist length.