time to shop for valentine’s day: shopping occasions and...

26
Time to Shop for Valentine’s Day: Shopping Occasions and Sequential Recommendation in E-commerce Jianling Wang, Raphael Louca*, Diane Hu*, Caitlin Cellier*, James Caverlee and Liangjie Hong* Texas A&M University * Etsy Inc.

Upload: others

Post on 18-Jul-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Time to Shop for Valentine’s Day: Shopping Occasions and ...people.tamu.edu/~jwang713/slides/occasion_wsdm2020.pdf · To Illustrate: Temporal Shopping Trends Users' shopping preferences

Time to Shop for Valentine’s Day: Shopping Occasions and Sequential

Recommendation in E-commerceJianling Wang, Raphael Louca*, Diane Hu*, Caitlin Cellier*,

James Caverlee and Liangjie Hong*Texas A&M University

* Etsy Inc.

Page 2: Time to Shop for Valentine’s Day: Shopping Occasions and ...people.tamu.edu/~jwang713/slides/occasion_wsdm2020.pdf · To Illustrate: Temporal Shopping Trends Users' shopping preferences

Recommender Systems usually:

● Capture users' intrinsic preference from their long-term behaviorpatterns.

● Infer users’ current needs by emphasizing recent actions.

Page 3: Time to Shop for Valentine’s Day: Shopping Occasions and ...people.tamu.edu/~jwang713/slides/occasion_wsdm2020.pdf · To Illustrate: Temporal Shopping Trends Users' shopping preferences

However, intrinsic user behavior may be shifted by occasions, such asbirthdays, anniversaries, or gifting celebrations (Valentine's Day orMother's Day).

In E-commerce

Page 4: Time to Shop for Valentine’s Day: Shopping Occasions and ...people.tamu.edu/~jwang713/slides/occasion_wsdm2020.pdf · To Illustrate: Temporal Shopping Trends Users' shopping preferences

In E-commerce

Often, these occasion-based purchases deviate from long-termpreferences and are not related to recent actions.

Page 5: Time to Shop for Valentine’s Day: Shopping Occasions and ...people.tamu.edu/~jwang713/slides/occasion_wsdm2020.pdf · To Illustrate: Temporal Shopping Trends Users' shopping preferences

Idea: Let’s incorporate occasion signals!● Recommend more time or season-aware candidates, which may

alleviate the cold-start problem.● Reduce the noise in modeling users' intrinsic preferences.● Recommend relevant items to the user for upcoming reoccurring

occasions.

Page 6: Time to Shop for Valentine’s Day: Shopping Occasions and ...people.tamu.edu/~jwang713/slides/occasion_wsdm2020.pdf · To Illustrate: Temporal Shopping Trends Users' shopping preferences

Global Occasions● Happen at the same time for a large number of users.● Encourage or lead to similar shopping decisions for crowds of users.● Examples: Christmas, Valentine’s Day, …

Page 7: Time to Shop for Valentine’s Day: Shopping Occasions and ...people.tamu.edu/~jwang713/slides/occasion_wsdm2020.pdf · To Illustrate: Temporal Shopping Trends Users' shopping preferences

Personal Occasions● Happen at different times for different users.● Occur in a periodic and repeated pattern for a specific user.● Examples: Birthdays (for you or your friends) and anniversaries.

Page 8: Time to Shop for Valentine’s Day: Shopping Occasions and ...people.tamu.edu/~jwang713/slides/occasion_wsdm2020.pdf · To Illustrate: Temporal Shopping Trends Users' shopping preferences

To Illustrate: Temporal Shopping TrendsUsers' shopping preferences are dynamic and can reflect reoccurring occasions (festivals, holidays, seasonal activities).

Valentine’s Day

Mother’s Day

Father’s Day

Thanksgiving Christmas

(a) (b)

Time-sensitive Most Popular V.S. General Most Popular

Page 9: Time to Shop for Valentine’s Day: Shopping Occasions and ...people.tamu.edu/~jwang713/slides/occasion_wsdm2020.pdf · To Illustrate: Temporal Shopping Trends Users' shopping preferences

To Illustrate: From Personal PerspectiveThere are occasions that may re-occur within a certain period (e.g. annually or monthly) over the course of multiple years and trigger relevant purchase.

⇒ ⇒ traceable patterns in personal occasion signals.

Time Gap between Purchases for Wedding and Anniversary within a year. More than 50% of purchases for anniversary are near the date of wedding purchase within a time window less than 30 days.

Page 10: Time to Shop for Valentine’s Day: Shopping Occasions and ...people.tamu.edu/~jwang713/slides/occasion_wsdm2020.pdf · To Illustrate: Temporal Shopping Trends Users' shopping preferences

Occasion-Aware Recommendation (OAR)Intrinsic Preference +

Personal Occasion Signals +

Global Occasion Signals

Our Proposal:

Page 11: Time to Shop for Valentine’s Day: Shopping Occasions and ...people.tamu.edu/~jwang713/slides/occasion_wsdm2020.pdf · To Illustrate: Temporal Shopping Trends Users' shopping preferences

Occasion-Aware Recommendation (OAR)● Model the repeated personal occasion signals with attention layers.● Model the global occasion signals by memorizing the temporal trends

of shopping behaviors.● With a gating component, we balance the global and local effects of

different occasions. ● Experiments on Etsy and Amazon.

Page 12: Time to Shop for Valentine’s Day: Shopping Occasions and ...people.tamu.edu/~jwang713/slides/occasion_wsdm2020.pdf · To Illustrate: Temporal Shopping Trends Users' shopping preferences

Attention Mechanisms● Query

● Key-Value Pairs

● Scoring Function - Calculating similarity/relationship between a Query & a Key

Keys ValuesQueries… ……

Attention Weights Outputs

q<latexit sha1_base64="BWkoXlOkYejmq6A3DcUbfxqunms=">AAAB8XicbVDLTgJBEOzFF+IL9ehlIjHxRHbRRI9ELx4xkUcEQmaHWZgwO7vO9BrJhr/w4kFjvPo33vwbB9iDgpV0UqnqTneXH0th0HW/ndzK6tr6Rn6zsLW9s7tX3D9omCjRjNdZJCPd8qnhUiheR4GSt2LNaehL3vRH11O/+ci1EZG6w3HMuyEdKBEIRtFK9x3kT+gH6cOkVyy5ZXcGsky8jJQgQ61X/Or0I5aEXCGT1Ji258bYTalGwSSfFDqJ4TFlIzrgbUsVDbnpprOLJ+TEKn0SRNqWQjJTf0+kNDRmHPq2M6Q4NIveVPzPaycYXHZToeIEuWLzRUEiCUZk+j7pC80ZyrEllGlhbyVsSDVlaEMq2BC8xZeXSaNS9s7KldvzUvUqiyMPR3AMp+DBBVThBmpQBwYKnuEV3hzjvDjvzse8NedkM4fwB87nDx2HkTQ=</latexit>

o<latexit sha1_base64="MdMM0CCP6gkS6I9FmEsJNlSQy9E=">AAAB8XicbVBNS8NAEJ34WetX1aOXxSJ4KkkV9Fj04rGC/cA2lM120y7dbMLuRCyh/8KLB0W8+m+8+W/ctjlo64OBx3szzMwLEikMuu63s7K6tr6xWdgqbu/s7u2XDg6bJk414w0Wy1i3A2q4FIo3UKDk7URzGgWSt4LRzdRvPXJtRKzucZxwP6IDJULBKFrpoYv8CYMwiye9UtmtuDOQZeLlpAw56r3SV7cfszTiCpmkxnQ8N0E/oxoFk3xS7KaGJ5SN6IB3LFU04sbPZhdPyKlV+iSMtS2FZKb+nshoZMw4CmxnRHFoFr2p+J/XSTG88jOhkhS5YvNFYSoJxmT6PukLzRnKsSWUaWFvJWxINWVoQyraELzFl5dJs1rxzivVu4ty7TqPowDHcAJn4MEl1OAW6tAABgqe4RXeHOO8OO/Ox7x1xclnjuAPnM8fGn2RMg==</latexit>

k1<latexit sha1_base64="vpeHugvKEkgkR/0j0altVfMTXNo=">AAAB83icbVBNS8NAEJ3Ur1q/qh69LBbBU0mqoMeiF48V7Ac0pWy2m3bpZhN2J2IJ/RtePCji1T/jzX/jts1BWx8MPN6bYWZekEhh0HW/ncLa+sbmVnG7tLO7t39QPjxqmTjVjDdZLGPdCajhUijeRIGSdxLNaRRI3g7GtzO//ci1EbF6wEnCexEdKhEKRtFKvo/8CYMwG0/7Xr9ccavuHGSVeDmpQI5Gv/zlD2KWRlwhk9SYrucm2MuoRsEkn5b81PCEsjEd8q6likbc9LL5zVNyZpUBCWNtSyGZq78nMhoZM4kC2xlRHJllbyb+53VTDK97mVBJilyxxaIwlQRjMguADITmDOXEEsq0sLcSNqKaMrQxlWwI3vLLq6RVq3oX1dr9ZaV+k8dRhBM4hXPw4ArqcAcNaAKDBJ7hFd6c1Hlx3p2PRWvByWeO4Q+czx9BnZHS</latexit>

k2<latexit sha1_base64="MYieos1SHUd3AQE445MyTQ0XbWo=">AAAB83icbVBNS8NAEN3Ur1q/qh69LBbBU0mqoMeiF48V7Ac0pWy2k3bpZhN2J2IJ/RtePCji1T/jzX/jts1BWx8MPN6bYWZekEhh0HW/ncLa+sbmVnG7tLO7t39QPjxqmTjVHJo8lrHuBMyAFAqaKFBCJ9HAokBCOxjfzvz2I2gjYvWAkwR6ERsqEQrO0Eq+j/CEQZiNp/1av1xxq+4cdJV4OamQHI1++csfxDyNQCGXzJiu5ybYy5hGwSVMS35qIGF8zIbQtVSxCEwvm988pWdWGdAw1rYU0rn6eyJjkTGTKLCdEcORWfZm4n9eN8XwupcJlaQIii8WhamkGNNZAHQgNHCUE0sY18LeSvmIacbRxlSyIXjLL6+SVq3qXVRr95eV+k0eR5GckFNyTjxyRerkjjRIk3CSkGfySt6c1Hlx3p2PRWvByWeOyR84nz9DIZHT</latexit>

k3<latexit sha1_base64="jAR3sPEfMoURVaK7hqHc6/mT8KA=">AAAB83icbVBNS8NAEN3Ur1q/qh69LBbBU0laQY9FLx4r2FZoStlsJ+3SzSbsTsQS+je8eFDEq3/Gm//GbZuDtj4YeLw3w8y8IJHCoOt+O4W19Y3NreJ2aWd3b/+gfHjUNnGqObR4LGP9EDADUihooUAJD4kGFgUSOsH4ZuZ3HkEbEat7nCTQi9hQiVBwhlbyfYQnDMJsPO3X++WKW3XnoKvEy0mF5Gj2y1/+IOZpBAq5ZMZ0PTfBXsY0Ci5hWvJTAwnjYzaErqWKRWB62fzmKT2zyoCGsbalkM7V3xMZi4yZRIHtjBiOzLI3E//zuimGV71MqCRFUHyxKEwlxZjOAqADoYGjnFjCuBb2VspHTDOONqaSDcFbfnmVtGtVr16t3V1UGtd5HEVyQk7JOfHIJWmQW9IkLcJJQp7JK3lzUufFeXc+Fq0FJ585Jn/gfP4ARKWR1A==</latexit>

kL<latexit sha1_base64="XeR3m7IUyoUIUJ0yLNPrDRZNyDQ=">AAAB83icbVA9SwNBEJ3zM8avqKXNYhCswl0UtAzaWFhEMB+QC2Fvs5cs2ds7dufEcORv2FgoYuufsfPfuEmu0MQHA4/3ZpiZFyRSGHTdb2dldW19Y7OwVdze2d3bLx0cNk2casYbLJaxbgfUcCkUb6BAyduJ5jQKJG8Fo5up33rk2ohYPeA44d2IDpQIBaNoJd9H/oRBmI0mvbteqexW3BnIMvFyUoYc9V7py+/HLI24QiapMR3PTbCbUY2CST4p+qnhCWUjOuAdSxWNuOlms5sn5NQqfRLG2pZCMlN/T2Q0MmYcBbYzojg0i95U/M/rpBhedTOhkhS5YvNFYSoJxmQaAOkLzRnKsSWUaWFvJWxINWVoYyraELzFl5dJs1rxzivV+4ty7TqPowDHcAJn4MEl1OAW6tAABgk8wyu8Oanz4rw7H/PWFSefOYI/cD5/AGqJke0=</latexit>

vL<latexit sha1_base64="I0VC8gBrQq8yWNyo+yFpgT4wz1Y=">AAAB83icbVBNS8NAEJ3Ur1q/qh69BIvgqSRV0GPRiwcPFewHNKFstpt26WYTdifFEvo3vHhQxKt/xpv/xm2bg7Y+GHi8N8PMvCARXKPjfFuFtfWNza3idmlnd2//oHx41NJxqihr0ljEqhMQzQSXrIkcBeskipEoEKwdjG5nfnvMlOaxfMRJwvyIDCQPOSVoJM9D9oRBmI2nvfteueJUnTnsVeLmpAI5Gr3yl9ePaRoxiVQQrbuuk6CfEYWcCjYtealmCaEjMmBdQyWJmPaz+c1T+8wofTuMlSmJ9lz9PZGRSOtJFJjOiOBQL3sz8T+vm2J47WdcJikySReLwlTYGNuzAOw+V4yimBhCqOLmVpsOiSIUTUwlE4K7/PIqadWq7kW19nBZqd/kcRThBE7hHFy4gjrcQQOaQCGBZ3iFNyu1Xqx362PRWrDymWP4A+vzB3tWkfg=</latexit>

v1<latexit sha1_base64="LBrO2AG9JVSYbXOTEEaiIcFSFcM=">AAAB83icbVBNS8NAEN34WetX1aOXxSJ4KkkV9Fj04rGC/YCmlM120i7dbMLupFhC/4YXD4p49c9489+4bXPQ1gcDj/dmmJkXJFIYdN1vZ219Y3Nru7BT3N3bPzgsHR03TZxqDg0ey1i3A2ZACgUNFCihnWhgUSChFYzuZn5rDNqIWD3iJIFuxAZKhIIztJLvIzxhEGbjac/rlcpuxZ2DrhIvJ2WSo94rffn9mKcRKOSSGdPx3AS7GdMouIRp0U8NJIyP2AA6lioWgelm85un9NwqfRrG2pZCOld/T2QsMmYSBbYzYjg0y95M/M/rpBjedDOhkhRB8cWiMJUUYzoLgPaFBo5yYgnjWthbKR8yzTjamIo2BG/55VXSrFa8y0r14apcu83jKJBTckYuiEeuSY3ckzppEE4S8kxeyZuTOi/Ou/OxaF1z8pkT8gfO5w9SapHd</latexit>

v2<latexit sha1_base64="aZYsu15SPJ5E+sMT1cDcJ3BeBvA=">AAAB83icbVBNS8NAEJ34WetX1aOXxSJ4KkkV9Fj04rGC/YCmlM120y7dbMLupFhC/4YXD4p49c9489+4bXPQ1gcDj/dmmJkXJFIYdN1vZ219Y3Nru7BT3N3bPzgsHR03TZxqxhsslrFuB9RwKRRvoEDJ24nmNAokbwWju5nfGnNtRKwecZLwbkQHSoSCUbSS7yN/wiDMxtNetVcquxV3DrJKvJyUIUe9V/ry+zFLI66QSWpMx3MT7GZUo2CST4t+anhC2YgOeMdSRSNuutn85ik5t0qfhLG2pZDM1d8TGY2MmUSB7YwoDs2yNxP/8zophjfdTKgkRa7YYlGYSoIxmQVA+kJzhnJiCWVa2FsJG1JNGdqYijYEb/nlVdKsVrzLSvXhqly7zeMowCmcwQV4cA01uIc6NIBBAs/wCm9O6rw4787HonXNyWdO4A+czx9T7pHe</latexit>

v3<latexit sha1_base64="22jywEjQYStxJ5VhI3BqGrGrjow=">AAAB83icbVBNS8NAEJ3Ur1q/qh69BIvgqSStoMeiF48V7Ac0oWy2m3bpZhN2J8US+je8eFDEq3/Gm//GbZuDtj4YeLw3w8y8IBFco+N8W4WNza3tneJuaW//4PCofHzS1nGqKGvRWMSqGxDNBJeshRwF6yaKkSgQrBOM7+Z+Z8KU5rF8xGnC/IgMJQ85JWgkz0P2hEGYTWb9er9ccarOAvY6cXNSgRzNfvnLG8Q0jZhEKojWPddJ0M+IQk4Fm5W8VLOE0DEZsp6hkkRM+9ni5pl9YZSBHcbKlER7of6eyEik9TQKTGdEcKRXvbn4n9dLMbzxMy6TFJmky0VhKmyM7XkA9oArRlFMDSFUcXOrTUdEEYomppIJwV19eZ20a1W3Xq09XFUat3kcRTiDc7gEF66hAffQhBZQSOAZXuHNSq0X6936WLYWrHzmFP7A+vwBVXKR3w==</latexit>

↵q1<latexit sha1_base64="KHLFA9dJh8UXBjlmeG4gSSqYavE=">AAAB8nicbVBNS8NAEN3Ur1q/qh69LBbBU0mqoMeiF48V7Ae0oUy2m3bpZhN3J0IJ/RlePCji1V/jzX/jts1BWx8MPN6bYWZekEhh0HW/ncLa+sbmVnG7tLO7t39QPjxqmTjVjDdZLGPdCcBwKRRvokDJO4nmEAWSt4Px7cxvP3FtRKwecJJwP4KhEqFggFbq9kAmI+hnj960X664VXcOukq8nFRIjka//NUbxCyNuEImwZiu5yboZ6BRMMmnpV5qeAJsDEPetVRBxI2fzU+e0jOrDGgYa1sK6Vz9PZFBZMwkCmxnBDgyy95M/M/rphhe+5lQSYpcscWiMJUUYzr7nw6E5gzlxBJgWthbKRuBBoY2pZINwVt+eZW0alXvolq7v6zUb/I4iuSEnJJz4pErUid3pEGahJGYPJNX8uag8+K8Ox+L1oKTzxyTP3A+fwBTE5FH</latexit>

↵q2<latexit sha1_base64="iTEZbT2BAe/Lm+3dV9kMoHumgPI=">AAAB8nicbVBNS8NAEJ3Ur1q/qh69LBbBU0mqoMeiF48V7Ae0oWy2m3bpZhN3J0IJ/RlePCji1V/jzX/jts1BWx8MPN6bYWZekEhh0HW/ncLa+sbmVnG7tLO7t39QPjxqmTjVjDdZLGPdCajhUijeRIGSdxLNaRRI3g7GtzO//cS1EbF6wEnC/YgOlQgFo2ilbo/KZET72WNt2i9X3Ko7B1klXk4qkKPRL3/1BjFLI66QSWpM13MT9DOqUTDJp6VeanhC2ZgOeddSRSNu/Gx+8pScWWVAwljbUkjm6u+JjEbGTKLAdkYUR2bZm4n/ed0Uw2s/EypJkSu2WBSmkmBMZv+TgdCcoZxYQpkW9lbCRlRThjalkg3BW355lbRqVe+iWru/rNRv8jiKcAKncA4eXEEd7qABTWAQwzO8wpuDzovz7nwsWgtOPnMMf+B8/gBUmJFI</latexit>

↵qL<latexit sha1_base64="kkjpS9t58amNksFaPDZ+g+H+llA=">AAAB8nicbVA9SwNBEJ3zM8avqKXNYRCswl0UtAzaWFhEMB9wOcLcZpMs2ds9d/eEcORn2FgoYuuvsfPfuEmu0MQHA4/3ZpiZFyWcaeN5387K6tr6xmZhq7i9s7u3Xzo4bGqZKkIbRHKp2hFqypmgDcMMp+1EUYwjTlvR6Gbqt56o0kyKBzNOaBjjQLA+I2isFHSQJ0PsZo93k26p7FW8Gdxl4uekDDnq3dJXpydJGlNhCEetA99LTJihMoxwOil2Uk0TJCMc0MBSgTHVYTY7eeKeWqXn9qWyJYw7U39PZBhrPY4j2xmjGepFbyr+5wWp6V+FGRNJaqgg80X9lLtGutP/3R5TlBg+tgSJYvZWlwxRITE2paINwV98eZk0qxX/vFK9vyjXrvM4CnAMJ3AGPlxCDW6hDg0gIOEZXuHNMc6L8+58zFtXnHzmCP7A+fwBfBqRYg==</latexit>

↵q3<latexit sha1_base64="HjnYOqDG+9mWsRnkxKGgT7/tGPs=">AAAB8nicbVBNS8NAEN3Ur1q/qh69LBbBU0laQY9FLx4r2FZoQ5lsN+3STTbuToQS+jO8eFDEq7/Gm//GbZuDtj4YeLw3w8y8IJHCoOt+O4W19Y3NreJ2aWd3b/+gfHjUNirVjLeYkko/BGC4FDFvoUDJHxLNIQok7wTjm5nfeeLaCBXf4yThfgTDWISCAVqp2wOZjKCfPdan/XLFrbpz0FXi5aRCcjT75a/eQLE04jEyCcZ0PTdBPwONgkk+LfVSwxNgYxjyrqUxRNz42fzkKT2zyoCGStuKkc7V3xMZRMZMosB2RoAjs+zNxP+8borhlZ+JOEmRx2yxKEwlRUVn/9OB0JyhnFgCTAt7K2Uj0MDQplSyIXjLL6+Sdq3q1au1u4tK4zqPo0hOyCk5Jx65JA1yS5qkRRhR5Jm8kjcHnRfn3flYtBacfOaY/IHz+QNWHZFJ</latexit>

Softm

ax

Page 13: Time to Shop for Valentine’s Day: Shopping Occasions and ...people.tamu.edu/~jwang713/slides/occasion_wsdm2020.pdf · To Illustrate: Temporal Shopping Trends Users' shopping preferences

Intrinsic Preference ModelingModel users' dynamic intrinsic preferences based on the correlation between the most recent purchase and the personal historic purchases.

● Query = embedding of current item

● Keys, Values = sequence of previous items

Ref: Wang-Cheng Kang and Julian McAuley. 2018. Self-attentive sequential recommendation. In ICDM.

+ + +Item Embedding

Positional Embedding

+ … Query

Self-Attention

Intrinsic Preference Modeling

Shopping History

Intrinsic Preference

Page 14: Time to Shop for Valentine’s Day: Shopping Occasions and ...people.tamu.edu/~jwang713/slides/occasion_wsdm2020.pdf · To Illustrate: Temporal Shopping Trends Users' shopping preferences

Personal Occasion ElicitationElicit personal occasion signals by tracing the user's previous shopping behavior in the neighboring days.

● Query = timestamp of interest

● (Key, Value) = timestamp embedding of previous purchase, corresponding item embedding

Item Embedding …

Timestamp Embedding

Query

Personal Occasion Next Shopping

Personal Occasion Elicitation

Shopping History

Page 15: Time to Shop for Valentine’s Day: Shopping Occasions and ...people.tamu.edu/~jwang713/slides/occasion_wsdm2020.pdf · To Illustrate: Temporal Shopping Trends Users' shopping preferences

Global Occasion MemorizationMemorize the shopping behavior of the crowd under different global occasions.

● Query = current timestamp embedding

● (Key, Value) = (timestamp embedding, memory slot embedding)

Time Key Embedding

Memory Slots

Query

Next Shopping

… Global OccasionGlobal Occasion

Memorization

Global Table

Page 16: Time to Shop for Valentine’s Day: Shopping Occasions and ...people.tamu.edu/~jwang713/slides/occasion_wsdm2020.pdf · To Illustrate: Temporal Shopping Trends Users' shopping preferences

Gating LayerUtilize an attention (gating) layer to balance a user's intrinsic preferences with different occasion signals for personalization.

Personal Occasion Next Shopping

Query

x

Predicted Preference

Global Occasion

Gated Output

User EmbeddingShopping History

Gating Layer

Intrinsic Preference

Page 17: Time to Shop for Valentine’s Day: Shopping Occasions and ...people.tamu.edu/~jwang713/slides/occasion_wsdm2020.pdf · To Illustrate: Temporal Shopping Trends Users' shopping preferences

Experiment: Data

Etsy: November 2006 to December 2018

Amazon: May 1996 to July 2014

Dataset # Users # Items # Purchases Density Cutting Time

Etsy 118,668 80,214 5.3M 0.0561% 2018/1/1Amazon 84,191 100,946 1.0M 0.0124% 2013/8/1

Page 18: Time to Shop for Valentine’s Day: Shopping Occasions and ...people.tamu.edu/~jwang713/slides/occasion_wsdm2020.pdf · To Illustrate: Temporal Shopping Trends Users' shopping preferences

Experiment: Metrics

● Normalized Discounted Cumulative Gain (NDCG@K)● Hit Rate (HR@K)● Mean Reciprocal Rank (MRR)

Page 19: Time to Shop for Valentine’s Day: Shopping Occasions and ...people.tamu.edu/~jwang713/slides/occasion_wsdm2020.pdf · To Illustrate: Temporal Shopping Trends Users' shopping preferences

Experiment: Baselines

● Most Popular● MF-BPR: Matrix Factorization with Bayesian Personalized Ranking● Fossil: Fusing Similarity Models with Markov Chains● GRU4Rec+: Recurrent Neural Networks with Top-k Gains● TCN: A Simple Convolutional Generative Network for Next Item

Recommendation● HPMN: Lifelong Sequential Modeling with Hierarchical Periodic

Memory Network● SARec: Self-attentive sequential recommendation

Page 20: Time to Shop for Valentine’s Day: Shopping Occasions and ...people.tamu.edu/~jwang713/slides/occasion_wsdm2020.pdf · To Illustrate: Temporal Shopping Trends Users' shopping preferences

ModelEtsy Amazon

NDCG HR MRR NDCG HR MRRK=5 K=10 K=5 K=10 K=5 K=10 K=5 K=10MP 0.1531 0.1919 0.2304 0.3511 0.1673 0.2129 0.2509 0.3020 0.4199 0.2195

MF-BPR 0.4519 0.5001 0.5947 0.7434 0.4376 0.2663 0.3012 0.3619 0.4698 0.2668Fossil 0.4946 0.5354 0.5511 0.7630 0.4746 0.2160 0.2483 0.2967 0.3969 0.2221TCN 0.5199 0.5726 0.6698 0.8059 0.5090 0.2632 0.3029 0.3664 0.4893 0.2650

GRU4Rec+ 0.5346 0.5771 0.6830 0.8136 0.5126 0.2763 0.3169 0.3828 0.5087 0.2770HPMN 0.5480 0.5883 0.6962 0.8201 0.5245 0.2820 0.3216 0.3881 0.5109 0.2819SARec 0.5665 0.6047 0.7102 0.8278 0.5433 0.3009 0.3385 0.4085 0.5251 0.2984OAR 0.6078* 0.6415* 0.7425* 0.8462* 0.5847* 0.3200* 0.3580* 0.4301* 0.5476* 0.3165*

Table 2: Comparison of Di�erent Models. ⇤ indicates that the improvement of the best result is statistically signi�cant com-pared with second best result with p < 0.01.

Model Etsy AmazonNDCG@5 MRR NDCG@5 MRR

Global (G) 0.1816 0.1953 0.2238 0.2294Intrinsic (I) 0.5665 0.5433 0.3009 0.2984Personal (P) 0.5791 0.5582 0.3069 0.3047

I + G 0.5885 0.5642 0.3099 0.3063I + P 0.5916 0.5677 0.3136 0.3108

Remove Gate 0.5859 0.5618 0.3074 0.3039OAR 0.6078 0.5847 0.3200 0.3165

Table 3: Ablation Test Results.

methods is over {Adam, Adagrad, SGD}. We also �ne-tune all themodel-speci�c hyperparameters and report the best performancein the following sections.

4.3 Model ComparisonWe summarize the best performance of all the baseline models andthe proposed model in Table 2. We can see that OAR achieves thebest performance under di�erent metrics in both datasets. It gains7.62% and 6.06% MRR improvement in Etsy and Amazon comparedwith the state-of-the-art.

Compared with the basic general MP, we can see that MF-BPRwhich represents users and items with static latent factors canachieve a 177.43% and 39.67% improvement on average in Etsy andAmazon. Then by introducing the Markov Chains to capture thetransition of users among di�erent items, we �nd that Fossil worksbetter than MF in Etsy but performs worse in Amazon. Presumablyit is because the Amazon data is extremely sparse and results in anunstable factorized Markov Chains component in Fossil.

Comparing the recent neural-based sequential models, we �ndthat GRU4Rec+ works slightly better than TCN, which is based ondilated CNN. And HPMN utilizing hierarchical multi-layer memorynetworks outperforms GRU4Rec+ in both data, which proves thatthere are periodic pattern in users’ shopping behaviors. However,HPMN model assumes that the period of shopping behavior isconstant for all the users along the time and thus lack of �exibilityto handle the real-world scenarios. We �nd that SARec, which isutilizing self-attention to model users’ intrinsic preference, workseven better than HPMN. This shows that attention mechanismsare a good �t for modeling sequential behaviors. And by carefullyeliciting the occasion signals and combining them with the intrinsic

Figure 7: Similarity between di�erent calendar days.

preferences, OAR achieves the best performance in the next-itemprediction via an accurate user model.

4.4 Evaluation of OARTo examine whether each component in OAR achieves its goaland to understand how it contributes to the recommendation, weanalyze their impacts with an ablation test (in Table 3).

The Global occasion component (G), in which we set up a certainnumber of memory slots to record the crowd behavior in di�erentoccasions, does not provide personalized recommendation individ-ually. It can outperform the general Most Popular (MP) model by17.17% and 4.89% in Etsy and Amazon, which demonstrates thatit can capture the temporal global occasion signals hidden in thecrowd behavior. Additionally, we can infer that the users in Etsyare more likely to follow the temporal global trends in shopping.Both Intrinsic and Personal components can provide personalizednext-item recommendation. In Intrinsic (I), it maps the most re-cent purchase to the items purchased before to infer the “relevant”items in the future. While in Personal (P), the main idea is to traceback to the previous behaviors in the related time periods. We cansee that P performs slightly better than I, which means that in e-commerce, it is important to predict the shopping occasion and paymore attention to the items purchased around similar occasionswhile inferring the next purchase. While combining the I and Gor I and P, we can see the joint models can improve each of theindividual components. Thus we �nd that in e-commerce platforms,

OAR achieves the best performance on both Etsy and Amazon

OAR vs Baselines

6%~8% improvement

Page 21: Time to Shop for Valentine’s Day: Shopping Occasions and ...people.tamu.edu/~jwang713/slides/occasion_wsdm2020.pdf · To Illustrate: Temporal Shopping Trends Users' shopping preferences

ModelEtsy Amazon

NDCG HR MRR NDCG HR MRRK=5 K=10 K=5 K=10 K=5 K=10 K=5 K=10MP 0.1531 0.1919 0.2304 0.3511 0.1673 0.2129 0.2509 0.3020 0.4199 0.2195

MF-BPR 0.4519 0.5001 0.5947 0.7434 0.4376 0.2663 0.3012 0.3619 0.4698 0.2668Fossil 0.4946 0.5354 0.5511 0.7630 0.4746 0.2160 0.2483 0.2967 0.3969 0.2221TCN 0.5199 0.5726 0.6698 0.8059 0.5090 0.2632 0.3029 0.3664 0.4893 0.2650

GRU4Rec+ 0.5346 0.5771 0.6830 0.8136 0.5126 0.2763 0.3169 0.3828 0.5087 0.2770HPMN 0.5480 0.5883 0.6962 0.8201 0.5245 0.2820 0.3216 0.3881 0.5109 0.2819SARec 0.5665 0.6047 0.7102 0.8278 0.5433 0.3009 0.3385 0.4085 0.5251 0.2984OAR 0.6078* 0.6415* 0.7425* 0.8462* 0.5847* 0.3200* 0.3580* 0.4301* 0.5476* 0.3165*

Table 2: Comparison of Di�erent Models. ⇤ indicates that the improvement of the best result is statistically signi�cant com-pared with second best result with p < 0.01.

Model Etsy AmazonNDCG@5 MRR NDCG@5 MRR

Global (G) 0.1816 0.1953 0.2238 0.2294Intrinsic (I) 0.5665 0.5433 0.3009 0.2984Personal (P) 0.5791 0.5582 0.3069 0.3047

I + G 0.5885 0.5642 0.3099 0.3063I + P 0.5916 0.5677 0.3136 0.3108

Remove Gate 0.5859 0.5618 0.3074 0.3039OAR 0.6078 0.5847 0.3200 0.3165

Table 3: Ablation Test Results.

methods is over {Adam, Adagrad, SGD}. We also �ne-tune all themodel-speci�c hyperparameters and report the best performancein the following sections.

4.3 Model ComparisonWe summarize the best performance of all the baseline models andthe proposed model in Table 2. We can see that OAR achieves thebest performance under di�erent metrics in both datasets. It gains7.62% and 6.06% MRR improvement in Etsy and Amazon comparedwith the state-of-the-art.

Compared with the basic general MP, we can see that MF-BPRwhich represents users and items with static latent factors canachieve a 177.43% and 39.67% improvement on average in Etsy andAmazon. Then by introducing the Markov Chains to capture thetransition of users among di�erent items, we �nd that Fossil worksbetter than MF in Etsy but performs worse in Amazon. Presumablyit is because the Amazon data is extremely sparse and results in anunstable factorized Markov Chains component in Fossil.

Comparing the recent neural-based sequential models, we �ndthat GRU4Rec+ works slightly better than TCN, which is based ondilated CNN. And HPMN utilizing hierarchical multi-layer memorynetworks outperforms GRU4Rec+ in both data, which proves thatthere are periodic pattern in users’ shopping behaviors. However,HPMN model assumes that the period of shopping behavior isconstant for all the users along the time and thus lack of �exibilityto handle the real-world scenarios. We �nd that SARec, which isutilizing self-attention to model users’ intrinsic preference, workseven better than HPMN. This shows that attention mechanismsare a good �t for modeling sequential behaviors. And by carefullyeliciting the occasion signals and combining them with the intrinsic

Figure 7: Similarity between di�erent calendar days.

preferences, OAR achieves the best performance in the next-itemprediction via an accurate user model.

4.4 Evaluation of OARTo examine whether each component in OAR achieves its goaland to understand how it contributes to the recommendation, weanalyze their impacts with an ablation test (in Table 3).

The Global occasion component (G), in which we set up a certainnumber of memory slots to record the crowd behavior in di�erentoccasions, does not provide personalized recommendation individ-ually. It can outperform the general Most Popular (MP) model by17.17% and 4.89% in Etsy and Amazon, which demonstrates thatit can capture the temporal global occasion signals hidden in thecrowd behavior. Additionally, we can infer that the users in Etsyare more likely to follow the temporal global trends in shopping.Both Intrinsic and Personal components can provide personalizednext-item recommendation. In Intrinsic (I), it maps the most re-cent purchase to the items purchased before to infer the “relevant”items in the future. While in Personal (P), the main idea is to traceback to the previous behaviors in the related time periods. We cansee that P performs slightly better than I, which means that in e-commerce, it is important to predict the shopping occasion and paymore attention to the items purchased around similar occasionswhile inferring the next purchase. While combining the I and Gor I and P, we can see the joint models can improve each of theindividual components. Thus we �nd that in e-commerce platforms,

Impact of each component?The global occasion memorization component performs better than recommending general most popular items.

Page 22: Time to Shop for Valentine’s Day: Shopping Occasions and ...people.tamu.edu/~jwang713/slides/occasion_wsdm2020.pdf · To Illustrate: Temporal Shopping Trends Users' shopping preferences

Impact of each component?Personal occasion component works better than the intrinsic preference component.

ModelEtsy Amazon

NDCG HR MRR NDCG HR MRRK=5 K=10 K=5 K=10 K=5 K=10 K=5 K=10MP 0.1531 0.1919 0.2304 0.3511 0.1673 0.2129 0.2509 0.3020 0.4199 0.2195

MF-BPR 0.4519 0.5001 0.5947 0.7434 0.4376 0.2663 0.3012 0.3619 0.4698 0.2668Fossil 0.4946 0.5354 0.5511 0.7630 0.4746 0.2160 0.2483 0.2967 0.3969 0.2221TCN 0.5199 0.5726 0.6698 0.8059 0.5090 0.2632 0.3029 0.3664 0.4893 0.2650

GRU4Rec+ 0.5346 0.5771 0.6830 0.8136 0.5126 0.2763 0.3169 0.3828 0.5087 0.2770HPMN 0.5480 0.5883 0.6962 0.8201 0.5245 0.2820 0.3216 0.3881 0.5109 0.2819SARec 0.5665 0.6047 0.7102 0.8278 0.5433 0.3009 0.3385 0.4085 0.5251 0.2984OAR 0.6078* 0.6415* 0.7425* 0.8462* 0.5847* 0.3200* 0.3580* 0.4301* 0.5476* 0.3165*

Table 2: Comparison of Di�erent Models. ⇤ indicates that the improvement of the best result is statistically signi�cant com-pared with second best result with p < 0.01.

Model Etsy AmazonNDCG@5 MRR NDCG@5 MRR

Global (G) 0.1816 0.1953 0.2238 0.2294Intrinsic (I) 0.5665 0.5433 0.3009 0.2984Personal (P) 0.5791 0.5582 0.3069 0.3047

I + G 0.5885 0.5642 0.3099 0.3063I + P 0.5916 0.5677 0.3136 0.3108

Remove Gate 0.5859 0.5618 0.3074 0.3039OAR 0.6078 0.5847 0.3200 0.3165

Table 3: Ablation Test Results.

methods is over {Adam, Adagrad, SGD}. We also �ne-tune all themodel-speci�c hyperparameters and report the best performancein the following sections.

4.3 Model ComparisonWe summarize the best performance of all the baseline models andthe proposed model in Table 2. We can see that OAR achieves thebest performance under di�erent metrics in both datasets. It gains7.62% and 6.06% MRR improvement in Etsy and Amazon comparedwith the state-of-the-art.

Compared with the basic general MP, we can see that MF-BPRwhich represents users and items with static latent factors canachieve a 177.43% and 39.67% improvement on average in Etsy andAmazon. Then by introducing the Markov Chains to capture thetransition of users among di�erent items, we �nd that Fossil worksbetter than MF in Etsy but performs worse in Amazon. Presumablyit is because the Amazon data is extremely sparse and results in anunstable factorized Markov Chains component in Fossil.

Comparing the recent neural-based sequential models, we �ndthat GRU4Rec+ works slightly better than TCN, which is based ondilated CNN. And HPMN utilizing hierarchical multi-layer memorynetworks outperforms GRU4Rec+ in both data, which proves thatthere are periodic pattern in users’ shopping behaviors. However,HPMN model assumes that the period of shopping behavior isconstant for all the users along the time and thus lack of �exibilityto handle the real-world scenarios. We �nd that SARec, which isutilizing self-attention to model users’ intrinsic preference, workseven better than HPMN. This shows that attention mechanismsare a good �t for modeling sequential behaviors. And by carefullyeliciting the occasion signals and combining them with the intrinsic

Figure 7: Similarity between di�erent calendar days.

preferences, OAR achieves the best performance in the next-itemprediction via an accurate user model.

4.4 Evaluation of OARTo examine whether each component in OAR achieves its goaland to understand how it contributes to the recommendation, weanalyze their impacts with an ablation test (in Table 3).

The Global occasion component (G), in which we set up a certainnumber of memory slots to record the crowd behavior in di�erentoccasions, does not provide personalized recommendation individ-ually. It can outperform the general Most Popular (MP) model by17.17% and 4.89% in Etsy and Amazon, which demonstrates thatit can capture the temporal global occasion signals hidden in thecrowd behavior. Additionally, we can infer that the users in Etsyare more likely to follow the temporal global trends in shopping.Both Intrinsic and Personal components can provide personalizednext-item recommendation. In Intrinsic (I), it maps the most re-cent purchase to the items purchased before to infer the “relevant”items in the future. While in Personal (P), the main idea is to traceback to the previous behaviors in the related time periods. We cansee that P performs slightly better than I, which means that in e-commerce, it is important to predict the shopping occasion and paymore attention to the items purchased around similar occasionswhile inferring the next purchase. While combining the I and Gor I and P, we can see the joint models can improve each of theindividual components. Thus we �nd that in e-commerce platforms,

Page 23: Time to Shop for Valentine’s Day: Shopping Occasions and ...people.tamu.edu/~jwang713/slides/occasion_wsdm2020.pdf · To Illustrate: Temporal Shopping Trends Users' shopping preferences

Impact of each component?Replace the Gating layer with simple addition ⇒ The performance drops

ModelEtsy Amazon

NDCG HR MRR NDCG HR MRRK=5 K=10 K=5 K=10 K=5 K=10 K=5 K=10MP 0.1531 0.1919 0.2304 0.3511 0.1673 0.2129 0.2509 0.3020 0.4199 0.2195

MF-BPR 0.4519 0.5001 0.5947 0.7434 0.4376 0.2663 0.3012 0.3619 0.4698 0.2668Fossil 0.4946 0.5354 0.5511 0.7630 0.4746 0.2160 0.2483 0.2967 0.3969 0.2221TCN 0.5199 0.5726 0.6698 0.8059 0.5090 0.2632 0.3029 0.3664 0.4893 0.2650

GRU4Rec+ 0.5346 0.5771 0.6830 0.8136 0.5126 0.2763 0.3169 0.3828 0.5087 0.2770HPMN 0.5480 0.5883 0.6962 0.8201 0.5245 0.2820 0.3216 0.3881 0.5109 0.2819SARec 0.5665 0.6047 0.7102 0.8278 0.5433 0.3009 0.3385 0.4085 0.5251 0.2984OAR 0.6078* 0.6415* 0.7425* 0.8462* 0.5847* 0.3200* 0.3580* 0.4301* 0.5476* 0.3165*

Table 2: Comparison of Di�erent Models. ⇤ indicates that the improvement of the best result is statistically signi�cant com-pared with second best result with p < 0.01.

Model Etsy AmazonNDCG@5 MRR NDCG@5 MRR

Global (G) 0.1816 0.1953 0.2238 0.2294Intrinsic (I) 0.5665 0.5433 0.3009 0.2984Personal (P) 0.5791 0.5582 0.3069 0.3047

I + G 0.5885 0.5642 0.3099 0.3063I + P 0.5916 0.5677 0.3136 0.3108

Remove Gate 0.5859 0.5618 0.3074 0.3039OAR 0.6078 0.5847 0.3200 0.3165

Table 3: Ablation Test Results.

methods is over {Adam, Adagrad, SGD}. We also �ne-tune all themodel-speci�c hyperparameters and report the best performancein the following sections.

4.3 Model ComparisonWe summarize the best performance of all the baseline models andthe proposed model in Table 2. We can see that OAR achieves thebest performance under di�erent metrics in both datasets. It gains7.62% and 6.06% MRR improvement in Etsy and Amazon comparedwith the state-of-the-art.

Compared with the basic general MP, we can see that MF-BPRwhich represents users and items with static latent factors canachieve a 177.43% and 39.67% improvement on average in Etsy andAmazon. Then by introducing the Markov Chains to capture thetransition of users among di�erent items, we �nd that Fossil worksbetter than MF in Etsy but performs worse in Amazon. Presumablyit is because the Amazon data is extremely sparse and results in anunstable factorized Markov Chains component in Fossil.

Comparing the recent neural-based sequential models, we �ndthat GRU4Rec+ works slightly better than TCN, which is based ondilated CNN. And HPMN utilizing hierarchical multi-layer memorynetworks outperforms GRU4Rec+ in both data, which proves thatthere are periodic pattern in users’ shopping behaviors. However,HPMN model assumes that the period of shopping behavior isconstant for all the users along the time and thus lack of �exibilityto handle the real-world scenarios. We �nd that SARec, which isutilizing self-attention to model users’ intrinsic preference, workseven better than HPMN. This shows that attention mechanismsare a good �t for modeling sequential behaviors. And by carefullyeliciting the occasion signals and combining them with the intrinsic

Figure 7: Similarity between di�erent calendar days.

preferences, OAR achieves the best performance in the next-itemprediction via an accurate user model.

4.4 Evaluation of OARTo examine whether each component in OAR achieves its goaland to understand how it contributes to the recommendation, weanalyze their impacts with an ablation test (in Table 3).

The Global occasion component (G), in which we set up a certainnumber of memory slots to record the crowd behavior in di�erentoccasions, does not provide personalized recommendation individ-ually. It can outperform the general Most Popular (MP) model by17.17% and 4.89% in Etsy and Amazon, which demonstrates thatit can capture the temporal global occasion signals hidden in thecrowd behavior. Additionally, we can infer that the users in Etsyare more likely to follow the temporal global trends in shopping.Both Intrinsic and Personal components can provide personalizednext-item recommendation. In Intrinsic (I), it maps the most re-cent purchase to the items purchased before to infer the “relevant”items in the future. While in Personal (P), the main idea is to traceback to the previous behaviors in the related time periods. We cansee that P performs slightly better than I, which means that in e-commerce, it is important to predict the shopping occasion and paymore attention to the items purchased around similar occasionswhile inferring the next purchase. While combining the I and Gor I and P, we can see the joint models can improve each of theindividual components. Thus we �nd that in e-commerce platforms,

Page 24: Time to Shop for Valentine’s Day: Shopping Occasions and ...people.tamu.edu/~jwang713/slides/occasion_wsdm2020.pdf · To Illustrate: Temporal Shopping Trends Users' shopping preferences

Visualization of Occasion-driven ShoppingPreferences for Occasion-related items are dynamics in a year.

Page 25: Time to Shop for Valentine’s Day: Shopping Occasions and ...people.tamu.edu/~jwang713/slides/occasion_wsdm2020.pdf · To Illustrate: Temporal Shopping Trends Users' shopping preferences

Summary● Propose an occasion-aware sequential recommender● Model the repeated personal occasion signals with attention layers,

while modeling the global occasion signals by memorizing the temporal trends of shopping behaviors

● Experiments on Etsy and Amazon show the positive impact of incorporating occasion signals

● Next…

Page 26: Time to Shop for Valentine’s Day: Shopping Occasions and ...people.tamu.edu/~jwang713/slides/occasion_wsdm2020.pdf · To Illustrate: Temporal Shopping Trends Users' shopping preferences

Thank you!