time to shop for valentine’s day: shopping occasions and...
TRANSCRIPT
Time to Shop for Valentine’s Day: Shopping Occasions and Sequential
Recommendation in E-commerceJianling Wang, Raphael Louca*, Diane Hu*, Caitlin Cellier*,
James Caverlee and Liangjie Hong*Texas A&M University
* Etsy Inc.
Recommender Systems usually:
● Capture users' intrinsic preference from their long-term behaviorpatterns.
● Infer users’ current needs by emphasizing recent actions.
However, intrinsic user behavior may be shifted by occasions, such asbirthdays, anniversaries, or gifting celebrations (Valentine's Day orMother's Day).
In E-commerce
In E-commerce
Often, these occasion-based purchases deviate from long-termpreferences and are not related to recent actions.
Idea: Let’s incorporate occasion signals!● Recommend more time or season-aware candidates, which may
alleviate the cold-start problem.● Reduce the noise in modeling users' intrinsic preferences.● Recommend relevant items to the user for upcoming reoccurring
occasions.
Global Occasions● Happen at the same time for a large number of users.● Encourage or lead to similar shopping decisions for crowds of users.● Examples: Christmas, Valentine’s Day, …
Personal Occasions● Happen at different times for different users.● Occur in a periodic and repeated pattern for a specific user.● Examples: Birthdays (for you or your friends) and anniversaries.
To Illustrate: Temporal Shopping TrendsUsers' shopping preferences are dynamic and can reflect reoccurring occasions (festivals, holidays, seasonal activities).
Valentine’s Day
Mother’s Day
Father’s Day
Thanksgiving Christmas
(a) (b)
Time-sensitive Most Popular V.S. General Most Popular
To Illustrate: From Personal PerspectiveThere are occasions that may re-occur within a certain period (e.g. annually or monthly) over the course of multiple years and trigger relevant purchase.
⇒ ⇒ traceable patterns in personal occasion signals.
Time Gap between Purchases for Wedding and Anniversary within a year. More than 50% of purchases for anniversary are near the date of wedding purchase within a time window less than 30 days.
Occasion-Aware Recommendation (OAR)Intrinsic Preference +
Personal Occasion Signals +
Global Occasion Signals
Our Proposal:
Occasion-Aware Recommendation (OAR)● Model the repeated personal occasion signals with attention layers.● Model the global occasion signals by memorizing the temporal trends
of shopping behaviors.● With a gating component, we balance the global and local effects of
different occasions. ● Experiments on Etsy and Amazon.
Attention Mechanisms● Query
● Key-Value Pairs
● Scoring Function - Calculating similarity/relationship between a Query & a Key
Keys ValuesQueries… ……
Attention Weights Outputs
q<latexit sha1_base64="BWkoXlOkYejmq6A3DcUbfxqunms=">AAAB8XicbVDLTgJBEOzFF+IL9ehlIjHxRHbRRI9ELx4xkUcEQmaHWZgwO7vO9BrJhr/w4kFjvPo33vwbB9iDgpV0UqnqTneXH0th0HW/ndzK6tr6Rn6zsLW9s7tX3D9omCjRjNdZJCPd8qnhUiheR4GSt2LNaehL3vRH11O/+ci1EZG6w3HMuyEdKBEIRtFK9x3kT+gH6cOkVyy5ZXcGsky8jJQgQ61X/Or0I5aEXCGT1Ji258bYTalGwSSfFDqJ4TFlIzrgbUsVDbnpprOLJ+TEKn0SRNqWQjJTf0+kNDRmHPq2M6Q4NIveVPzPaycYXHZToeIEuWLzRUEiCUZk+j7pC80ZyrEllGlhbyVsSDVlaEMq2BC8xZeXSaNS9s7KldvzUvUqiyMPR3AMp+DBBVThBmpQBwYKnuEV3hzjvDjvzse8NedkM4fwB87nDx2HkTQ=</latexit>
o<latexit sha1_base64="MdMM0CCP6gkS6I9FmEsJNlSQy9E=">AAAB8XicbVBNS8NAEJ34WetX1aOXxSJ4KkkV9Fj04rGC/cA2lM120y7dbMLuRCyh/8KLB0W8+m+8+W/ctjlo64OBx3szzMwLEikMuu63s7K6tr6xWdgqbu/s7u2XDg6bJk414w0Wy1i3A2q4FIo3UKDk7URzGgWSt4LRzdRvPXJtRKzucZxwP6IDJULBKFrpoYv8CYMwiye9UtmtuDOQZeLlpAw56r3SV7cfszTiCpmkxnQ8N0E/oxoFk3xS7KaGJ5SN6IB3LFU04sbPZhdPyKlV+iSMtS2FZKb+nshoZMw4CmxnRHFoFr2p+J/XSTG88jOhkhS5YvNFYSoJxmT6PukLzRnKsSWUaWFvJWxINWVoQyraELzFl5dJs1rxzivVu4ty7TqPowDHcAJn4MEl1OAW6tAABgqe4RXeHOO8OO/Ox7x1xclnjuAPnM8fGn2RMg==</latexit>
k1<latexit sha1_base64="vpeHugvKEkgkR/0j0altVfMTXNo=">AAAB83icbVBNS8NAEJ3Ur1q/qh69LBbBU0mqoMeiF48V7Ac0pWy2m3bpZhN2J2IJ/RtePCji1T/jzX/jts1BWx8MPN6bYWZekEhh0HW/ncLa+sbmVnG7tLO7t39QPjxqmTjVjDdZLGPdCajhUijeRIGSdxLNaRRI3g7GtzO//ci1EbF6wEnCexEdKhEKRtFKvo/8CYMwG0/7Xr9ccavuHGSVeDmpQI5Gv/zlD2KWRlwhk9SYrucm2MuoRsEkn5b81PCEsjEd8q6likbc9LL5zVNyZpUBCWNtSyGZq78nMhoZM4kC2xlRHJllbyb+53VTDK97mVBJilyxxaIwlQRjMguADITmDOXEEsq0sLcSNqKaMrQxlWwI3vLLq6RVq3oX1dr9ZaV+k8dRhBM4hXPw4ArqcAcNaAKDBJ7hFd6c1Hlx3p2PRWvByWeO4Q+czx9BnZHS</latexit>
k2<latexit sha1_base64="MYieos1SHUd3AQE445MyTQ0XbWo=">AAAB83icbVBNS8NAEN3Ur1q/qh69LBbBU0mqoMeiF48V7Ac0pWy2k3bpZhN2J2IJ/RtePCji1T/jzX/jts1BWx8MPN6bYWZekEhh0HW/ncLa+sbmVnG7tLO7t39QPjxqmTjVHJo8lrHuBMyAFAqaKFBCJ9HAokBCOxjfzvz2I2gjYvWAkwR6ERsqEQrO0Eq+j/CEQZiNp/1av1xxq+4cdJV4OamQHI1++csfxDyNQCGXzJiu5ybYy5hGwSVMS35qIGF8zIbQtVSxCEwvm988pWdWGdAw1rYU0rn6eyJjkTGTKLCdEcORWfZm4n9eN8XwupcJlaQIii8WhamkGNNZAHQgNHCUE0sY18LeSvmIacbRxlSyIXjLL6+SVq3qXVRr95eV+k0eR5GckFNyTjxyRerkjjRIk3CSkGfySt6c1Hlx3p2PRWvByWeOyR84nz9DIZHT</latexit>
k3<latexit sha1_base64="jAR3sPEfMoURVaK7hqHc6/mT8KA=">AAAB83icbVBNS8NAEN3Ur1q/qh69LBbBU0laQY9FLx4r2FZoStlsJ+3SzSbsTsQS+je8eFDEq3/Gm//GbZuDtj4YeLw3w8y8IJHCoOt+O4W19Y3NreJ2aWd3b/+gfHjUNnGqObR4LGP9EDADUihooUAJD4kGFgUSOsH4ZuZ3HkEbEat7nCTQi9hQiVBwhlbyfYQnDMJsPO3X++WKW3XnoKvEy0mF5Gj2y1/+IOZpBAq5ZMZ0PTfBXsY0Ci5hWvJTAwnjYzaErqWKRWB62fzmKT2zyoCGsbalkM7V3xMZi4yZRIHtjBiOzLI3E//zuimGV71MqCRFUHyxKEwlxZjOAqADoYGjnFjCuBb2VspHTDOONqaSDcFbfnmVtGtVr16t3V1UGtd5HEVyQk7JOfHIJWmQW9IkLcJJQp7JK3lzUufFeXc+Fq0FJ585Jn/gfP4ARKWR1A==</latexit>
kL<latexit sha1_base64="XeR3m7IUyoUIUJ0yLNPrDRZNyDQ=">AAAB83icbVA9SwNBEJ3zM8avqKXNYhCswl0UtAzaWFhEMB+QC2Fvs5cs2ds7dufEcORv2FgoYuufsfPfuEmu0MQHA4/3ZpiZFyRSGHTdb2dldW19Y7OwVdze2d3bLx0cNk2casYbLJaxbgfUcCkUb6BAyduJ5jQKJG8Fo5up33rk2ohYPeA44d2IDpQIBaNoJd9H/oRBmI0mvbteqexW3BnIMvFyUoYc9V7py+/HLI24QiapMR3PTbCbUY2CST4p+qnhCWUjOuAdSxWNuOlms5sn5NQqfRLG2pZCMlN/T2Q0MmYcBbYzojg0i95U/M/rpBhedTOhkhS5YvNFYSoJxmQaAOkLzRnKsSWUaWFvJWxINWVoYyraELzFl5dJs1rxzivV+4ty7TqPowDHcAJn4MEl1OAW6tAABgk8wyu8Oanz4rw7H/PWFSefOYI/cD5/AGqJke0=</latexit>
vL<latexit sha1_base64="I0VC8gBrQq8yWNyo+yFpgT4wz1Y=">AAAB83icbVBNS8NAEJ3Ur1q/qh69BIvgqSRV0GPRiwcPFewHNKFstpt26WYTdifFEvo3vHhQxKt/xpv/xm2bg7Y+GHi8N8PMvCARXKPjfFuFtfWNza3idmlnd2//oHx41NJxqihr0ljEqhMQzQSXrIkcBeskipEoEKwdjG5nfnvMlOaxfMRJwvyIDCQPOSVoJM9D9oRBmI2nvfteueJUnTnsVeLmpAI5Gr3yl9ePaRoxiVQQrbuuk6CfEYWcCjYtealmCaEjMmBdQyWJmPaz+c1T+8wofTuMlSmJ9lz9PZGRSOtJFJjOiOBQL3sz8T+vm2J47WdcJikySReLwlTYGNuzAOw+V4yimBhCqOLmVpsOiSIUTUwlE4K7/PIqadWq7kW19nBZqd/kcRThBE7hHFy4gjrcQQOaQCGBZ3iFNyu1Xqx362PRWrDymWP4A+vzB3tWkfg=</latexit>
v1<latexit sha1_base64="LBrO2AG9JVSYbXOTEEaiIcFSFcM=">AAAB83icbVBNS8NAEN34WetX1aOXxSJ4KkkV9Fj04rGC/YCmlM120i7dbMLupFhC/4YXD4p49c9489+4bXPQ1gcDj/dmmJkXJFIYdN1vZ219Y3Nru7BT3N3bPzgsHR03TZxqDg0ey1i3A2ZACgUNFCihnWhgUSChFYzuZn5rDNqIWD3iJIFuxAZKhIIztJLvIzxhEGbjac/rlcpuxZ2DrhIvJ2WSo94rffn9mKcRKOSSGdPx3AS7GdMouIRp0U8NJIyP2AA6lioWgelm85un9NwqfRrG2pZCOld/T2QsMmYSBbYzYjg0y95M/M/rpBjedDOhkhRB8cWiMJUUYzoLgPaFBo5yYgnjWthbKR8yzTjamIo2BG/55VXSrFa8y0r14apcu83jKJBTckYuiEeuSY3ckzppEE4S8kxeyZuTOi/Ou/OxaF1z8pkT8gfO5w9SapHd</latexit>
v2<latexit sha1_base64="aZYsu15SPJ5E+sMT1cDcJ3BeBvA=">AAAB83icbVBNS8NAEJ34WetX1aOXxSJ4KkkV9Fj04rGC/YCmlM120y7dbMLupFhC/4YXD4p49c9489+4bXPQ1gcDj/dmmJkXJFIYdN1vZ219Y3Nru7BT3N3bPzgsHR03TZxqxhsslrFuB9RwKRRvoEDJ24nmNAokbwWju5nfGnNtRKwecZLwbkQHSoSCUbSS7yN/wiDMxtNetVcquxV3DrJKvJyUIUe9V/ry+zFLI66QSWpMx3MT7GZUo2CST4t+anhC2YgOeMdSRSNuutn85ik5t0qfhLG2pZDM1d8TGY2MmUSB7YwoDs2yNxP/8zophjfdTKgkRa7YYlGYSoIxmQVA+kJzhnJiCWVa2FsJG1JNGdqYijYEb/nlVdKsVrzLSvXhqly7zeMowCmcwQV4cA01uIc6NIBBAs/wCm9O6rw4787HonXNyWdO4A+czx9T7pHe</latexit>
v3<latexit sha1_base64="22jywEjQYStxJ5VhI3BqGrGrjow=">AAAB83icbVBNS8NAEJ3Ur1q/qh69BIvgqSStoMeiF48V7Ac0oWy2m3bpZhN2J8US+je8eFDEq3/Gm//GbZuDtj4YeLw3w8y8IBFco+N8W4WNza3tneJuaW//4PCofHzS1nGqKGvRWMSqGxDNBJeshRwF6yaKkSgQrBOM7+Z+Z8KU5rF8xGnC/IgMJQ85JWgkz0P2hEGYTWb9er9ccarOAvY6cXNSgRzNfvnLG8Q0jZhEKojWPddJ0M+IQk4Fm5W8VLOE0DEZsp6hkkRM+9ni5pl9YZSBHcbKlER7of6eyEik9TQKTGdEcKRXvbn4n9dLMbzxMy6TFJmky0VhKmyM7XkA9oArRlFMDSFUcXOrTUdEEYomppIJwV19eZ20a1W3Xq09XFUat3kcRTiDc7gEF66hAffQhBZQSOAZXuHNSq0X6936WLYWrHzmFP7A+vwBVXKR3w==</latexit>
↵q1<latexit sha1_base64="KHLFA9dJh8UXBjlmeG4gSSqYavE=">AAAB8nicbVBNS8NAEN3Ur1q/qh69LBbBU0mqoMeiF48V7Ae0oUy2m3bpZhN3J0IJ/RlePCji1V/jzX/jts1BWx8MPN6bYWZekEhh0HW/ncLa+sbmVnG7tLO7t39QPjxqmTjVjDdZLGPdCcBwKRRvokDJO4nmEAWSt4Px7cxvP3FtRKwecJJwP4KhEqFggFbq9kAmI+hnj960X664VXcOukq8nFRIjka//NUbxCyNuEImwZiu5yboZ6BRMMmnpV5qeAJsDEPetVRBxI2fzU+e0jOrDGgYa1sK6Vz9PZFBZMwkCmxnBDgyy95M/M/rphhe+5lQSYpcscWiMJUUYzr7nw6E5gzlxBJgWthbKRuBBoY2pZINwVt+eZW0alXvolq7v6zUb/I4iuSEnJJz4pErUid3pEGahJGYPJNX8uag8+K8Ox+L1oKTzxyTP3A+fwBTE5FH</latexit>
↵q2<latexit sha1_base64="iTEZbT2BAe/Lm+3dV9kMoHumgPI=">AAAB8nicbVBNS8NAEJ3Ur1q/qh69LBbBU0mqoMeiF48V7Ae0oWy2m3bpZhN3J0IJ/RlePCji1V/jzX/jts1BWx8MPN6bYWZekEhh0HW/ncLa+sbmVnG7tLO7t39QPjxqmTjVjDdZLGPdCajhUijeRIGSdxLNaRRI3g7GtzO//cS1EbF6wEnC/YgOlQgFo2ilbo/KZET72WNt2i9X3Ko7B1klXk4qkKPRL3/1BjFLI66QSWpM13MT9DOqUTDJp6VeanhC2ZgOeddSRSNu/Gx+8pScWWVAwljbUkjm6u+JjEbGTKLAdkYUR2bZm4n/ed0Uw2s/EypJkSu2WBSmkmBMZv+TgdCcoZxYQpkW9lbCRlRThjalkg3BW355lbRqVe+iWru/rNRv8jiKcAKncA4eXEEd7qABTWAQwzO8wpuDzovz7nwsWgtOPnMMf+B8/gBUmJFI</latexit>
↵qL<latexit sha1_base64="kkjpS9t58amNksFaPDZ+g+H+llA=">AAAB8nicbVA9SwNBEJ3zM8avqKXNYRCswl0UtAzaWFhEMB9wOcLcZpMs2ds9d/eEcORn2FgoYuuvsfPfuEmu0MQHA4/3ZpiZFyWcaeN5387K6tr6xmZhq7i9s7u3Xzo4bGqZKkIbRHKp2hFqypmgDcMMp+1EUYwjTlvR6Gbqt56o0kyKBzNOaBjjQLA+I2isFHSQJ0PsZo93k26p7FW8Gdxl4uekDDnq3dJXpydJGlNhCEetA99LTJihMoxwOil2Uk0TJCMc0MBSgTHVYTY7eeKeWqXn9qWyJYw7U39PZBhrPY4j2xmjGepFbyr+5wWp6V+FGRNJaqgg80X9lLtGutP/3R5TlBg+tgSJYvZWlwxRITE2paINwV98eZk0qxX/vFK9vyjXrvM4CnAMJ3AGPlxCDW6hDg0gIOEZXuHNMc6L8+58zFtXnHzmCP7A+fwBfBqRYg==</latexit>
↵q3<latexit sha1_base64="HjnYOqDG+9mWsRnkxKGgT7/tGPs=">AAAB8nicbVBNS8NAEN3Ur1q/qh69LBbBU0laQY9FLx4r2FZoQ5lsN+3STTbuToQS+jO8eFDEq7/Gm//GbZuDtj4YeLw3w8y8IJHCoOt+O4W19Y3NreJ2aWd3b/+gfHjUNirVjLeYkko/BGC4FDFvoUDJHxLNIQok7wTjm5nfeeLaCBXf4yThfgTDWISCAVqp2wOZjKCfPdan/XLFrbpz0FXi5aRCcjT75a/eQLE04jEyCcZ0PTdBPwONgkk+LfVSwxNgYxjyrqUxRNz42fzkKT2zyoCGStuKkc7V3xMZRMZMosB2RoAjs+zNxP+8borhlZ+JOEmRx2yxKEwlRUVn/9OB0JyhnFgCTAt7K2Uj0MDQplSyIXjLL6+Sdq3q1au1u4tK4zqPo0hOyCk5Jx65JA1yS5qkRRhR5Jm8kjcHnRfn3flYtBacfOaY/IHz+QNWHZFJ</latexit>
Softm
ax
Intrinsic Preference ModelingModel users' dynamic intrinsic preferences based on the correlation between the most recent purchase and the personal historic purchases.
● Query = embedding of current item
● Keys, Values = sequence of previous items
Ref: Wang-Cheng Kang and Julian McAuley. 2018. Self-attentive sequential recommendation. In ICDM.
…
+ + +Item Embedding
Positional Embedding
+ … Query
Self-Attention
Intrinsic Preference Modeling
Shopping History
Intrinsic Preference
Personal Occasion ElicitationElicit personal occasion signals by tracing the user's previous shopping behavior in the neighboring days.
● Query = timestamp of interest
● (Key, Value) = timestamp embedding of previous purchase, corresponding item embedding
…
Item Embedding …
Timestamp Embedding
…
Query
Personal Occasion Next Shopping
Personal Occasion Elicitation
Shopping History
Global Occasion MemorizationMemorize the shopping behavior of the crowd under different global occasions.
● Query = current timestamp embedding
● (Key, Value) = (timestamp embedding, memory slot embedding)
Time Key Embedding
Memory Slots
Query
Next Shopping
…
… Global OccasionGlobal Occasion
Memorization
Global Table
Gating LayerUtilize an attention (gating) layer to balance a user's intrinsic preferences with different occasion signals for personalization.
…
Personal Occasion Next Shopping
Query
x
Predicted Preference
Global Occasion
Gated Output
User EmbeddingShopping History
Gating Layer
Intrinsic Preference
Experiment: Data
Etsy: November 2006 to December 2018
Amazon: May 1996 to July 2014
Dataset # Users # Items # Purchases Density Cutting Time
Etsy 118,668 80,214 5.3M 0.0561% 2018/1/1Amazon 84,191 100,946 1.0M 0.0124% 2013/8/1
Experiment: Metrics
● Normalized Discounted Cumulative Gain (NDCG@K)● Hit Rate (HR@K)● Mean Reciprocal Rank (MRR)
Experiment: Baselines
● Most Popular● MF-BPR: Matrix Factorization with Bayesian Personalized Ranking● Fossil: Fusing Similarity Models with Markov Chains● GRU4Rec+: Recurrent Neural Networks with Top-k Gains● TCN: A Simple Convolutional Generative Network for Next Item
Recommendation● HPMN: Lifelong Sequential Modeling with Hierarchical Periodic
Memory Network● SARec: Self-attentive sequential recommendation
ModelEtsy Amazon
NDCG HR MRR NDCG HR MRRK=5 K=10 K=5 K=10 K=5 K=10 K=5 K=10MP 0.1531 0.1919 0.2304 0.3511 0.1673 0.2129 0.2509 0.3020 0.4199 0.2195
MF-BPR 0.4519 0.5001 0.5947 0.7434 0.4376 0.2663 0.3012 0.3619 0.4698 0.2668Fossil 0.4946 0.5354 0.5511 0.7630 0.4746 0.2160 0.2483 0.2967 0.3969 0.2221TCN 0.5199 0.5726 0.6698 0.8059 0.5090 0.2632 0.3029 0.3664 0.4893 0.2650
GRU4Rec+ 0.5346 0.5771 0.6830 0.8136 0.5126 0.2763 0.3169 0.3828 0.5087 0.2770HPMN 0.5480 0.5883 0.6962 0.8201 0.5245 0.2820 0.3216 0.3881 0.5109 0.2819SARec 0.5665 0.6047 0.7102 0.8278 0.5433 0.3009 0.3385 0.4085 0.5251 0.2984OAR 0.6078* 0.6415* 0.7425* 0.8462* 0.5847* 0.3200* 0.3580* 0.4301* 0.5476* 0.3165*
Table 2: Comparison of Di�erent Models. ⇤ indicates that the improvement of the best result is statistically signi�cant com-pared with second best result with p < 0.01.
Model Etsy AmazonNDCG@5 MRR NDCG@5 MRR
Global (G) 0.1816 0.1953 0.2238 0.2294Intrinsic (I) 0.5665 0.5433 0.3009 0.2984Personal (P) 0.5791 0.5582 0.3069 0.3047
I + G 0.5885 0.5642 0.3099 0.3063I + P 0.5916 0.5677 0.3136 0.3108
Remove Gate 0.5859 0.5618 0.3074 0.3039OAR 0.6078 0.5847 0.3200 0.3165
Table 3: Ablation Test Results.
methods is over {Adam, Adagrad, SGD}. We also �ne-tune all themodel-speci�c hyperparameters and report the best performancein the following sections.
4.3 Model ComparisonWe summarize the best performance of all the baseline models andthe proposed model in Table 2. We can see that OAR achieves thebest performance under di�erent metrics in both datasets. It gains7.62% and 6.06% MRR improvement in Etsy and Amazon comparedwith the state-of-the-art.
Compared with the basic general MP, we can see that MF-BPRwhich represents users and items with static latent factors canachieve a 177.43% and 39.67% improvement on average in Etsy andAmazon. Then by introducing the Markov Chains to capture thetransition of users among di�erent items, we �nd that Fossil worksbetter than MF in Etsy but performs worse in Amazon. Presumablyit is because the Amazon data is extremely sparse and results in anunstable factorized Markov Chains component in Fossil.
Comparing the recent neural-based sequential models, we �ndthat GRU4Rec+ works slightly better than TCN, which is based ondilated CNN. And HPMN utilizing hierarchical multi-layer memorynetworks outperforms GRU4Rec+ in both data, which proves thatthere are periodic pattern in users’ shopping behaviors. However,HPMN model assumes that the period of shopping behavior isconstant for all the users along the time and thus lack of �exibilityto handle the real-world scenarios. We �nd that SARec, which isutilizing self-attention to model users’ intrinsic preference, workseven better than HPMN. This shows that attention mechanismsare a good �t for modeling sequential behaviors. And by carefullyeliciting the occasion signals and combining them with the intrinsic
Figure 7: Similarity between di�erent calendar days.
preferences, OAR achieves the best performance in the next-itemprediction via an accurate user model.
4.4 Evaluation of OARTo examine whether each component in OAR achieves its goaland to understand how it contributes to the recommendation, weanalyze their impacts with an ablation test (in Table 3).
The Global occasion component (G), in which we set up a certainnumber of memory slots to record the crowd behavior in di�erentoccasions, does not provide personalized recommendation individ-ually. It can outperform the general Most Popular (MP) model by17.17% and 4.89% in Etsy and Amazon, which demonstrates thatit can capture the temporal global occasion signals hidden in thecrowd behavior. Additionally, we can infer that the users in Etsyare more likely to follow the temporal global trends in shopping.Both Intrinsic and Personal components can provide personalizednext-item recommendation. In Intrinsic (I), it maps the most re-cent purchase to the items purchased before to infer the “relevant”items in the future. While in Personal (P), the main idea is to traceback to the previous behaviors in the related time periods. We cansee that P performs slightly better than I, which means that in e-commerce, it is important to predict the shopping occasion and paymore attention to the items purchased around similar occasionswhile inferring the next purchase. While combining the I and Gor I and P, we can see the joint models can improve each of theindividual components. Thus we �nd that in e-commerce platforms,
OAR achieves the best performance on both Etsy and Amazon
OAR vs Baselines
6%~8% improvement
ModelEtsy Amazon
NDCG HR MRR NDCG HR MRRK=5 K=10 K=5 K=10 K=5 K=10 K=5 K=10MP 0.1531 0.1919 0.2304 0.3511 0.1673 0.2129 0.2509 0.3020 0.4199 0.2195
MF-BPR 0.4519 0.5001 0.5947 0.7434 0.4376 0.2663 0.3012 0.3619 0.4698 0.2668Fossil 0.4946 0.5354 0.5511 0.7630 0.4746 0.2160 0.2483 0.2967 0.3969 0.2221TCN 0.5199 0.5726 0.6698 0.8059 0.5090 0.2632 0.3029 0.3664 0.4893 0.2650
GRU4Rec+ 0.5346 0.5771 0.6830 0.8136 0.5126 0.2763 0.3169 0.3828 0.5087 0.2770HPMN 0.5480 0.5883 0.6962 0.8201 0.5245 0.2820 0.3216 0.3881 0.5109 0.2819SARec 0.5665 0.6047 0.7102 0.8278 0.5433 0.3009 0.3385 0.4085 0.5251 0.2984OAR 0.6078* 0.6415* 0.7425* 0.8462* 0.5847* 0.3200* 0.3580* 0.4301* 0.5476* 0.3165*
Table 2: Comparison of Di�erent Models. ⇤ indicates that the improvement of the best result is statistically signi�cant com-pared with second best result with p < 0.01.
Model Etsy AmazonNDCG@5 MRR NDCG@5 MRR
Global (G) 0.1816 0.1953 0.2238 0.2294Intrinsic (I) 0.5665 0.5433 0.3009 0.2984Personal (P) 0.5791 0.5582 0.3069 0.3047
I + G 0.5885 0.5642 0.3099 0.3063I + P 0.5916 0.5677 0.3136 0.3108
Remove Gate 0.5859 0.5618 0.3074 0.3039OAR 0.6078 0.5847 0.3200 0.3165
Table 3: Ablation Test Results.
methods is over {Adam, Adagrad, SGD}. We also �ne-tune all themodel-speci�c hyperparameters and report the best performancein the following sections.
4.3 Model ComparisonWe summarize the best performance of all the baseline models andthe proposed model in Table 2. We can see that OAR achieves thebest performance under di�erent metrics in both datasets. It gains7.62% and 6.06% MRR improvement in Etsy and Amazon comparedwith the state-of-the-art.
Compared with the basic general MP, we can see that MF-BPRwhich represents users and items with static latent factors canachieve a 177.43% and 39.67% improvement on average in Etsy andAmazon. Then by introducing the Markov Chains to capture thetransition of users among di�erent items, we �nd that Fossil worksbetter than MF in Etsy but performs worse in Amazon. Presumablyit is because the Amazon data is extremely sparse and results in anunstable factorized Markov Chains component in Fossil.
Comparing the recent neural-based sequential models, we �ndthat GRU4Rec+ works slightly better than TCN, which is based ondilated CNN. And HPMN utilizing hierarchical multi-layer memorynetworks outperforms GRU4Rec+ in both data, which proves thatthere are periodic pattern in users’ shopping behaviors. However,HPMN model assumes that the period of shopping behavior isconstant for all the users along the time and thus lack of �exibilityto handle the real-world scenarios. We �nd that SARec, which isutilizing self-attention to model users’ intrinsic preference, workseven better than HPMN. This shows that attention mechanismsare a good �t for modeling sequential behaviors. And by carefullyeliciting the occasion signals and combining them with the intrinsic
Figure 7: Similarity between di�erent calendar days.
preferences, OAR achieves the best performance in the next-itemprediction via an accurate user model.
4.4 Evaluation of OARTo examine whether each component in OAR achieves its goaland to understand how it contributes to the recommendation, weanalyze their impacts with an ablation test (in Table 3).
The Global occasion component (G), in which we set up a certainnumber of memory slots to record the crowd behavior in di�erentoccasions, does not provide personalized recommendation individ-ually. It can outperform the general Most Popular (MP) model by17.17% and 4.89% in Etsy and Amazon, which demonstrates thatit can capture the temporal global occasion signals hidden in thecrowd behavior. Additionally, we can infer that the users in Etsyare more likely to follow the temporal global trends in shopping.Both Intrinsic and Personal components can provide personalizednext-item recommendation. In Intrinsic (I), it maps the most re-cent purchase to the items purchased before to infer the “relevant”items in the future. While in Personal (P), the main idea is to traceback to the previous behaviors in the related time periods. We cansee that P performs slightly better than I, which means that in e-commerce, it is important to predict the shopping occasion and paymore attention to the items purchased around similar occasionswhile inferring the next purchase. While combining the I and Gor I and P, we can see the joint models can improve each of theindividual components. Thus we �nd that in e-commerce platforms,
Impact of each component?The global occasion memorization component performs better than recommending general most popular items.
Impact of each component?Personal occasion component works better than the intrinsic preference component.
ModelEtsy Amazon
NDCG HR MRR NDCG HR MRRK=5 K=10 K=5 K=10 K=5 K=10 K=5 K=10MP 0.1531 0.1919 0.2304 0.3511 0.1673 0.2129 0.2509 0.3020 0.4199 0.2195
MF-BPR 0.4519 0.5001 0.5947 0.7434 0.4376 0.2663 0.3012 0.3619 0.4698 0.2668Fossil 0.4946 0.5354 0.5511 0.7630 0.4746 0.2160 0.2483 0.2967 0.3969 0.2221TCN 0.5199 0.5726 0.6698 0.8059 0.5090 0.2632 0.3029 0.3664 0.4893 0.2650
GRU4Rec+ 0.5346 0.5771 0.6830 0.8136 0.5126 0.2763 0.3169 0.3828 0.5087 0.2770HPMN 0.5480 0.5883 0.6962 0.8201 0.5245 0.2820 0.3216 0.3881 0.5109 0.2819SARec 0.5665 0.6047 0.7102 0.8278 0.5433 0.3009 0.3385 0.4085 0.5251 0.2984OAR 0.6078* 0.6415* 0.7425* 0.8462* 0.5847* 0.3200* 0.3580* 0.4301* 0.5476* 0.3165*
Table 2: Comparison of Di�erent Models. ⇤ indicates that the improvement of the best result is statistically signi�cant com-pared with second best result with p < 0.01.
Model Etsy AmazonNDCG@5 MRR NDCG@5 MRR
Global (G) 0.1816 0.1953 0.2238 0.2294Intrinsic (I) 0.5665 0.5433 0.3009 0.2984Personal (P) 0.5791 0.5582 0.3069 0.3047
I + G 0.5885 0.5642 0.3099 0.3063I + P 0.5916 0.5677 0.3136 0.3108
Remove Gate 0.5859 0.5618 0.3074 0.3039OAR 0.6078 0.5847 0.3200 0.3165
Table 3: Ablation Test Results.
methods is over {Adam, Adagrad, SGD}. We also �ne-tune all themodel-speci�c hyperparameters and report the best performancein the following sections.
4.3 Model ComparisonWe summarize the best performance of all the baseline models andthe proposed model in Table 2. We can see that OAR achieves thebest performance under di�erent metrics in both datasets. It gains7.62% and 6.06% MRR improvement in Etsy and Amazon comparedwith the state-of-the-art.
Compared with the basic general MP, we can see that MF-BPRwhich represents users and items with static latent factors canachieve a 177.43% and 39.67% improvement on average in Etsy andAmazon. Then by introducing the Markov Chains to capture thetransition of users among di�erent items, we �nd that Fossil worksbetter than MF in Etsy but performs worse in Amazon. Presumablyit is because the Amazon data is extremely sparse and results in anunstable factorized Markov Chains component in Fossil.
Comparing the recent neural-based sequential models, we �ndthat GRU4Rec+ works slightly better than TCN, which is based ondilated CNN. And HPMN utilizing hierarchical multi-layer memorynetworks outperforms GRU4Rec+ in both data, which proves thatthere are periodic pattern in users’ shopping behaviors. However,HPMN model assumes that the period of shopping behavior isconstant for all the users along the time and thus lack of �exibilityto handle the real-world scenarios. We �nd that SARec, which isutilizing self-attention to model users’ intrinsic preference, workseven better than HPMN. This shows that attention mechanismsare a good �t for modeling sequential behaviors. And by carefullyeliciting the occasion signals and combining them with the intrinsic
Figure 7: Similarity between di�erent calendar days.
preferences, OAR achieves the best performance in the next-itemprediction via an accurate user model.
4.4 Evaluation of OARTo examine whether each component in OAR achieves its goaland to understand how it contributes to the recommendation, weanalyze their impacts with an ablation test (in Table 3).
The Global occasion component (G), in which we set up a certainnumber of memory slots to record the crowd behavior in di�erentoccasions, does not provide personalized recommendation individ-ually. It can outperform the general Most Popular (MP) model by17.17% and 4.89% in Etsy and Amazon, which demonstrates thatit can capture the temporal global occasion signals hidden in thecrowd behavior. Additionally, we can infer that the users in Etsyare more likely to follow the temporal global trends in shopping.Both Intrinsic and Personal components can provide personalizednext-item recommendation. In Intrinsic (I), it maps the most re-cent purchase to the items purchased before to infer the “relevant”items in the future. While in Personal (P), the main idea is to traceback to the previous behaviors in the related time periods. We cansee that P performs slightly better than I, which means that in e-commerce, it is important to predict the shopping occasion and paymore attention to the items purchased around similar occasionswhile inferring the next purchase. While combining the I and Gor I and P, we can see the joint models can improve each of theindividual components. Thus we �nd that in e-commerce platforms,
Impact of each component?Replace the Gating layer with simple addition ⇒ The performance drops
ModelEtsy Amazon
NDCG HR MRR NDCG HR MRRK=5 K=10 K=5 K=10 K=5 K=10 K=5 K=10MP 0.1531 0.1919 0.2304 0.3511 0.1673 0.2129 0.2509 0.3020 0.4199 0.2195
MF-BPR 0.4519 0.5001 0.5947 0.7434 0.4376 0.2663 0.3012 0.3619 0.4698 0.2668Fossil 0.4946 0.5354 0.5511 0.7630 0.4746 0.2160 0.2483 0.2967 0.3969 0.2221TCN 0.5199 0.5726 0.6698 0.8059 0.5090 0.2632 0.3029 0.3664 0.4893 0.2650
GRU4Rec+ 0.5346 0.5771 0.6830 0.8136 0.5126 0.2763 0.3169 0.3828 0.5087 0.2770HPMN 0.5480 0.5883 0.6962 0.8201 0.5245 0.2820 0.3216 0.3881 0.5109 0.2819SARec 0.5665 0.6047 0.7102 0.8278 0.5433 0.3009 0.3385 0.4085 0.5251 0.2984OAR 0.6078* 0.6415* 0.7425* 0.8462* 0.5847* 0.3200* 0.3580* 0.4301* 0.5476* 0.3165*
Table 2: Comparison of Di�erent Models. ⇤ indicates that the improvement of the best result is statistically signi�cant com-pared with second best result with p < 0.01.
Model Etsy AmazonNDCG@5 MRR NDCG@5 MRR
Global (G) 0.1816 0.1953 0.2238 0.2294Intrinsic (I) 0.5665 0.5433 0.3009 0.2984Personal (P) 0.5791 0.5582 0.3069 0.3047
I + G 0.5885 0.5642 0.3099 0.3063I + P 0.5916 0.5677 0.3136 0.3108
Remove Gate 0.5859 0.5618 0.3074 0.3039OAR 0.6078 0.5847 0.3200 0.3165
Table 3: Ablation Test Results.
methods is over {Adam, Adagrad, SGD}. We also �ne-tune all themodel-speci�c hyperparameters and report the best performancein the following sections.
4.3 Model ComparisonWe summarize the best performance of all the baseline models andthe proposed model in Table 2. We can see that OAR achieves thebest performance under di�erent metrics in both datasets. It gains7.62% and 6.06% MRR improvement in Etsy and Amazon comparedwith the state-of-the-art.
Compared with the basic general MP, we can see that MF-BPRwhich represents users and items with static latent factors canachieve a 177.43% and 39.67% improvement on average in Etsy andAmazon. Then by introducing the Markov Chains to capture thetransition of users among di�erent items, we �nd that Fossil worksbetter than MF in Etsy but performs worse in Amazon. Presumablyit is because the Amazon data is extremely sparse and results in anunstable factorized Markov Chains component in Fossil.
Comparing the recent neural-based sequential models, we �ndthat GRU4Rec+ works slightly better than TCN, which is based ondilated CNN. And HPMN utilizing hierarchical multi-layer memorynetworks outperforms GRU4Rec+ in both data, which proves thatthere are periodic pattern in users’ shopping behaviors. However,HPMN model assumes that the period of shopping behavior isconstant for all the users along the time and thus lack of �exibilityto handle the real-world scenarios. We �nd that SARec, which isutilizing self-attention to model users’ intrinsic preference, workseven better than HPMN. This shows that attention mechanismsare a good �t for modeling sequential behaviors. And by carefullyeliciting the occasion signals and combining them with the intrinsic
Figure 7: Similarity between di�erent calendar days.
preferences, OAR achieves the best performance in the next-itemprediction via an accurate user model.
4.4 Evaluation of OARTo examine whether each component in OAR achieves its goaland to understand how it contributes to the recommendation, weanalyze their impacts with an ablation test (in Table 3).
The Global occasion component (G), in which we set up a certainnumber of memory slots to record the crowd behavior in di�erentoccasions, does not provide personalized recommendation individ-ually. It can outperform the general Most Popular (MP) model by17.17% and 4.89% in Etsy and Amazon, which demonstrates thatit can capture the temporal global occasion signals hidden in thecrowd behavior. Additionally, we can infer that the users in Etsyare more likely to follow the temporal global trends in shopping.Both Intrinsic and Personal components can provide personalizednext-item recommendation. In Intrinsic (I), it maps the most re-cent purchase to the items purchased before to infer the “relevant”items in the future. While in Personal (P), the main idea is to traceback to the previous behaviors in the related time periods. We cansee that P performs slightly better than I, which means that in e-commerce, it is important to predict the shopping occasion and paymore attention to the items purchased around similar occasionswhile inferring the next purchase. While combining the I and Gor I and P, we can see the joint models can improve each of theindividual components. Thus we �nd that in e-commerce platforms,
Visualization of Occasion-driven ShoppingPreferences for Occasion-related items are dynamics in a year.
Summary● Propose an occasion-aware sequential recommender● Model the repeated personal occasion signals with attention layers,
while modeling the global occasion signals by memorizing the temporal trends of shopping behaviors
● Experiments on Etsy and Amazon show the positive impact of incorporating occasion signals
● Next…
Thank you!