tech-econ

Collaborative Filtering & Matrix Factorization

Recommend based on similar user preferences

2009 11142 cited

Matrix Factorization Techniques for Recommender Systems

Yehuda Koren, Robert Bell, Chris Volinsky

The definitive Netflix Prize paper establishing matrix factorization as the dominant CF paradigm.

2008 3154 cited

Collaborative Filtering for Implicit Feedback Datasets

Yifan Hu, Yehuda Koren, Chris Volinsky

Foundational paper for implicit feedback (clicks, views); introduces confidence-weighted MF with ALS optimization.

2009 4304 cited

BPR: Bayesian Personalized Ranking from Implicit Feedback

Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, Lars Schmidt-Thieme

Standard pairwise ranking loss for implicit feedback; BPR-OPT criterion ubiquitous in modern training.

2008 3863 cited

Factorization Meets the Neighborhood

Yehuda Koren

Combines neighborhood and latent factor models; integrates user/item biases; essential for blending approaches.

2007 5900 cited

Probabilistic Matrix Factorization

Ruslan Salakhutdinov, Andriy Mnih

Introduced probabilistic treatment of MF that scales linearly with observations; handles Netflix-scale sparsity with automatic regularization via adaptive priors.

2010 2800 cited

Factorization Machines

Steffen Rendle

General predictor combining SVM flexibility with factorization model power; models all pairwise feature interactions in linear time. Subsumes SVD++, PITF, and specialized models.

2009 1600 cited

Collaborative Filtering with Temporal Dynamics

Yehuda Koren

TimeSVD++ explicitly models drifting user preferences, item popularity evolution, and time-varying biases. Core component of Netflix Prize winning solution.

Title	Authors	Year	Citations
Matrix Factorization Techniques for Recommender Systems The definitive Netflix Prize paper establishing matrix factorization as the dominant CF paradigm.	Yehuda Koren, Robert Bell, Chris Volinsky	2009	11142
Collaborative Filtering for Implicit Feedback Datasets Foundational paper for implicit feedback (clicks, views); introduces confidence-weighted MF with ALS optimization.	Yifan Hu, Yehuda Koren, Chris Volinsky	2008	3154
BPR: Bayesian Personalized Ranking from Implicit Feedback Standard pairwise ranking loss for implicit feedback; BPR-OPT criterion ubiquitous in modern training.	Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, Lars Schmidt-Thieme	2009	4304
Factorization Meets the Neighborhood Combines neighborhood and latent factor models; integrates user/item biases; essential for blending approaches.	Yehuda Koren	2008	3863
Probabilistic Matrix Factorization Introduced probabilistic treatment of MF that scales linearly with observations; handles Netflix-scale sparsity with automatic regularization via adaptive priors.	Ruslan Salakhutdinov, Andriy Mnih	2007	5900
Factorization Machines General predictor combining SVM flexibility with factorization model power; models all pairwise feature interactions in linear time. Subsumes SVD++, PITF, and specialized models.	Steffen Rendle	2010	2800
Collaborative Filtering with Temporal Dynamics TimeSVD++ explicitly models drifting user preferences, item popularity evolution, and time-varying biases. Core component of Netflix Prize winning solution.	Yehuda Koren	2009	1600

Deep Recommenders

Build neural network-based recommendation models

2016 3182 cited

Deep Neural Networks for YouTube Recommendations

Paul Covington, Jay Adams, Emre Sargin

Seminal industry paper establishing two-stage deep learning (candidate generation + ranking) at billion-scale.

2020 230 cited

Embedding-based Retrieval in Facebook Search

Jui-Ting Huang, et al.

Comprehensive industry paper on two-tower embedding models; covers hard negative mining, ANN indexing, production deployment.

2018 12 cited

Self-Attentive Sequential Recommendation (SASRec)

Wang-Cheng Kang, Julian McAuley

Applies self-attention to sequential recommendations; the Transformer foundation inspiring BERT4Rec and modern sequential models.

2019 228 cited

BERT4Rec: Sequential Recommendation with BERT

Fei Sun, et al. (Alibaba)

Adapts BERT's bidirectional self-attention with Cloze task for sequential predictions.

2017 6183 cited

Neural Collaborative Filtering

Xiangnan He, et al.

Replaces inner product with neural networks; introduces GMF+MLP framework (NeuMF); highly cited baseline.

2016 4500 cited

Wide & Deep Learning for Recommender Systems

Heng-Tze Cheng, Levent Koc, Jeremiah Harmsen, Tal Shaked, Tushar Chandra, Hrishi Aradhye, Glen Anderson, Greg Corrado, Wei Chai, Mustafa Ispir, Rohan Anil, Zakaria Haque, Lichan Hong, Vihan Jain, Xiaobing Liu, Hemal Shah

Jointly trains wide linear models (memorization) with deep networks (generalization) using shared embeddings. Productionized on Google Play for 1B+ users; seminal industry paper establishing wide+deep paradigm.

2017 2700 cited

DeepFM: A Factorization-Machine based Neural Network for CTR Prediction

Huifeng Guo, Ruiming Tang, Yunming Ye, Zhenguo Li, Xiuqiang He

Combines FM component (low-order interactions) with deep component (high-order) via shared embeddings. Unlike Wide&Deep, requires no manual feature engineering. End-to-end training.

2018 2100 cited

Deep Interest Network for Click-Through Rate Prediction

Guorui Zhou, Xiaoqiang Zhu, Chenru Song, Ying Fan, Han Zhu, Xiao Ma, Yanghui Yan, Junqi Jin, Han Li, Kun Gai

Attention mechanism adaptively learns user interest representations from behavioral history relative to candidate items—solving fixed-length embedding bottleneck. Deployed on Alibaba main traffic.

Title	Authors	Year	Citations
Deep Neural Networks for YouTube Recommendations Seminal industry paper establishing two-stage deep learning (candidate generation + ranking) at billion-scale.	Paul Covington, Jay Adams, Emre Sargin	2016	3182
Embedding-based Retrieval in Facebook Search Comprehensive industry paper on two-tower embedding models; covers hard negative mining, ANN indexing, production deployment.	Jui-Ting Huang, et al.	2020	230
Self-Attentive Sequential Recommendation (SASRec) Applies self-attention to sequential recommendations; the Transformer foundation inspiring BERT4Rec and modern sequential models.	Wang-Cheng Kang, Julian McAuley	2018	12
BERT4Rec: Sequential Recommendation with BERT Adapts BERT's bidirectional self-attention with Cloze task for sequential predictions.	Fei Sun, et al. (Alibaba)	2019	228
Neural Collaborative Filtering Replaces inner product with neural networks; introduces GMF+MLP framework (NeuMF); highly cited baseline.	Xiangnan He, et al.	2017	6183
Wide & Deep Learning for Recommender Systems Jointly trains wide linear models (memorization) with deep networks (generalization) using shared embeddings. Productionized on Google Play for 1B+ users; seminal industry paper establishing wide+deep paradigm.	Heng-Tze Cheng, Levent Koc, Jeremiah Harmsen, Tal Shaked, Tushar Chandra, Hrishi Aradhye, Glen Anderson, Greg Corrado, Wei Chai, Mustafa Ispir, Rohan Anil, Zakaria Haque, Lichan Hong, Vihan Jain, Xiaobing Liu, Hemal Shah	2016	4500
DeepFM: A Factorization-Machine based Neural Network for CTR Prediction Combines FM component (low-order interactions) with deep component (high-order) via shared embeddings. Unlike Wide&Deep, requires no manual feature engineering. End-to-end training.	Huifeng Guo, Ruiming Tang, Yunming Ye, Zhenguo Li, Xiuqiang He	2017	2700
Deep Interest Network for Click-Through Rate Prediction Attention mechanism adaptively learns user interest representations from behavioral history relative to candidate items—solving fixed-length embedding bottleneck. Deployed on Alibaba main traffic.	Guorui Zhou, Xiaoqiang Zhu, Chenru Song, Ying Fan, Han Zhu, Xiao Ma, Yanghui Yan, Junqi Jin, Han Li, Kun Gai	2018	2100

Causal Recommendations & Debiasing

Make fair recommendations despite biased data

2016 142 cited

Recommendations as Treatments: Debiasing Learning and Evaluation

Tobias Schnabel, Adith Swaminathan, Ashudeep Singh, Navin Chandak, Thorsten Joachims

Foundational paper applying causal inference to recommenders; introduces IPS-based unbiased estimators.

2017 495 cited

Unbiased Learning-to-Rank with Biased Feedback

Thorsten Joachims, Adith Swaminathan, Tobias Schnabel

Addresses position bias in click data; propensity-weighted ranking SVM; essential for implicit feedback.

2015 200 cited

The Self-Normalized Estimator for Counterfactual Learning

Adith Swaminathan, Thorsten Joachims

Introduces SNIPS reducing IPS variance without bias; widely used in production.

2021 46 cited

Counterfactual Learning and Evaluation for Recommender Systems

Yuta Saito, Thorsten Joachims

Comprehensive tutorial covering IPS, SNIPS, Doubly Robust estimators with implementation guidance.

2019 250 cited

Doubly Robust Joint Learning for Recommendation on Data Missing Not at Random

Xiaojie Wang, Rui Zhang, Yu Sun, Jianzhong Qi

Combines imputation models with propensity scoring for unbiased performance estimation; theoretically principled doubly robust framework specifically for recommender systems with MNAR data.

2018 400 cited

Causal Embeddings for Recommendation

Stephen Bonner, Flavian Vasile

CausE optimizes recommendation policy for causal treatment effects (not just prediction) using domain adaptation to handle biased logged data. RecSys Best Paper Award.

2020 180 cited

The Deconfounded Recommender: A Causal Inference Approach to Recommendation

Yixin Wang, Dawen Liang, Laurent Charlin, David Blei

Two-stage approach using substitute confounders from exposure modeling; addresses unobserved confounding in observational recommendation data from principled causal perspective.

Title	Authors	Year	Citations
Recommendations as Treatments: Debiasing Learning and Evaluation Foundational paper applying causal inference to recommenders; introduces IPS-based unbiased estimators.	Tobias Schnabel, Adith Swaminathan, Ashudeep Singh, Navin Chandak, Thorsten Joachims	2016	142
Unbiased Learning-to-Rank with Biased Feedback Addresses position bias in click data; propensity-weighted ranking SVM; essential for implicit feedback.	Thorsten Joachims, Adith Swaminathan, Tobias Schnabel	2017	495
The Self-Normalized Estimator for Counterfactual Learning Introduces SNIPS reducing IPS variance without bias; widely used in production.	Adith Swaminathan, Thorsten Joachims	2015	200
Counterfactual Learning and Evaluation for Recommender Systems Comprehensive tutorial covering IPS, SNIPS, Doubly Robust estimators with implementation guidance.	Yuta Saito, Thorsten Joachims	2021	46
Doubly Robust Joint Learning for Recommendation on Data Missing Not at Random Combines imputation models with propensity scoring for unbiased performance estimation; theoretically principled doubly robust framework specifically for recommender systems with MNAR data.	Xiaojie Wang, Rui Zhang, Yu Sun, Jianzhong Qi	2019	250
Causal Embeddings for Recommendation CausE optimizes recommendation policy for causal treatment effects (not just prediction) using domain adaptation to handle biased logged data. RecSys Best Paper Award.	Stephen Bonner, Flavian Vasile	2018	400
The Deconfounded Recommender: A Causal Inference Approach to Recommendation Two-stage approach using substitute confounders from exposure modeling; addresses unobserved confounding in observational recommendation data from principled causal perspective.	Yixin Wang, Dawen Liang, Laurent Charlin, David Blei	2020	180

Exploration & Diversity

Balance showing what users like vs. new discoveries

2005 1834 cited

Topic Diversification for Recommendation Lists

Cai-Nicolas Ziegler, Sean McNee, Joseph Konstan, Georg Lausen

Pioneering paper on balancing accuracy with diversity; introduces intra-list diversity metrics.

2014 393 cited

Exploring the Filter Bubble

Tien Nguyen, Pik-Mai Hui, F. Maxwell Harper, Loren Terveen, Joseph Konstan

First rigorous longitudinal study measuring filter bubble effects at individual level using MovieLens.

2009 1 cited

Blockbuster Culture's Next Rise or Fall

Daniel Fleder, Kartik Hosanagar

Analyzes how recommenders affect aggregate sales diversity (long tail vs. blockbusters).

2018 143 cited

Explore, Exploit, and Explain

James McInerney, et al. (Spotify)

Spotify's production system combining contextual bandits for exploration with explainable recommendations.

2010 4000 cited

A Contextual-Bandit Approach to Personalized News Article Recommendation

Lihong Li, Wei Chu, John Langford, Robert Schapire

Introduces LinUCB algorithm for contextual bandits; 12.5% CTR lift over context-free bandits on 33M+ Yahoo events. Foundational paper for bandit-based recommendations.

2011 2600 cited

An Empirical Evaluation of Thompson Sampling

Olivier Chapelle, Lihong Li

Comprehensive evaluation across display advertising and news recommendation demonstrating Thompson Sampling achieves state-of-the-art results; reignited practical interest in TS.

2018 400 cited

Calibrated Recommendations

Harald Steck

Shows accuracy-optimized recommenders crowd out users' lesser interests; proposes KL-divergence calibration metrics and efficient greedy re-ranking. Netflix research, RecSys Best Paper Nominee.

Title	Authors	Year	Citations
Topic Diversification for Recommendation Lists Pioneering paper on balancing accuracy with diversity; introduces intra-list diversity metrics.	Cai-Nicolas Ziegler, Sean McNee, Joseph Konstan, Georg Lausen	2005	1834
Exploring the Filter Bubble First rigorous longitudinal study measuring filter bubble effects at individual level using MovieLens.	Tien Nguyen, Pik-Mai Hui, F. Maxwell Harper, Loren Terveen, Joseph Konstan	2014	393
Blockbuster Culture's Next Rise or Fall Analyzes how recommenders affect aggregate sales diversity (long tail vs. blockbusters).	Daniel Fleder, Kartik Hosanagar	2009	1
Explore, Exploit, and Explain Spotify's production system combining contextual bandits for exploration with explainable recommendations.	James McInerney, et al. (Spotify)	2018	143
A Contextual-Bandit Approach to Personalized News Article Recommendation Introduces LinUCB algorithm for contextual bandits; 12.5% CTR lift over context-free bandits on 33M+ Yahoo events. Foundational paper for bandit-based recommendations.	Lihong Li, Wei Chu, John Langford, Robert Schapire	2010	4000
An Empirical Evaluation of Thompson Sampling Comprehensive evaluation across display advertising and news recommendation demonstrating Thompson Sampling achieves state-of-the-art results; reignited practical interest in TS.	Olivier Chapelle, Lihong Li	2011	2600
Calibrated Recommendations Shows accuracy-optimized recommenders crowd out users' lesser interests; proposes KL-divergence calibration metrics and efficient greedy re-ranking. Netflix research, RecSys Best Paper Nominee.	Harald Steck	2018	400

Cold Start

Recommend to new users or items with no history

2017 154 cited

DropoutNet: Addressing Cold Start in Recommender Systems

Maksims Volkovs, Guangwei Yu, Tomi Poutanen

Uses dropout to force reliance on content features when collaborative signals unavailable; practical and widely adopted.

2017 148 cited

A Meta-Learning Perspective on Cold-Start Recommendations

Manasi Vartak, et al. (Twitter)

Applies meta-learning to item cold-start; learns to adapt quickly from user history.

2015 1613 cited

Collaborative Deep Learning for Recommender Systems

Hao Wang, Naiyan Wang, Dit-Yan Yeung

Combines deep learning for content (stacked denoising autoencoders) with CF; influential hybrid approach.

2019 321 cited

MeLU: Meta-Learned User Preference Estimator

Hoyeop Lee, et al.

Applies MAML framework to user cold-start; learns initialization that quickly adapts to new users.

2019 150 cited

From Zero-Shot Learning to Cold-Start Recommendation

Jingjing Li, Mengmeng Jing, Ke Lu, Zhengming Zhu, Lei Zhu, Yang Huang

First paper framing cold-start as zero-shot learning problem; proposes Low-rank Linear Auto-Encoder (LLAE) addressing domain shift and spurious correlations. Novel theoretical connection enabling attribute-based recommendations for entirely new users.

2021 120 cited

Transfer-Meta Framework for Cross-domain Recommendation to Cold-Start Users

Yongchun Zhu, Kaikai Ge, Fuzhen Zhuang, Ruobing Xie, Dongbo Xi, Xu Zhang, Leyu Lin, Hui Xiong

State-of-the-art combining transfer learning and meta-learning for cross-domain cold start; uses source domain to warm-start target domain predictions for zero-interaction users.

Title	Authors	Year	Citations
DropoutNet: Addressing Cold Start in Recommender Systems Uses dropout to force reliance on content features when collaborative signals unavailable; practical and widely adopted.	Maksims Volkovs, Guangwei Yu, Tomi Poutanen	2017	154
A Meta-Learning Perspective on Cold-Start Recommendations Applies meta-learning to item cold-start; learns to adapt quickly from user history.	Manasi Vartak, et al. (Twitter)	2017	148
Collaborative Deep Learning for Recommender Systems Combines deep learning for content (stacked denoising autoencoders) with CF; influential hybrid approach.	Hao Wang, Naiyan Wang, Dit-Yan Yeung	2015	1613
MeLU: Meta-Learned User Preference Estimator Applies MAML framework to user cold-start; learns initialization that quickly adapts to new users.	Hoyeop Lee, et al.	2019	321
From Zero-Shot Learning to Cold-Start Recommendation First paper framing cold-start as zero-shot learning problem; proposes Low-rank Linear Auto-Encoder (LLAE) addressing domain shift and spurious correlations. Novel theoretical connection enabling attribute-based recommendations for entirely new users.	Jingjing Li, Mengmeng Jing, Ke Lu, Zhengming Zhu, Lei Zhu, Yang Huang	2019	150
Transfer-Meta Framework for Cross-domain Recommendation to Cold-Start Users State-of-the-art combining transfer learning and meta-learning for cross-domain cold start; uses source domain to warm-start target domain predictions for zero-interaction users.	Yongchun Zhu, Kaikai Ge, Fuzhen Zhuang, Ruobing Xie, Dongbo Xi, Xu Zhang, Leyu Lin, Hui Xiong	2021	120

Graph Neural Networks for Recommendations

Leverage network structure for better recommendations

2018 3500 cited

PinSage: Graph Convolutional Neural Networks for Web-Scale Recommender Systems

Rex Ying, Ruining He, Kaifeng Chen, Pong Eksombatchai, William Hamilton, Jure Leskovec

First billion-scale GCN deployment. Random-walk neighborhood sampling, curriculum training with hard negatives, MapReduce inference for Pinterest's 3B node graph. Seminal industrial GNN paper.

2019 2500 cited

Neural Graph Collaborative Filtering

Xiang Wang, Xiangnan He, Meng Wang, Fuli Feng, Tat-Seng Chua

First to explicitly encode collaborative signal via high-order connectivity on user-item bipartite graphs through embedding propagation. Foundational theoretical framework for GNN-based CF.

2020 3000 cited

LightGCN: Simplifying and Powering Graph Convolution Network for Recommendation

Xiangnan He, Kuan Deng, Xiang Wang, Yan Li, Yongdong Zhang, Meng Wang

Demonstrates feature transformation and nonlinear activation contribute little to CF; proposes simplified GCN using only neighborhood aggregation—16% improvement over NGCF with simpler architecture.

2019 1500 cited

KGAT: Knowledge Graph Attention Network for Recommendation

Xiang Wang, Xiangnan He, Yixin Cao, Meng Liu, Tat-Seng Chua

Unifies user-item interaction graphs with knowledge graphs via attention-based neighbor discrimination; enables more accurate, diverse, and explainable recommendations.

Title	Authors	Year	Citations
PinSage: Graph Convolutional Neural Networks for Web-Scale Recommender Systems First billion-scale GCN deployment. Random-walk neighborhood sampling, curriculum training with hard negatives, MapReduce inference for Pinterest's 3B node graph. Seminal industrial GNN paper.	Rex Ying, Ruining He, Kaifeng Chen, Pong Eksombatchai, William Hamilton, Jure Leskovec	2018	3500
Neural Graph Collaborative Filtering First to explicitly encode collaborative signal via high-order connectivity on user-item bipartite graphs through embedding propagation. Foundational theoretical framework for GNN-based CF.	Xiang Wang, Xiangnan He, Meng Wang, Fuli Feng, Tat-Seng Chua	2019	2500
LightGCN: Simplifying and Powering Graph Convolution Network for Recommendation Demonstrates feature transformation and nonlinear activation contribute little to CF; proposes simplified GCN using only neighborhood aggregation—16% improvement over NGCF with simpler architecture.	Xiangnan He, Kuan Deng, Xiang Wang, Yan Li, Yongdong Zhang, Meng Wang	2020	3000
KGAT: Knowledge Graph Attention Network for Recommendation Unifies user-item interaction graphs with knowledge graphs via attention-based neighbor discrimination; enables more accurate, diverse, and explainable recommendations.	Xiang Wang, Xiangnan He, Yixin Cao, Meng Liu, Tat-Seng Chua	2019	1500

Multi-Task & Multi-Objective Learning

Optimize multiple recommendation goals together

2018 1500 cited

Modeling Task Relationships in Multi-task Learning with Multi-gate Mixture-of-Experts

Jiaqi Ma, Zhe Zhao, Xinyang Yi, Jilin Chen, Lichan Hong, Ed Chi

Task-specific gating networks over shared expert submodels; explicitly learns task relationships from data. Deployed at Google; widely adopted at Pinterest, LinkedIn, Meta.

2020 700 cited

Progressive Layered Extraction (PLE): A Novel Multi-Task Learning Model for Personalized Recommendations

Hongyan Tang, Junning Liu, Ming Zhao, Xudong Gong

Addresses 'seesaw phenomenon' where improving one task hurts others; explicitly separates shared/task-specific experts with progressive routing. Achieved 2.23% lift at Tencent.

2018 1200 cited

Entire Space Multi-Task Model: An Effective Approach for Estimating Post-Click Conversion Rate

Xiao Ma, Liqin Zhao, Guan Huang, Zhi Wang, Zelin Hu, Xiaoqiang Zhu, Kun Gai

Models CVR over entire impression space (not just clicks) to eliminate sample selection bias; jointly trains CTR and CTCVR with shared embeddings. Alibaba/Taobao deployment.

2019 200 cited

A Pareto-Efficient Algorithm for Multiple Objective Optimization in E-Commerce Recommendation

Yao Lin, Dongjin Chen, Mingsheng Shang, Youfang Lin, Shenghua Bao, Wei Xiao, Tat-Seng Chua

PE-LTR framework using coordinate descent with theoretical Pareto efficiency guarantees for balancing CTR and GMV. Alibaba e-commerce production system.

Title	Authors	Year	Citations
Modeling Task Relationships in Multi-task Learning with Multi-gate Mixture-of-Experts Task-specific gating networks over shared expert submodels; explicitly learns task relationships from data. Deployed at Google; widely adopted at Pinterest, LinkedIn, Meta.	Jiaqi Ma, Zhe Zhao, Xinyang Yi, Jilin Chen, Lichan Hong, Ed Chi	2018	1500
Progressive Layered Extraction (PLE): A Novel Multi-Task Learning Model for Personalized Recommendations Addresses 'seesaw phenomenon' where improving one task hurts others; explicitly separates shared/task-specific experts with progressive routing. Achieved 2.23% lift at Tencent.	Hongyan Tang, Junning Liu, Ming Zhao, Xudong Gong	2020	700
Entire Space Multi-Task Model: An Effective Approach for Estimating Post-Click Conversion Rate Models CVR over entire impression space (not just clicks) to eliminate sample selection bias; jointly trains CTR and CTCVR with shared embeddings. Alibaba/Taobao deployment.	Xiao Ma, Liqin Zhao, Guan Huang, Zhi Wang, Zelin Hu, Xiaoqiang Zhu, Kun Gai	2018	1200
A Pareto-Efficient Algorithm for Multiple Objective Optimization in E-Commerce Recommendation PE-LTR framework using coordinate descent with theoretical Pareto efficiency guarantees for balancing CTR and GMV. Alibaba e-commerce production system.	Yao Lin, Dongjin Chen, Mingsheng Shang, Youfang Lin, Shenghua Bao, Wei Xiao, Tat-Seng Chua	2019	200

Reinforcement Learning for Recommendations

Optimize for long-term user satisfaction

2019 500 cited

Top-K Off-Policy Correction for a REINFORCE Recommender System

Minmin Chen, Alex Beutel, Paul Covington, Sagar Jain, Francois Belletti, Ed Chi

Scales REINFORCE to YouTube production with millions of items; introduces top-K off-policy correction for learning from biased logged feedback. Seminal industry RL paper.

2019 300 cited

SlateQ: A Tractable Decomposition for Reinforcement Learning with Recommendation Sets

Eugene Ie, Vihan Jain, Jing Wang, Sanmit Narvekar, Ritesh Agarwal, Rui Wu, Heng-Tze Cheng, Tushar Chandra, Craig Boutilier

Value-based RL decomposition for slate recommendations; long-term slate value decomposes into tractable item-wise LTVs. YouTube production deployment solving combinatorial slate optimization.

2018 350 cited

Deep Reinforcement Learning for List-wise Recommendations

Xiangyu Zhao, Liang Zhang, Zhuoye Ding, Long Xia, Jiliang Tang, Dawei Yin

Actor-Critic framework capturing inter-item relationships in list recommendations; addresses sequential decision-making for e-commerce. JD.com application.

2020 150 cited

Pseudo Dyna-Q: A Reinforcement Learning Framework for Interactive Recommendation

Lixin Zou, Long Xia, Zhuoye Ding, Dawei Yin, Jiaxing Song, Weidong Liu, Yongjun Bao

Model-based RL using world models for recommendation; addresses data efficiency challenge of learning from limited online interactions.

Title	Authors	Year	Citations
Top-K Off-Policy Correction for a REINFORCE Recommender System Scales REINFORCE to YouTube production with millions of items; introduces top-K off-policy correction for learning from biased logged feedback. Seminal industry RL paper.	Minmin Chen, Alex Beutel, Paul Covington, Sagar Jain, Francois Belletti, Ed Chi	2019	500
SlateQ: A Tractable Decomposition for Reinforcement Learning with Recommendation Sets Value-based RL decomposition for slate recommendations; long-term slate value decomposes into tractable item-wise LTVs. YouTube production deployment solving combinatorial slate optimization.	Eugene Ie, Vihan Jain, Jing Wang, Sanmit Narvekar, Ritesh Agarwal, Rui Wu, Heng-Tze Cheng, Tushar Chandra, Craig Boutilier	2019	300
Deep Reinforcement Learning for List-wise Recommendations Actor-Critic framework capturing inter-item relationships in list recommendations; addresses sequential decision-making for e-commerce. JD.com application.	Xiangyu Zhao, Liang Zhang, Zhuoye Ding, Long Xia, Jiliang Tang, Dawei Yin	2018	350
Pseudo Dyna-Q: A Reinforcement Learning Framework for Interactive Recommendation Model-based RL using world models for recommendation; addresses data efficiency challenge of learning from limited online interactions.	Lixin Zou, Long Xia, Zhuoye Ding, Dawei Yin, Jiaxing Song, Weidong Liu, Yongjun Bao	2020	150

Fairness & Responsible Recommendations

Ensure fair exposure for creators and sellers

2018 900 cited

Fairness of Exposure in Rankings

Ashudeep Singh, Thorsten Joachims

Foundational framework for exposure-based fairness; develops efficient algorithms maximizing user utility while satisfying fairness constraints on content provider exposure. Applicable to job platforms, e-commerce, any two-sided marketplace.

2018 350 cited

Towards a Fair Marketplace: Counterfactual Evaluation of the Trade-off between Relevance, Fairness & Satisfaction

Rishabh Mehrotra, James McInerney, Hugues Bouchard, Mounia Lalmas, Fernando Diaz

Studies relevance-fairness trade-offs in two-sided marketplaces; proposes counterfactual evaluation for balancing consumer and supplier objectives. Spotify production research.

2020 200 cited

Controlling Fairness and Bias in Dynamic Learning-to-Rank

Marco Morik, Ashudeep Singh, Jessica Hong, Thorsten Joachims

Learning-to-rank enforcing merit-based fairness guarantees to item groups while accounting for selection bias in implicit feedback. Addresses fairness maintenance over time.

2019 150 cited

Recommendations and Their Impact on the Provider Side

Rishabh Mehrotra, James McInerney, Hugues Bouchard, Mounia Lalmas

Comprehensive study of how recommendations affect content providers; examines supplier-side impacts and multi-stakeholder trade-offs. Spotify research.

Title	Authors	Year	Citations
Fairness of Exposure in Rankings Foundational framework for exposure-based fairness; develops efficient algorithms maximizing user utility while satisfying fairness constraints on content provider exposure. Applicable to job platforms, e-commerce, any two-sided marketplace.	Ashudeep Singh, Thorsten Joachims	2018	900
Towards a Fair Marketplace: Counterfactual Evaluation of the Trade-off between Relevance, Fairness & Satisfaction Studies relevance-fairness trade-offs in two-sided marketplaces; proposes counterfactual evaluation for balancing consumer and supplier objectives. Spotify production research.	Rishabh Mehrotra, James McInerney, Hugues Bouchard, Mounia Lalmas, Fernando Diaz	2018	350
Controlling Fairness and Bias in Dynamic Learning-to-Rank Learning-to-rank enforcing merit-based fairness guarantees to item groups while accounting for selection bias in implicit feedback. Addresses fairness maintenance over time.	Marco Morik, Ashudeep Singh, Jessica Hong, Thorsten Joachims	2020	200
Recommendations and Their Impact on the Provider Side Comprehensive study of how recommendations affect content providers; examines supplier-side impacts and multi-stakeholder trade-offs. Spotify research.	Rishabh Mehrotra, James McInerney, Hugues Bouchard, Mounia Lalmas	2019	150

Evaluation Methodology

Measure recommendation quality beyond clicks

2018 200 cited

Offline A/B Testing for Recommender Systems

Alexandre Gilotte, Clément Calauzènes, Thomas Nedelec, Alexandre Abraham, Simon Dollé

Counterfactual estimators for offline evaluation correlating with online A/B tests; addresses bias-variance tradeoffs in off-policy evaluation. Criteo production system enabling faster iteration.

2010 1500 cited

Performance of Recommender Algorithms on Top-N Recommendation Tasks

Paolo Cremonesi, Yehuda Koren, Roberto Turrin

Demonstrates RMSE optimization doesn't translate to top-N accuracy; establishes methodology for evaluating recommendation quality beyond prediction error metrics. Netflix context.

2019 1200 cited

Are We Really Making Much Progress? A Worrying Analysis of Recent Neural Recommendation Approaches

Maurizio Ferrari Dacrema, Paolo Cremonesi, Dietmar Jannach

Critical analysis showing many neural approaches don't outperform well-tuned baselines; emphasizes reproducibility and rigorous evaluation methodology. Essential for experimental design.