no code implementations • 30 Jan 2021 • Milad Vaali Esfahaani, Yanbo Xue, Peyman Setoodeh
This paper provides a comparative study between value-based and policy-based deep RL algorithms for designing recommender systems for online advertising.
Recommendation Systems reinforcement-learning +2