Citation:
Masood MA, Doshi-Velez F. Diversity-Inducing Policy Gradient: Using MMD to find a set of policies that are diverse in terms of stete-visitation. International Conference on Machine Learning (ICML) Exploration in Reinforcement Learning Workshop. 2018.
Paper | 251 KB |