Citation:
Parbhoo S, Gottesman O, Doshi-Velez F. Shaping Control Variates for Off-Policy Evaluation. In: NeurIPS Workshop on Offline Reinforcement Learning; 2020:1-9.
Paper | 262 KB |
SEC 150 Western Ave Room 2.336
finale@seas.harvard.edu