Citation:
Prasad N, Engelhardt B, Doshi-Velez F. Defining Admissible Rewards for High-Confidence Policy Evaluation in Batch Reinforcement Learning. ACM Conference on Health, Inference and Learning. 2020;2 :1-9.
Paper | 1.1 MB |
Science Center 316.04
finale@seas.harvard.edu
Paper | 1.1 MB |