Citation:
Prasad N, Engelhardt B, Doshi-Velez F. Defining Admissible Rewards for High-Confidence Policy Evaluation in Batch Reinforcement Learning. ACM Conference on Health, Inference and Learning. 2020;2 :1-9.
Paper | 1.1 MB |
SEC 150 Western Ave Room 2.336
finale@seas.harvard.edu
Paper | 1.1 MB |