Defining Admissible Rewards for High Confidence Policy Evaluation