Transfer Learning Across Patient Variations with Hidden Parameter Markov Decision Processes