On formalizing causal off-policy evaluation for sequential decision-making