Breaking the curse of horizon in conformal off-policy prediction
Determine how to break the curse of horizon in conformal off-policy prediction for sequential decision making by constructing prediction interval procedures that remain efficient for long decision horizons while retaining valid coverage guarantees for the potential outcome under a given target policy.
References
It can be seen that the proposed method is able to achieve nominal coverage in general. Nonetheless, as commented in our paper, it suffers from the curse of horizon and would be inefficient in long-horizon settings. It remains unclear how to break the curse of horizon and we leave it as future work.
— Conformal Off-policy Prediction
(2206.06711 - Zhang et al., 2022) in Section 5, Synthetic Data Analysis (Results, Example 3)