Off-Policy Evaluation and Learning for the Future under Non-Stationarity
Published in Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2025
We study the novel problem of future off-policy evaluation (F-OPE) and learning (F-OPL) for estimating and optimizing the future value of policies in non-stationary environments, where distributions vary over time.
Recommended citation: Shimizu, Tatsuhiro, et al. "Off-Policy Evaluation and Learning for the Future under Non-Stationarity." Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V. 1. 2025.
Download Paper