Doubly Robust Estimator for Off-Policy Evaluation with Large Action Spaces
Published in 2023 IEEE Symposium Series On Computational Intelligence, 2023
We study Off-Policy Evaluation (OPE) in contextual bandit settings with large action spaces.
Recommended citation: Shimizu, Tatsuhiro and Forastiere, Laura. (2023). "Doubly Robust Estimator for Off-Policy Evaluation with Large Action Spaces." In Proceedings of the 2023 IEEE Symposium Series On Computational Intelligence. http://tatsu432.github.io/files/MDR.pdf