Off-policy policy evaluation for sequential decisions under unobserved confounding

[PDF] arxiv.org

A review of off-policy evaluation in reinforcement learning

M Uehara, C Shi, N Kallus - arXiv … accurate off-policy estimators is crucial for both evaluating and optimizing for
new policies. The main challenge in off-policy estimation is the distribution shift between the …

Cited by 10 · All 6 versions
