Loading...
The system can't perform the operation now. Try again later.
Cite
Advanced search
Find articles
with
all
of the words
with the
exact phrase
with
at least one
of the words
without
the words
where my words occur
anywhere in the article
in the title of the article
Return articles
authored
by
e.g.,
"PJ Hayes"
or
McCarthy
Return articles
published
in
e.g.,
J Biol Chem
or
Nature
Return articles
dated
between
—
e.g.,
1996
Saved to My library
Done
Remove article
Articles
Case law
Profiles
My profile
My library
Alerts
Metrics
Advanced search
Settings
Get journal articles
Free ChatGPT
Get journal articles
Articles
Scholar
About 80 results (
0.02
sec)
My profile
My library
Year
Any time
Since 2025
Since 2024
Since 2021
Sort by relevance
Sort by date
Any time
Since 2025
Since 2024
Since 2021
Custom range...
—
Search
Sort by relevance
Sort by date
Create alert
Off-policy policy evaluation for sequential decisions under unobserved confounding
Search within citing articles
[Free GPT-4]
[PDF]
arxiv.org
A review of off-policy evaluation in reinforcement learning
M Uehara
,
C Shi
,
N Kallus
- ar** accurate off-policy estimators is crucial for both evaluating and optimizing for
new policies. The main challenge in off-policy estimation is the distribution shift between the …
Save
Cite
Cited by 10
Related articles
All 6 versions
Free GPT-4
Create alert
Previous
1
2
3
4
5
6
7
8
Next
1
2
3
4
5
6
7
8
Privacy
Terms
Help
About Scholar
Search help