Q-learning: Theory and applications

J Clifton, E Laber - Annual Review of Statistics and Its …, 2020 - annualreviews.org
Q-learning, originally an incremental algorithm for estimating an optimal decision strategy in
an infinite-horizon decision problem, now refers to a general class of reinforcement learning …

A review of off-policy evaluation in reinforcement learning

M Uehara, C Shi, N Kallus - arxiv preprint arxiv:2212.06355, 2022 - arxiv.org
Reinforcement learning (RL) is one of the most vibrant research frontiers in machine
learning and has been recently applied to solve a number of challenging problems. In this …

Learning when-to-treat policies

X Nie, E Brunskill, S Wager - Journal of the American Statistical …, 2021 - Taylor & Francis
Many applied decision-making problems have a dynamic component: The policymaker
needs not only to choose whom to treat, but also when to start which treatment. For example …

[KİTAP][B] Dynamic treatment regimes: Statistical methods for precision medicine

AA Tsiatis, M Davidian, ST Holloway, EB Laber - 2019 - taylorfrancis.com
Dynamic Treatment Regimes: Statistical Methods for Precision Medicine provides a
comprehensive introduction to statistical methodology for the evaluation and discovery of …

Learning cost-effective and interpretable treatment regimes

H Lakkaraju, C Rudin - Artificial intelligence and statistics, 2017 - proceedings.mlr.press
Decision makers, such as doctors and judges, make crucial decisions such as
recommending treatments to patients, and granting bails to defendants on a daily basis …

[HTML][HTML] Machine Learning and Health Science Research: Tutorial

H Cho, J She, D De Marchi, H El-Zaatari… - Journal of Medical …, 2024 - jmir.org
Machine learning (ML) has seen impressive growth in health science research due to its
capacity for handling complex data to perform a range of tasks, including unsupervised …

[HTML][HTML] Tree-based reinforcement learning for estimating optimal dynamic treatment regimes

Y Tao, L Wang, D Almirall - The annals of applied statistics, 2018 - ncbi.nlm.nih.gov
Dynamic treatment regimes (DTRs) are sequences of treatment decision rules, in which
treatment may be adapted over time in response to the changing course of an individual …

Ambiguous dynamic treatment regimes: A reinforcement learning approach

S Saghafian - Management Science, 2024 - pubsonline.informs.org
A main research goal in various studies is to use an observational data set and provide a
new set of counterfactual guidelines that can yield causal improvements. Dynamic …

Estimating and improving dynamic treatment regimes with a time-varying instrumental variable

S Chen, B Zhang - Journal of the Royal Statistical Society Series …, 2023 - academic.oup.com
Estimating dynamic treatment regimes (DTRs) from retrospective observational data is
challenging as some degree of unmeasured confounding is often expected. In this work, we …

Deep Transfer -Learning for Offline Non-Stationary Reinforcement Learning

J Chai, E Chen, J Fan - arxiv preprint arxiv:2501.04870, 2025 - arxiv.org
In dynamic decision-making scenarios across business and healthcare, leveraging sample
trajectories from diverse populations can significantly enhance reinforcement learning (RL) …