Q-learning: Theory and applications
J Clifton, E Laber - Annual Review of Statistics and Its …, 2020 - annualreviews.org
Q-learning, originally an incremental algorithm for estimating an optimal decision strategy in
an infinite-horizon decision problem, now refers to a general class of reinforcement learning …
an infinite-horizon decision problem, now refers to a general class of reinforcement learning …
A review of off-policy evaluation in reinforcement learning
Reinforcement learning (RL) is one of the most vibrant research frontiers in machine
learning and has been recently applied to solve a number of challenging problems. In this …
learning and has been recently applied to solve a number of challenging problems. In this …
Learning when-to-treat policies
Many applied decision-making problems have a dynamic component: The policymaker
needs not only to choose whom to treat, but also when to start which treatment. For example …
needs not only to choose whom to treat, but also when to start which treatment. For example …
[KİTAP][B] Dynamic treatment regimes: Statistical methods for precision medicine
Dynamic Treatment Regimes: Statistical Methods for Precision Medicine provides a
comprehensive introduction to statistical methodology for the evaluation and discovery of …
comprehensive introduction to statistical methodology for the evaluation and discovery of …
Learning cost-effective and interpretable treatment regimes
Decision makers, such as doctors and judges, make crucial decisions such as
recommending treatments to patients, and granting bails to defendants on a daily basis …
recommending treatments to patients, and granting bails to defendants on a daily basis …
[HTML][HTML] Machine Learning and Health Science Research: Tutorial
Machine learning (ML) has seen impressive growth in health science research due to its
capacity for handling complex data to perform a range of tasks, including unsupervised …
capacity for handling complex data to perform a range of tasks, including unsupervised …
[HTML][HTML] Tree-based reinforcement learning for estimating optimal dynamic treatment regimes
Dynamic treatment regimes (DTRs) are sequences of treatment decision rules, in which
treatment may be adapted over time in response to the changing course of an individual …
treatment may be adapted over time in response to the changing course of an individual …
Ambiguous dynamic treatment regimes: A reinforcement learning approach
S Saghafian - Management Science, 2024 - pubsonline.informs.org
A main research goal in various studies is to use an observational data set and provide a
new set of counterfactual guidelines that can yield causal improvements. Dynamic …
new set of counterfactual guidelines that can yield causal improvements. Dynamic …
Estimating and improving dynamic treatment regimes with a time-varying instrumental variable
Estimating dynamic treatment regimes (DTRs) from retrospective observational data is
challenging as some degree of unmeasured confounding is often expected. In this work, we …
challenging as some degree of unmeasured confounding is often expected. In this work, we …
Deep Transfer -Learning for Offline Non-Stationary Reinforcement Learning
In dynamic decision-making scenarios across business and healthcare, leveraging sample
trajectories from diverse populations can significantly enhance reinforcement learning (RL) …
trajectories from diverse populations can significantly enhance reinforcement learning (RL) …