Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Reinforcement learning in healthcare: A survey
As a subfield of machine learning, reinforcement learning (RL) aims at optimizing decision
making by using interaction samples of an agent with its environment and the potentially …
making by using interaction samples of an agent with its environment and the potentially …
Reinforcement learning for intelligent healthcare applications: A survey
Discovering new treatments and personalizing existing ones is one of the major goals of
modern clinical research. In the last decade, Artificial Intelligence (AI) has enabled the …
modern clinical research. In the last decade, Artificial Intelligence (AI) has enabled the …
Towards optimal off-policy evaluation for reinforcement learning with marginalized importance sampling
Motivated by the many real-world applications of reinforcement learning (RL) that require
safe-policy iterations, we consider the problem of off-policy evaluation (OPE)---the problem …
safe-policy iterations, we consider the problem of off-policy evaluation (OPE)---the problem …
[BOK][B] Statistical methods for dynamic treatment regimes
B Chakraborty, EEM Moodie - 2013 - Springer
This book was written to summarize and describe the state of the art of statistical methods
developed to address questions of estimation and inference for dynamic treatment regimes …
developed to address questions of estimation and inference for dynamic treatment regimes …
[BOK][B] Reinforcement learning and dynamic programming using function approximators
From household appliances to applications in robotics, engineered systems involving
complex dynamics can only be as effective as the algorithms that control them. While …
complex dynamics can only be as effective as the algorithms that control them. While …
A reinforcement learning approach to weaning of mechanical ventilation in intensive care units
The management of invasive mechanical ventilation, and the regulation of sedation and
analgesia during ventilation, constitutes a major part of the care of patients admitted to …
analgesia during ventilation, constitutes a major part of the care of patients admitted to …
Off-policy policy gradient with state distribution correction
We study the problem of off-policy policy optimization in Markov decision processes, and
develop a novel off-policy policy gradient method. Prior off-policy policy gradient …
develop a novel off-policy policy gradient method. Prior off-policy policy gradient …
Experience replay for real-time reinforcement learning control
Reinforcement-learning (RL) algorithms can automatically learn optimal control strategies
for nonlinear, possibly stochastic systems. A promising approach for RL control is …
for nonlinear, possibly stochastic systems. A promising approach for RL control is …
Reinforcement learning design for cancer clinical trials
We develop reinforcement learning trials for discovering individualized treatment regimens
for life‐threatening diseases such as cancer. A temporal‐difference learning method called …
for life‐threatening diseases such as cancer. A temporal‐difference learning method called …
Leveraging factored action spaces for efficient offline reinforcement learning in healthcare
Many reinforcement learning (RL) applications have combinatorial action spaces, where
each action is a composition of sub-actions. A standard RL approach ignores this inherent …
each action is a composition of sub-actions. A standard RL approach ignores this inherent …