Generative models as an emerging paradigm in the chemical sciences
Traditional computational approaches to design chemical species are limited by the need to
compute properties for a vast number of candidates, eg, by discriminative modeling …
compute properties for a vast number of candidates, eg, by discriminative modeling …
Safe learning in robotics: From learning-based control to safe reinforcement learning
The last half decade has seen a steep rise in the number of contributions on safe learning
methods for real-world robotic deployments from both the control and reinforcement learning …
methods for real-world robotic deployments from both the control and reinforcement learning …
Open problems and fundamental limitations of reinforcement learning from human feedback
Reinforcement learning from human feedback (RLHF) is a technique for training AI systems
to align with human goals. RLHF has emerged as the central method used to finetune state …
to align with human goals. RLHF has emerged as the central method used to finetune state …
Eureka: Human-level reward design via coding large language models
Large Language Models (LLMs) have excelled as high-level semantic planners for
sequential decision-making tasks. However, harnessing them to learn complex low-level …
sequential decision-making tasks. However, harnessing them to learn complex low-level …
Rlprompt: Optimizing discrete text prompts with reinforcement learning
Prompting has shown impressive success in enabling large pretrained language models
(LMs) to perform diverse NLP tasks, especially when only few downstream data are …
(LMs) to perform diverse NLP tasks, especially when only few downstream data are …
Stable-baselines3: Reliable reinforcement learning implementations
STABLE-BASELINES3 provides open-source implementations of deep reinforcement
learning (RL) algorithms in Python. The implementations have been benchmarked against …
learning (RL) algorithms in Python. The implementations have been benchmarked against …
Deep reinforcement learning at the edge of the statistical precipice
Deep reinforcement learning (RL) algorithms are predominantly evaluated by comparing
their relative performance on a large suite of tasks. Most published results on deep RL …
their relative performance on a large suite of tasks. Most published results on deep RL …
A survey of zero-shot generalisation in deep reinforcement learning
The study of zero-shot generalisation (ZSG) in deep Reinforcement Learning (RL) aims to
produce RL algorithms whose policies generalise well to novel unseen situations at …
produce RL algorithms whose policies generalise well to novel unseen situations at …
A minimalist approach to offline reinforcement learning
Offline reinforcement learning (RL) defines the task of learning from a fixed batch of data.
Due to errors in value estimation from out-of-distribution actions, most offline RL algorithms …
Due to errors in value estimation from out-of-distribution actions, most offline RL algorithms …
Deep reinforcement learning in smart manufacturing: A review and prospects
To facilitate the personalized smart manufacturing paradigm with cognitive automation
capabilities, Deep Reinforcement Learning (DRL) has attracted ever-increasing attention by …
capabilities, Deep Reinforcement Learning (DRL) has attracted ever-increasing attention by …