Deep reinforcement learning

SE Li - Reinforcement learning for sequential decision and …, 2023 - Springer
Similar to humans, RL agents use interactive learning to successfully obtain satisfactory
decision strategies. However, in many cases, it is desirable to learn directly from …

Statistical relational artificial intelligence: Logic, probability, and computation

LD Raedt, K Kersting, S Natarajan, D Poole - Synthesis lectures on …, 2016 - Springer
An intelligent agent interacting with the real world will encounter individual people, courses,
test results, drugs prescriptions, chairs, boxes, etc., and needs to reason about properties of …

An overview of cooperative and competitive multiagent learning

PJ Hoen, K Tuyls, L Panait, S Luke… - Learning and Adaption in …, 2006 - Springer
Multi-agent systems (MASs) is an area of distributed artificial intelligence that emphasizes
the joint behaviors of agents with some degree of autonomy and the complexities arising …

Coordination of electric vehicle charging through multiagent reinforcement learning

FL Da Silva, CEH Nishida, DM Roijers… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
The number of Electric Vehicle (EV) owners is expected to significantly increase in the near
future, since EVs are regarded as valuable assets both for transportation and energy storage …

Learning neuro-symbolic relational transition models for bilevel planning

R Chitnis, T Silver, JB Tenenbaum… - 2022 IEEE/RSJ …, 2022 - ieeexplore.ieee.org
In robotic domains, learning and planning are complicated by continuous state spaces,
continuous action spaces, and long task horizons. In this work, we address these challenges …

A review of symbolic, subsymbolic and hybrid methods for sequential decision making

C Núñez-Molina, P Mesejo… - ACM Computing …, 2024 - dl.acm.org
In the field of Sequential Decision Making (SDM), two paradigms have historically vied for
supremacy: Automated Planning (AP) and Reinforcement Learning (RL). In the spirit of …

Preference-based reinforcement learning: a formal framework and a policy iteration algorithm

J Fürnkranz, E Hüllermeier, W Cheng, SH Park - Machine learning, 2012 - Springer
This paper makes a first step toward the integration of two subfields of machine learning,
namely preference learning and reinforcement learning (RL). An important motivation for a …

PDDLGym: Gym environments from PDDL problems

T Silver, R Chitnis - arxiv preprint arxiv:2002.06432, 2020 - arxiv.org
We present PDDLGym, a framework that automatically constructs OpenAI Gym
environments from PDDL domains and problems. Observations and actions in PDDLGym …

Learning value functions with relational state representations for guiding task-and-motion planning

B Kim, L Shimanuki - Conference on robot learning, 2020 - proceedings.mlr.press
We propose a novel relational state representation and an action-value function learning
algorithm that learns from planning experience for geometric task-and-motion planning …

Structured machine learning: the next ten years

TG Dietterich, P Domingos, L Getoor, S Muggleton… - Machine Learning, 2008 - Springer
The field of inductive logic programming (ILP) has made steady progress, since the first ILP
workshop in 1991, based on a balance of developments in theory, implementations and …