Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Deep reinforcement learning
SE Li - Reinforcement learning for sequential decision and …, 2023 - Springer
Similar to humans, RL agents use interactive learning to successfully obtain satisfactory
decision strategies. However, in many cases, it is desirable to learn directly from …
decision strategies. However, in many cases, it is desirable to learn directly from …
Statistical relational artificial intelligence: Logic, probability, and computation
An intelligent agent interacting with the real world will encounter individual people, courses,
test results, drugs prescriptions, chairs, boxes, etc., and needs to reason about properties of …
test results, drugs prescriptions, chairs, boxes, etc., and needs to reason about properties of …
An overview of cooperative and competitive multiagent learning
Multi-agent systems (MASs) is an area of distributed artificial intelligence that emphasizes
the joint behaviors of agents with some degree of autonomy and the complexities arising …
the joint behaviors of agents with some degree of autonomy and the complexities arising …
Coordination of electric vehicle charging through multiagent reinforcement learning
The number of Electric Vehicle (EV) owners is expected to significantly increase in the near
future, since EVs are regarded as valuable assets both for transportation and energy storage …
future, since EVs are regarded as valuable assets both for transportation and energy storage …
Learning neuro-symbolic relational transition models for bilevel planning
In robotic domains, learning and planning are complicated by continuous state spaces,
continuous action spaces, and long task horizons. In this work, we address these challenges …
continuous action spaces, and long task horizons. In this work, we address these challenges …
A review of symbolic, subsymbolic and hybrid methods for sequential decision making
In the field of Sequential Decision Making (SDM), two paradigms have historically vied for
supremacy: Automated Planning (AP) and Reinforcement Learning (RL). In the spirit of …
supremacy: Automated Planning (AP) and Reinforcement Learning (RL). In the spirit of …
Preference-based reinforcement learning: a formal framework and a policy iteration algorithm
This paper makes a first step toward the integration of two subfields of machine learning,
namely preference learning and reinforcement learning (RL). An important motivation for a …
namely preference learning and reinforcement learning (RL). An important motivation for a …
PDDLGym: Gym environments from PDDL problems
We present PDDLGym, a framework that automatically constructs OpenAI Gym
environments from PDDL domains and problems. Observations and actions in PDDLGym …
environments from PDDL domains and problems. Observations and actions in PDDLGym …
Learning value functions with relational state representations for guiding task-and-motion planning
We propose a novel relational state representation and an action-value function learning
algorithm that learns from planning experience for geometric task-and-motion planning …
algorithm that learns from planning experience for geometric task-and-motion planning …
Structured machine learning: the next ten years
The field of inductive logic programming (ILP) has made steady progress, since the first ILP
workshop in 1991, based on a balance of developments in theory, implementations and …
workshop in 1991, based on a balance of developments in theory, implementations and …