Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
How to reuse and compose knowledge for a lifetime of tasks: A survey on continual learning and functional composition
A major goal of artificial intelligence (AI) is to create an agent capable of acquiring a general
understanding of the world. Such an agent would require the ability to continually …
understanding of the world. Such an agent would require the ability to continually …
Explainable reinforcement learning (XRL): a systematic literature review and taxonomy
Y Bekkemoen - Machine Learning, 2024 - Springer
In recent years, reinforcement learning (RL) systems have shown impressive performance
and remarkable achievements. Many achievements can be attributed to combining RL with …
and remarkable achievements. Many achievements can be attributed to combining RL with …
A survey on interpretable reinforcement learning
Although deep reinforcement learning has become a promising machine learning approach
for sequential decision-making problems, it is still not mature enough for high-stake domains …
for sequential decision-making problems, it is still not mature enough for high-stake domains …
Rl-gpt: Integrating reinforcement learning and code-as-policy
Abstract Large Language Models (LLMs) have demonstrated proficiency in utilizing various
tools by coding, yet they face limitations in handling intricate logic and precise control. In …
tools by coding, yet they face limitations in handling intricate logic and precise control. In …
Generating code world models with large language models guided by monte carlo tree search
In this work we consider Code World Models, world models generated by a Large Language
Model (LLM) in the form of Python code for model-based Reinforcement Learning (RL) …
Model (LLM) in the form of Python code for model-based Reinforcement Learning (RL) …
Interpretable and editable programmatic tree policies for reinforcement learning
Deep reinforcement learning agents are prone to goal misalignments. The black-box nature
of their policies hinders the detection and correction of such misalignments, and the trust …
of their policies hinders the detection and correction of such misalignments, and the trust …
Artificial collective intelligence engineering: a survey of concepts and perspectives
R Casadei - Artificial Life, 2023 - ieeexplore.ieee.org
Collectiveness is an important property of many systems—both natural and artificial. By
exploiting a large number of individuals, it is often possible to produce effects that go far …
exploiting a large number of individuals, it is often possible to produce effects that go far …
Instructing goal-conditioned reinforcement learning agents with temporal logic objectives
Goal-conditioned reinforcement learning (RL) is a powerful approach for learning general-
purpose skills by reaching diverse goals. However, it has limitations when it comes to task …
purpose skills by reaching diverse goals. However, it has limitations when it comes to task …
Show me the way! Bilevel search for synthesizing programmatic strategies
DS Aleixo, LHS Lelis - Proceedings of the AAAI Conference on Artificial …, 2023 - ojs.aaai.org
The synthesis of programmatic strategies requires one to search in large non-differentiable
spaces of computer programs. Current search algorithms use self-play approaches to guide …
spaces of computer programs. Current search algorithms use self-play approaches to guide …
Synthesizing programmatic reinforcement learning policies with large language model guided search
Programmatic reinforcement learning (PRL) has been explored for representing policies
through programs as a means to achieve interpretability and generalization. Despite …
through programs as a means to achieve interpretability and generalization. Despite …