- Academic Search

A Heuillet, F Couthouis, N Díaz-Rodríguez - Knowledge-Based Systems, 2021 - Elsevier

A large set of the explainable Artificial Intelligence (XAI) literature is emerging on feature
relevance techniques to explain a deep neural network (DNN) output or explaining models …

Enregistrer Citer Cité 405 fois Autres articles Les 12 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

A survey on explainable reinforcement learning: Concepts, algorithms, challenges

Y Qing, S Liu, J Song, H Wang, M Song - arxiv preprint arxiv:2211.06665, 2022 - arxiv.org

Reinforcement Learning (RL) is a popular machine learning paradigm where intelligent
agents interact with the environment to fulfill a long-term goal. Driven by the resurgence of …

Enregistrer Citer Cité 35 fois Autres articles Les 3 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

How to reuse and compose knowledge for a lifetime of tasks: A survey on continual learning and functional composition

JA Mendez, E Eaton - arxiv preprint arxiv:2207.07730, 2022 - arxiv.org

A major goal of artificial intelligence (AI) is to create an agent capable of acquiring a general
understanding of the world. Such an agent would require the ability to continually …

Enregistrer Citer Cité 32 fois Autres articles Les 3 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] neurips.cc

On the expressivity of markov reward

D Abel, W Dabney, A Harutyunyan… - Advances in …, 2021 - proceedings.neurips.cc

Reward is the driving force for reinforcement-learning agents. This paper is dedicated to
understanding the expressivity of reward as a way to capture tasks that we would want an …

Enregistrer Citer Cité 106 fois Autres articles Les 12 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

A survey on interpretable reinforcement learning

C Glanois, P Weng, M Zimmer, D Li, T Yang, J Hao… - Machine Learning, 2024 - Springer

Although deep reinforcement learning has become a promising machine learning approach
for sequential decision-making problems, it is still not mature enough for high-stake domains …

Enregistrer Citer Cité 106 fois Autres articles Les 3 versions Free GPT-4

[Free GPT-4]

[PDF] jair.org Full View

Autotelic agents with intrinsically motivated goal-conditioned reinforcement learning: a short survey

C Colas, T Karch, O Sigaud, PY Oudeyer - Journal of Artificial Intelligence …, 2022 - jair.org

Building autonomous machines that can explore open-ended environments, discover
possible interactions and build repertoires of skills is a general objective of artificial …

Enregistrer Citer Cité 135 fois Autres articles Les 25 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] mlr.press

Optimistic linear support and successor features as a basis for optimal policy transfer

LN Alegre, A Bazzan… - … conference on machine …, 2022 - proceedings.mlr.press

In many real-world applications, reinforcement learning (RL) agents might have to solve
multiple tasks, each one typically modeled via a reward function. If reward functions are …

Enregistrer Citer Cité 42 fois Autres articles Les 7 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] neurips.cc

Constraint-conditioned policy optimization for versatile safe reinforcement learning

Y Yao, Z Liu, Z Cen, J Zhu, W Yu… - Advances in Neural …, 2024 - proceedings.neurips.cc

Safe reinforcement learning (RL) focuses on training reward-maximizing agents subject to
pre-defined safety constraints. Yet, learning versatile safe policies that can adapt to varying …

Enregistrer Citer Cité 13 fois Autres articles Les 8 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] neurips.cc

Mocoda: Model-based counterfactual data augmentation

S Pitis, E Creager, A Mandlekar… - Advances in Neural …, 2022 - proceedings.neurips.cc

The number of states in a dynamic process is exponential in the number of objects, making
reinforcement learning (RL) difficult in complex, multi-object domains. For agents to scale to …

Enregistrer Citer Cité 42 fois Autres articles Les 6 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

Diversifying ai: Towards creative chess with alphazero

T Zahavy, V Veeriah, S Hou, K Waugh, M Lai… - arxiv preprint arxiv …, 2023 - arxiv.org

In recent years, Artificial Intelligence (AI) systems have surpassed human intelligence in a
variety of computational tasks. However, AI systems, like humans, make mistakes, have …

Enregistrer Citer Cité 22 fois Autres articles Les 2 versions Free GPT-4 Version HTML

Créer l'alerte

Citer

Recherche avancée

Enregistré dans Ma bibliothèque

A boolean task algebra for reinforcement learning

Explainability in deep reinforcement learning

A survey on explainable reinforcement learning: Concepts, algorithms, challenges

How to reuse and compose knowledge for a lifetime of tasks: A survey on continual learning and functional composition

On the expressivity of markov reward

A survey on interpretable reinforcement learning

Autotelic agents with intrinsically motivated goal-conditioned reinforcement learning: a short survey

Optimistic linear support and successor features as a basis for optimal policy transfer

Constraint-conditioned policy optimization for versatile safe reinforcement learning

Mocoda: Model-based counterfactual data augmentation

Diversifying ai: Towards creative chess with alphazero