Explainable artificial intelligence by genetic programming: A survey

Y Mei, Q Chen, A Lensen, B Xue… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Explainable artificial intelligence (XAI) has received great interest in the recent decade, due
to its importance in critical application domains, such as self-driving cars, law, and …

Towards continual reinforcement learning: A review and perspectives

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org
In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

A survey of meta-reinforcement learning

J Beck, R Vuorio, EZ Liu, Z **ong, L Zintgraf… - arxiv preprint arxiv …, 2023 - arxiv.org
While deep reinforcement learning (RL) has fueled multiple high-profile successes in
machine learning, it is held back from more widespread adoption by its often poor data …

Hyperparameters in reinforcement learning and how to tune them

T Eimer, M Lindauer… - … conference on machine …, 2023 - proceedings.mlr.press
In order to improve reproducibility, deep reinforcement learning (RL) has been adopting
better scientific practices such as standardized evaluation metrics and reporting. However …

Discovered policy optimisation

C Lu, J Kuba, A Letcher, L Metz… - Advances in …, 2022 - proceedings.neurips.cc
Tremendous progress has been made in reinforcement learning (RL) over the past decade.
Most of these advancements came through the continual development of new algorithms …

General-purpose in-context learning by meta-learning transformers

L Kirsch, J Harrison, J Sohl-Dickstein, L Metz - arxiv preprint arxiv …, 2022 - arxiv.org
Modern machine learning requires system designers to specify aspects of the learning
pipeline, such as losses, architectures, and optimizers. Meta-learning, or learning-to-learn …

Discovering preference optimization algorithms with and for large language models

C Lu, S Holt, C Fanconi, A Chan… - Advances in …, 2025 - proceedings.neurips.cc
Offline preference optimization is a key method for enhancing and controlling the quality of
Large Language Model (LLM) outputs. Typically, preference optimization is approached as …

Evolutionary reinforcement learning: A survey

H Bai, R Cheng, Y ** - Intelligent Computing, 2023 - spj.science.org
Reinforcement learning (RL) is a machine learning approach that trains agents to maximize
cumulative rewards through interactions with environments. The integration of RL with deep …

On the effectiveness of fine-tuning versus meta-reinforcement learning

M Zhao, P Abbeel, S James - Advances in neural …, 2022 - proceedings.neurips.cc
Intelligent agents should have the ability to leverage knowledge from previously learned
tasks in order to learn new ones quickly and efficiently. Meta-learning approaches have …

[HTML][HTML] A survey of deep reinforcement learning application in 5G and beyond network slicing and virtualization

C Ssengonzi, OP Kogeda, TO Olwal - Array, 2022 - Elsevier
Abstract The 5th Generation (5G) and beyond networks are expected to offer huge
throughputs, connect large number of devices, support low latency and large numbers of …