Bridging evolutionary algorithms and reinforcement learning: A comprehensive survey on hybrid algorithms

P Li, J Hao, H Tang, X Fu, Y Zhen… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Evolutionary Reinforcement Learning (ERL), which integrates Evolutionary Algorithms (EAs)
and Reinforcement Learning (RL) for optimization, has demonstrated remarkable …

A survey on evolutionary reinforcement learning algorithms

Q Zhu, X Wu, Q Lin, L Ma, J Li, Z Ming, J Chen - Neurocomputing, 2023 - Elsevier
Reinforcement Learning (RL) has proven to be highly effective in various real-world
applications. However, in certain scenarios, Evolutionary Algorithms (EAs) have been …

Prediction of coalbed methane production based on deep learning

Z Guo, J Zhao, Z You, Y Li, S Zhang, Y Chen - Energy, 2021 - Elsevier
Coalbed methane (CBM) is a clean energy source. The prediction of CBM production is a
critical step during CBM exploitation and utilization, especially for geological well selection …

Combining evolution and deep reinforcement learning for policy search: A survey

O Sigaud - ACM Transactions on Evolutionary Learning, 2023 - dl.acm.org
Deep neuroevolution and deep Reinforcement Learning have received a lot of attention
over the past few years. Some works have compared them, highlighting their pros and cons …

Evolutionary reinforcement learning: a systematic review and future directions

Y Lin, F Lin, G Cai, H Chen, L Zou, P Wu - arxiv preprint arxiv:2402.13296, 2024 - arxiv.org
In response to the limitations of reinforcement learning and evolutionary algorithms (EAs) in
complex problem-solving, Evolutionary Reinforcement Learning (EvoRL) has emerged as a …

Diversity policy gradient for sample efficient quality-diversity optimization

T Pierrot, V Macé, F Chalumeau, A Flajolet… - Proceedings of the …, 2022 - dl.acm.org
A fascinating aspect of nature lies in its ability to produce a large and diverse collection of
organisms that are all high-performing in their niche. By contrast, most AI algorithms focus …

An off-policy trust region policy optimization method with monotonic improvement guarantee for deep reinforcement learning

W Meng, Q Zheng, Y Shi, G Pan - IEEE Transactions on Neural …, 2021 - ieeexplore.ieee.org
In deep reinforcement learning, off-policy data help reduce on-policy interaction with the
environment, and the trust region policy optimization (TRPO) method is efficient to stabilize …

Forecasting short-term methane based on corrected numerical weather prediction outputs

S Zhao, L Wu, Y **ang, F Zhang - Journal of Cleaner Production, 2024 - Elsevier
Methane (CH 4) represents a significant greenhouse gas, and the control of its emissions is
crucial in impacting global climate change. Accurate forecasting of CH 4 emissions is …

Neuroevolution is a competitive alternative to reinforcement learning for skill discovery

F Chalumeau, R Boige, B Lim, V Macé, M Allard… - arxiv preprint arxiv …, 2022 - arxiv.org
Deep Reinforcement Learning (RL) has emerged as a powerful paradigm for training neural
policies to solve complex control tasks. However, these policies tend to be overfit to the …

Recognition of chronic renal failure based on Raman spectroscopy and convolutional neural network

R Gao, B Yang, C Chen, F Chen, C Chen… - Photodiagnosis and …, 2021 - Elsevier
Purpose Chronic renal failure (CRF) is a disease with a high morbidity rate that can develop
into uraemia, resulting in a series of complications, such as dyspnoea, mental disorders …