A survey on transformers in reinforcement learning

W Li, H Luo, Z Lin, C Zhang, Z Lu, D Ye - arxiv preprint arxiv:2301.03044, 2023 - arxiv.org
Transformer has been considered the dominating neural architecture in NLP and CV, mostly
under supervised settings. Recently, a similar surge of using Transformers has appeared in …

MERLIN: Multi-agent offline and transfer learning for occupant-centric operation of grid-interactive communities

K Nweye, S Sankaranarayanan, Z Nagy - Applied Energy, 2023 - Elsevier
Building and power generation decarbonization present new challenges in electric grid
reliability as a result of renewable energy source intermittency and increase in grid load …

Future-conditioned unsupervised pretraining for decision transformer

Z **e, Z Lin, D Ye, Q Fu, Y Wei… - … Conference on Machine …, 2023 - proceedings.mlr.press
Recent research in offline reinforcement learning (RL) has demonstrated that return-
conditioned supervised learning is a powerful paradigm for decision-making problems …

Transformer in reinforcement learning for decision-making: a survey

W Yuan, J Chen, S Chen, D Feng, Z Hu, P Li… - Frontiers of Information …, 2024 - Springer
Reinforcement learning (RL) has become a dominant decision-making paradigm and has
achieved notable success in many real-world applications. Notably, deep neural networks …

A coordinated active and reactive power optimization approach for multi-microgrids connected to distribution networks with multi-actor-attention-critic deep …

L Dong, H Lin, J Qiao, T Zhang, S Zhang, T Pu - Applied Energy, 2024 - Elsevier
As a promising approach to managing distributed energy, the use of microgrids has attracted
significant attention among those managing continuous connections to distribution networks …

Investigating multi-task pretraining and generalization in reinforcement learning

AA Taiga, R Agarwal, J Farebrother… - The Eleventh …, 2023 - openreview.net
Deep reinforcement learning~(RL) has achieved remarkable successes in complex single-
task settings. However, designing RL agents that can learn multiple tasks and leverage prior …

Learning to discover skills through guidance

H Kim, BK Lee, H Lee, D Hwang… - Advances in …, 2024 - proceedings.neurips.cc
In the field of unsupervised skill discovery (USD), a major challenge is limited exploration,
primarily due to substantial penalties when skills deviate from their initial trajectories. To …

A survey on influence maximization: From an ml-based combinatorial optimization

Y Li, H Gao, Y Gao, J Guo, W Wu - ACM Transactions on Knowledge …, 2023 - dl.acm.org
Influence Maximization (IM) is a classical combinatorial optimization problem, which can be
widely used in mobile networks, social computing, and recommendation systems. It aims at …

Intra-domain knowledge reuse assisted reinforcement learning for fast anti-jamming communication

Q Zhou, Y Niu, P **ang, Y Li - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
The novel technology based on reinforcement learning (RL) is considered as a promising
direction to achieve cognitive and even intelligent anti-jamming communication. However …

End-to-end (instance)-image goal navigation through correspondence as an emergent phenomenon

G Bono, L Antsfeld, B Chidlovskii… - arxiv preprint arxiv …, 2023 - arxiv.org
Most recent work in goal oriented visual navigation resorts to large-scale machine learning
in simulated environments. The main challenge lies in learning compact representations …