A survey on transformers in reinforcement learning
Transformer has been considered the dominating neural architecture in NLP and CV, mostly
under supervised settings. Recently, a similar surge of using Transformers has appeared in …
under supervised settings. Recently, a similar surge of using Transformers has appeared in …
MERLIN: Multi-agent offline and transfer learning for occupant-centric operation of grid-interactive communities
Building and power generation decarbonization present new challenges in electric grid
reliability as a result of renewable energy source intermittency and increase in grid load …
reliability as a result of renewable energy source intermittency and increase in grid load …
Future-conditioned unsupervised pretraining for decision transformer
Recent research in offline reinforcement learning (RL) has demonstrated that return-
conditioned supervised learning is a powerful paradigm for decision-making problems …
conditioned supervised learning is a powerful paradigm for decision-making problems …
Transformer in reinforcement learning for decision-making: a survey
Reinforcement learning (RL) has become a dominant decision-making paradigm and has
achieved notable success in many real-world applications. Notably, deep neural networks …
achieved notable success in many real-world applications. Notably, deep neural networks …
A coordinated active and reactive power optimization approach for multi-microgrids connected to distribution networks with multi-actor-attention-critic deep …
L Dong, H Lin, J Qiao, T Zhang, S Zhang, T Pu - Applied Energy, 2024 - Elsevier
As a promising approach to managing distributed energy, the use of microgrids has attracted
significant attention among those managing continuous connections to distribution networks …
significant attention among those managing continuous connections to distribution networks …
Investigating multi-task pretraining and generalization in reinforcement learning
Deep reinforcement learning~(RL) has achieved remarkable successes in complex single-
task settings. However, designing RL agents that can learn multiple tasks and leverage prior …
task settings. However, designing RL agents that can learn multiple tasks and leverage prior …
Learning to discover skills through guidance
In the field of unsupervised skill discovery (USD), a major challenge is limited exploration,
primarily due to substantial penalties when skills deviate from their initial trajectories. To …
primarily due to substantial penalties when skills deviate from their initial trajectories. To …
A survey on influence maximization: From an ml-based combinatorial optimization
Influence Maximization (IM) is a classical combinatorial optimization problem, which can be
widely used in mobile networks, social computing, and recommendation systems. It aims at …
widely used in mobile networks, social computing, and recommendation systems. It aims at …
Intra-domain knowledge reuse assisted reinforcement learning for fast anti-jamming communication
Q Zhou, Y Niu, P **ang, Y Li - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
The novel technology based on reinforcement learning (RL) is considered as a promising
direction to achieve cognitive and even intelligent anti-jamming communication. However …
direction to achieve cognitive and even intelligent anti-jamming communication. However …
End-to-end (instance)-image goal navigation through correspondence as an emergent phenomenon
Most recent work in goal oriented visual navigation resorts to large-scale machine learning
in simulated environments. The main challenge lies in learning compact representations …
in simulated environments. The main challenge lies in learning compact representations …