Large sequence models for sequential decision-making: a survey

M Wen, R Lin, H Wang, Y Yang, Y Wen, L Mai… - Frontiers of Computer …, 2023 - Springer
Transformer architectures have facilitated the development of large-scale and general-
purpose sequence models for prediction tasks in natural language processing and computer …

Performance enhancement of artificial intelligence: A survey

M Krichen, MS Abdalzaher - Journal of Network and Computer Applications, 2024 - Elsevier
The advent of machine learning (ML) and Artificial intelligence (AI) has brought about a
significant transformation across multiple industries, as it has facilitated the automation of …

[PDF][PDF] Theoretical approaches to AI in supply chain optimization: Pathways to efficiency and resilience

EA Abaku, TE Edunjobi… - International Journal of …, 2024 - pdfs.semanticscholar.org
Abstract The integration of Artificial Intelligence (AI) into supply chain management has
emerged as a pivotal avenue for enhancing efficiency and resilience in contemporary …

Energy management for demand response in networked greenhouses with multi-agent deep reinforcement learning

A Ajagekar, B Decardi-Nelson, F You - Applied Energy, 2024 - Elsevier
Greenhouses are key to ensuring food security and realizing a sustainable future for
agriculture. However, to ensure crop growth efficiency, greenhouses consume a significant …

The state of AI-empowered backscatter communications: A comprehensive survey

F Xu, T Hussain, M Ahmed, K Ali… - IEEE Internet of …, 2023 - ieeexplore.ieee.org
The Internet of Things (IoT) is undergoing significant advancements, driven by the
emergence of backscatter communication (BC) and artificial intelligence (AI). BC is an …

Bridging evolutionary algorithms and reinforcement learning: A comprehensive survey on hybrid algorithms

P Li, J Hao, H Tang, X Fu, Y Zhen… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Evolutionary Reinforcement Learning (ERL), which integrates Evolutionary Algorithms (EAs)
and Reinforcement Learning (RL) for optimization, has demonstrated remarkable …

Small batch deep reinforcement learning

J Obando Ceron, M Bellemare… - Advances in Neural …, 2024 - proceedings.neurips.cc
In value-based deep reinforcement learning with replay memories, the batch size parameter
specifies how many transitions to sample for each gradient update. Although critical to the …

Transient gas path fault diagnosis of aero-engine based on domain adaptive offline reinforcement learning

J Xu, Y Wang, Z Wang, X Wang, Y Zhao - Aerospace Science and …, 2024 - Elsevier
Real-time measurement parameters are crucial for diagnosing faults in aero-engine gas
path performance, ensuring engine reliability, and mitigating potential economic losses …

Behavior contrastive learning for unsupervised skill discovery

R Yang, C Bai, H Guo, S Li, B Zhao… - International …, 2023 - proceedings.mlr.press
In reinforcement learning, unsupervised skill discovery aims to learn diverse skills without
extrinsic rewards. Previous methods discover skills by maximizing the mutual information …

Semantically aligned task decomposition in multi-agent reinforcement learning

W Li, D Qiao, B Wang, X Wang, B **, H Zha - arxiv preprint arxiv …, 2023 - arxiv.org
The difficulty of appropriately assigning credit is particularly heightened in cooperative
MARL with sparse reward, due to the concurrent time and structural scales involved …