Synergistic integration between machine learning and agent-based modeling: A multidisciplinary review

W Zhang, A Valencia, NB Chang - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Agent-based modeling (ABM) involves develo** models in which agents make adaptive
decisions in a changing environment. Machine-learning (ML) based inference models can …

Automatic web testing using curiosity-driven reinforcement learning

Y Zheng, Y Liu, X **e, Y Liu, L Ma… - 2021 IEEE/ACM 43rd …, 2021 - ieeexplore.ieee.org
Web testing has long been recognized as a notoriously difficult task. Even nowadays, web
testing still mainly relies on manual efforts in many cases while automated web testing is still …

Vulnerability assessment of deep reinforcement learning models for power system topology optimization

Y Zheng, Z Yan, K Chen, J Sun, Y Xu… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
This paper studies the vulnerability of deep reinforcement learning (DRL) models for power
systems topology optimization under data perturbations and cyber-attack. DRL has recently …

GALOIS: boosting deep reinforcement learning via generalizable logic synthesis

Y Cao, Z Li, T Yang, H Zhang… - Advances in …, 2022 - proceedings.neurips.cc
Despite achieving superior performance in human-level control problems, unlike humans,
deep reinforcement learning (DRL) lacks high-order intelligence (eg, logic deduction and …

Neural episodic control with state abstraction

Z Li, D Zhu, Y Hu, X **e, L Ma, Y Zheng, Y Song… - arxiv preprint arxiv …, 2023 - arxiv.org
Existing Deep Reinforcement Learning (DRL) algorithms suffer from sample inefficiency.
Generally, episodic control-based approaches are solutions that leverage highly-rewarded …

L2E: Learning to exploit your opponent

Z Wu, K Li, H Xu, Y Zang, B An… - 2022 International Joint …, 2022 - ieeexplore.ieee.org
Opponent modeling is essential to exploit sub-optimal opponents in strategic interactions.
Most previous works focus on building explicit models to predict the opponents' styles or …

Depthwise convolution for multi-agent communication with enhanced mean-field approximation

D **e, Z Wang, C Chen, D Dong - IEEE Transactions on Neural …, 2022 - ieeexplore.ieee.org
Multi-Agent settings remain a fundamental challenge in the reinforcement learning (RL)
domain due to the partial observability and the lack of accurate real-time interactions across …

Efficient Bayesian policy reuse with a scalable observation model in deep reinforcement learning

J Liu, Z Wang, C Chen, D Dong - IEEE Transactions on Neural …, 2023 - ieeexplore.ieee.org
Bayesian policy reuse (BPR) is a general policy transfer framework for selecting a source
policy from an offline library by inferring the task belief based on some observation signals …

Accurate policy detection and efficient knowledge reuse against multi-strategic opponents

H Chen, Q Liu, K Fu, J Huang, C Wang… - Knowledge-Based Systems, 2022 - Elsevier
In Markov games, how to respond quickly and optimally for an agent against opponents that
follow changing policies is an open problem. Most state-of-the-art algorithms assume that …

Multi-expert distillation for few-shot coordination (student abstract)

Y Zhu, H Ding, Z Zhang - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
Ad hoc teamwork is a crucial challenge that aims to design an agent capable of effective
collaboration with teammates employing diverse strategies without prior coordination …