Google 학술 검색

Synergistic integration between machine learning and agent-based modeling: A multidisciplinary review

W Zhang, A Valencia, NB Chang - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

Agent-based modeling (ABM) involves develo** models in which agents make adaptive
decisions in a changing environment. Machine-learning (ML) based inference models can …

저장 인용 69회 인용 관련 학술자료 전체 4개의 버전

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Automatic web testing using curiosity-driven reinforcement learning

Y Zheng, Y Liu, X **e, Y Liu, L Ma… - 2021 IEEE/ACM 43rd …, 2021 - ieeexplore.ieee.org

Web testing has long been recognized as a notoriously difficult task. Even nowadays, web
testing still mainly relies on manual efforts in many cases while automated web testing is still …

저장 인용 99회 인용 관련 학술자료 전체 9개의 버전

Vulnerability assessment of deep reinforcement learning models for power system topology optimization

Y Zheng, Z Yan, K Chen, J Sun, Y Xu… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

This paper studies the vulnerability of deep reinforcement learning (DRL) models for power
systems topology optimization under data perturbations and cyber-attack. DRL has recently …

저장 인용 49회 인용 관련 학술자료

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

GALOIS: boosting deep reinforcement learning via generalizable logic synthesis

Y Cao, Z Li, T Yang, H Zhang… - Advances in …, 2022 - proceedings.neurips.cc

Despite achieving superior performance in human-level control problems, unlike humans,
deep reinforcement learning (DRL) lacks high-order intelligence (eg, logic deduction and …

저장 인용 27회 인용 관련 학술자료 전체 7개의 버전 HTML 버전

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Neural episodic control with state abstraction

Z Li, D Zhu, Y Hu, X **e, L Ma, Y Zheng, Y Song… - arxiv preprint arxiv …, 2023 - arxiv.org

Existing Deep Reinforcement Learning (DRL) algorithms suffer from sample inefficiency.
Generally, episodic control-based approaches are solutions that leverage highly-rewarded …

저장 인용 12회 인용 관련 학술자료 전체 8개의 버전 HTML 버전

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

L2E: Learning to exploit your opponent

Z Wu, K Li, H Xu, Y Zang, B An… - 2022 International Joint …, 2022 - ieeexplore.ieee.org

Opponent modeling is essential to exploit sub-optimal opponents in strategic interactions.
Most previous works focus on building explicit models to predict the opponents' styles or …

저장 인용 28회 인용 관련 학술자료 전체 5개의 버전

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Depthwise convolution for multi-agent communication with enhanced mean-field approximation

D **e, Z Wang, C Chen, D Dong - IEEE Transactions on Neural …, 2022 - ieeexplore.ieee.org

Multi-Agent settings remain a fundamental challenge in the reinforcement learning (RL)
domain due to the partial observability and the lack of accurate real-time interactions across …

저장 인용 9회 인용 관련 학술자료 전체 4개의 버전

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Efficient Bayesian policy reuse with a scalable observation model in deep reinforcement learning

J Liu, Z Wang, C Chen, D Dong - IEEE Transactions on Neural …, 2023 - ieeexplore.ieee.org

Bayesian policy reuse (BPR) is a general policy transfer framework for selecting a source
policy from an offline library by inferring the task belief based on some observation signals …

저장 인용 3회 인용 관련 학술자료 전체 5개의 버전

Accurate policy detection and efficient knowledge reuse against multi-strategic opponents

H Chen, Q Liu, K Fu, J Huang, C Wang… - Knowledge-Based Systems, 2022 - Elsevier

In Markov games, how to respond quickly and optimally for an agent against opponents that
follow changing policies is an open problem. Most state-of-the-art algorithms assume that …

저장 인용 4회 인용 관련 학술자료 전체 2개의 버전

[Free GPT-4]
[DeepSeek]

[PDF] aaai.org

Multi-expert distillation for few-shot coordination (student abstract)

Y Zhu, H Ding, Z Zhang - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org

Ad hoc teamwork is a crucial challenge that aims to design an agent capable of effective
collaboration with teammates employing diverse strategies without prior coordination …

저장 인용 관련 학술자료 전체 2개의 버전 HTML 버전

알림 만들기

인용

고급 검색

라이브러리에 저장됨

Efficient policy detecting and reusing for non-stationarity in markov games

Synergistic integration between machine learning and agent-based modeling: A multidisciplinary review

Automatic web testing using curiosity-driven reinforcement learning

Vulnerability assessment of deep reinforcement learning models for power system topology optimization

GALOIS: boosting deep reinforcement learning via generalizable logic synthesis

Neural episodic control with state abstraction

L2E: Learning to exploit your opponent

Depthwise convolution for multi-agent communication with enhanced mean-field approximation

Efficient Bayesian policy reuse with a scalable observation model in deep reinforcement learning

Accurate policy detection and efficient knowledge reuse against multi-strategic opponents

Multi-expert distillation for few-shot coordination (student abstract)