Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Synergistic integration between machine learning and agent-based modeling: A multidisciplinary review
Agent-based modeling (ABM) involves develo** models in which agents make adaptive
decisions in a changing environment. Machine-learning (ML) based inference models can …
decisions in a changing environment. Machine-learning (ML) based inference models can …
Automatic web testing using curiosity-driven reinforcement learning
Web testing has long been recognized as a notoriously difficult task. Even nowadays, web
testing still mainly relies on manual efforts in many cases while automated web testing is still …
testing still mainly relies on manual efforts in many cases while automated web testing is still …
Vulnerability assessment of deep reinforcement learning models for power system topology optimization
This paper studies the vulnerability of deep reinforcement learning (DRL) models for power
systems topology optimization under data perturbations and cyber-attack. DRL has recently …
systems topology optimization under data perturbations and cyber-attack. DRL has recently …
GALOIS: boosting deep reinforcement learning via generalizable logic synthesis
Despite achieving superior performance in human-level control problems, unlike humans,
deep reinforcement learning (DRL) lacks high-order intelligence (eg, logic deduction and …
deep reinforcement learning (DRL) lacks high-order intelligence (eg, logic deduction and …
Neural episodic control with state abstraction
Existing Deep Reinforcement Learning (DRL) algorithms suffer from sample inefficiency.
Generally, episodic control-based approaches are solutions that leverage highly-rewarded …
Generally, episodic control-based approaches are solutions that leverage highly-rewarded …
L2E: Learning to exploit your opponent
Opponent modeling is essential to exploit sub-optimal opponents in strategic interactions.
Most previous works focus on building explicit models to predict the opponents' styles or …
Most previous works focus on building explicit models to predict the opponents' styles or …
Depthwise convolution for multi-agent communication with enhanced mean-field approximation
Multi-Agent settings remain a fundamental challenge in the reinforcement learning (RL)
domain due to the partial observability and the lack of accurate real-time interactions across …
domain due to the partial observability and the lack of accurate real-time interactions across …
Efficient Bayesian policy reuse with a scalable observation model in deep reinforcement learning
Bayesian policy reuse (BPR) is a general policy transfer framework for selecting a source
policy from an offline library by inferring the task belief based on some observation signals …
policy from an offline library by inferring the task belief based on some observation signals …
Accurate policy detection and efficient knowledge reuse against multi-strategic opponents
In Markov games, how to respond quickly and optimally for an agent against opponents that
follow changing policies is an open problem. Most state-of-the-art algorithms assume that …
follow changing policies is an open problem. Most state-of-the-art algorithms assume that …
Multi-expert distillation for few-shot coordination (student abstract)
Y Zhu, H Ding, Z Zhang - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
Ad hoc teamwork is a crucial challenge that aims to design an agent capable of effective
collaboration with teammates employing diverse strategies without prior coordination …
collaboration with teammates employing diverse strategies without prior coordination …