Weixun Wang
Alibaba
Verified email at tju.edu.cn
Title
Cited by
Year
Multi-Agent Game Abstraction via Graph Attention Neural Network
Y Liu*, W Wang*, Y Hu, J Hao, X Chen, Y Gao
AAAI 2020, 2020
Cited by: 287, Year: 2020
Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping
Y Hu, W Wang, H Jia, Y Wang, Y Chen, J Hao, F Wu, C Fan
Advances in Neural Information Processing Systems 33, 2020
Cited by: 210, Year: 2020
The 37 implementation details of proximal policy optimization
S Huang, RFJ Dossa, A Raffin, A Kanervisto, W Wang
ICLR Blog Track, 2022
Cited by: 130, Year: 2022
From Few to More: Large-scale Dynamic Multiagent Curriculum Learning
W Wang, T Yang, Y Liu, J Hao, X Hao, Y Hu, Y Chen, C Fan, Y Gao
AAAI 2020, 2020
Cited by: 128, Year: 2020
Rethinking the Implementation Tricks and Monotonicity Constraint in Cooperative Multi-agent Reinforcement Learning
J Hu, W Siying, S Jiang, W Wang
ICLR Blogposts, 2023
Cited by: 102, Year: 2023
Achieving cooperation through deep multiagent reinforcement learning in sequential prisoner's dilemmas
W Wang, J Hao, Y Wang, M Taylor
Proceedings of the First International Conference on Distributed Artificial …, 2019
Cited by: 59*, Year: 2019
An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning
T Yang*, W Wang*, H Tang*, HAO Jianye, Z Meng, H Mao, D Li, W Liu, ...
Thirty-Fifth Conference on Neural Information Processing Systems, 2021
Cited by: 44*, Year: 2021
Action Semantics Network: Considering the Effects of Actions in Multiagent Systems
W Wang, T Yang, Y Liu, J Hao, X Hao, Y Hu, Y Chen, C Fan, Y Gao
ICLR 2020, 2020
Cited by: 42, Year: 2020
MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning Library
S Hu, Y Zhong, M Gao, W Wang, H Dong, X Liang, Z Li, X Chang, Y Yang
Journal of Machine Learning Research 24 (315), 1-23, 2023
Cited by: 40*, Year: 2023
KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge
P Zhang, J Hao, W Wang, H Tang, Y Ma, Y Duan, Y Zheng
IJCAI 2020, 2020
Cited by: 40, Year: 2020
Efficient Deep Reinforcement Learning via Adaptive Policy Transfer
T Yang, J Hao, Z Meng, Z Zhang, Y Hu, Y Chen, C Fan, W Wang, W Liu, ...
IJCAI 2020, 2020
Cited by: 39, Year: 2020
Individual Reward Assisted Multi-Agent Reinforcement Learning
L Wang, Y Zhang, Y Hu, W Wang, C Zhang, Y Gao, J Hao, T Lv, C Fan
International Conference on Machine Learning, 23417-23432, 2022
Cited by: 38, Year: 2022
Boosting Multiagent Reinforcement Learning via Permutation Invariant and Permutation Equivariant Networks
HAO Jianye, X Hao, H Mao, W Wang, Y Yang, D Li, Y Zheng, Z Wang
The Eleventh International Conference on Learning Representations, 2023
Cited by: 37*, Year: 2023
A2C is a special case of PPO
S Huang, A Kanervisto, A Raffin, W Wang, S Ontañón, RFJ Dossa
arXiv preprint arXiv:2205.09123, 2022
Cited by: 29, Year: 2022
Independent Generative Adversarial Self-Imitation Learning in Cooperative Multiagent Systems
X Hao*, W Wang*, J Hao, Y Yang
Proceedings of the 18th International Conference on Autonomous Agents and …, 2019
Cited by: 29, Year: 2019
Cooperative Multi-Agent Transfer Learning with Coalition Pattern Decomposition
T Zhou, F Zhang, K Shao, Z Dai, K Li, W Huang, W Wang, B Wang, D Li, ...
IEEE Transactions on Games, 2023
Cited by: 25*, Year: 2023
Background-free upconversion-encoded microspheres for mycotoxin detection based on a rapid visualization method
M Yang, M Cui, W Wang, Y Yang, J Chang, J Hao, H Wang
Analytical and bioanalytical chemistry 412, 81-91, 2020
Cited by: 22, Year: 2020
Learning Adaptive Display Exposure for Real-Time Advertising
W Wang, J Jin, J Hao, C Chen, C Yu, W Zhang, J Wang, X Hao, Y Wang, ...
CIKM 2019, 2019
Cited by: 22*, Year: 2019
The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization
S Huang, M Noukhovitch, A Hosseini, K Rasul, W Wang, L Tunstall
arXiv preprint arXiv:2403.17031, 2024
Cited by: 18, Year: 2024
Transformer-based Working Memory for Multiagent Reinforcement Learning with Action Parsing
Y Yang, G Chen, W Wang, X Hao, HAO Jianye, PA Heng
Advances in Neural Information Processing Systems, 2022
Cited by: 17, Year: 2022
Articles 1–20