Прати
Mingxiao Feng
Mingxiao Feng
Верификована је имејл адреса на mail.ustc.edu.cn
Наслов
Навело
Навело
Година
H-tsp: Hierarchically solving the large-scale traveling salesman problem
X Pan, Y Jin, Y Ding, M Feng, L Zhao, L Song, J Bian
Proceedings of the AAAI Conference on Artificial Intelligence 37 (8), 9345-9353, 2023
532023
Playvirtual: Augmenting cycle-consistent virtual trajectories for reinforcement learning
T Yu, C Lan, W Zeng, M Feng, Z Zhang, Z Chen
Advances in Neural Information Processing Systems 34, 5276-5289, 2021
352021
Multi-agent reinforcement learning with shared resources for inventory management
Y Ding, M Feng, G Liu, W Jiang, C Zhang, L Zhao, L Song, H Li, Y Jin, ...
arXiv preprint arXiv:2212.07684, 2022
212022
Stabilizing voltage in power distribution networks via multi-agent reinforcement learning with transformer
M Wang, M Feng, W Zhou, H Li
Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and …, 2022
142022
MA2CL: masked attentive contrastive learning for multi-agent reinforcement learning
H Song, M Feng, W Zhou, H Li
arXiv preprint arXiv:2306.02006, 2023
102023
Suf: Stabilized unconstrained fine-tuning for offline-to-online reinforcement learning
J Feng, M Feng, H Song, W Zhou, H Li
Proceedings of the AAAI Conference on Artificial Intelligence 38 (11), 11961 …, 2024
32024
Sample efficient reinforcement learning with double importance sampling weight clipping
J Han, M Feng, W Zhou, H Li
2023 IEEE Conference on Games (CoG), 1-8, 2023
32023
Multi-Agent Reinforcement Learning with Safety Layer for Active Voltage Control.
Y Shi, M Feng, M Wang, W Zhou, H Li
AAMAS, 1533-1541, 2023
22023
Multi-Agent Hierarchical Graph Attention Reinforcement Learning for Grid-Aware Energy Management
B FENG, M FENG, M WANG, W ZHOU, H LI
ZTE Communications 21 (3), 11, 2023
12023
TIMAR: Transition-informed representation for sample-efficient multi-agent reinforcement learning
M Feng, Y Yang, W Zhou, H Li
Neural Networks 184, 107081, 2025
2025
Recovering Permuted Sequential Features for effective Reinforcement Learning
Y Jiang, M Feng, W Zhou, H Li
Neural Networks 182, 106795, 2025
2025
Систем тренутно не може да изврши ову радњу. Пробајте поново касније.
Чланци 1–11