Stebėti
Haobo Fu
Haobo Fu
Tencent AI Lab, University of Birmingham
Patvirtintas el. paštas tencent.com
Pavadinimas
Cituota
Cituota
Metai
Parametrized deep q-networks learning: Reinforcement learning with discrete-continuous hybrid action space
J Xiong, Q Wang, Z Yang, P Sun, L Han, Y Zheng, H Fu, T Zhang, J Liu, ...
arXiv preprint arXiv:1810.06394, 2018
2432018
Finding robust solutions to dynamic optimization problems
H Fu, B Sendhoff, K Tang, X Yao
European Conference on the Applications of Evolutionary Computation, 616-625, 2013
692013
Robust optimization over time: Problem difficulties and benchmark problems
H Fu, B Sendhoff, K Tang, X Yao
IEEE Transactions on Evolutionary Computation 19 (5), 731-745, 2014
612014
What are dynamic optimization problems?
H Fu, PR Lewis, B Sendhoff, K Tang, X Yao
2014 IEEE congress on evolutionary computation (CEC), 1550-1557, 2014
402014
Actor-critic policy optimization in a large-scale imperfect-information game
H Fu, W Liu, S Wu, Y Wang, T Yang, K Li, J Xing, B Li, B Ma, Q Fu, Y Wei
International Conference on Learning Representations, 2021
312021
L2E: Learning to exploit your opponent
Z Wu, K Li, H Xu, Y Zang, B An, J Xing
2022 International Joint Conference on Neural Networks (IJCNN), 1-8, 2022
292022
Characterizing environmental changes in robust optimization over time
H Fu, B Sendhoff, K Tang, X Yao
2012 IEEE Congress on Evolutionary Computation, 1-8, 2012
292012
Find robust solutions over time by two-layer multi-objective optimization method
Y Guo, M Chen, H Fu, Y Liu
2014 IEEE Congress on Evolutionary Computation (CEC), 1528-1535, 2014
252014
Enhance reasoning for large language models in the game werewolf
S Wu, L Zhu, T Yang, S Xu, Q Fu, Y Wei, H Fu
arXiv preprint arXiv:2402.02330, 2024
222024
Quality-similar diversity via population based reinforcement learning
S Wu, J Yao, H Fu, Y Tian, C Qian, Y Yang, Q Fu, Y Wei
The eleventh international conference on learning representations, 2023
202023
Heterogeneous multi-agent zero-shot coordination by coevolution
K Xue, Y Wang, C Guan, L Yuan, H Fu, Q Fu, C Qian, Y Yu
IEEE Transactions on Evolutionary Computation, 2024
172024
Memetic algorithm with heuristic candidate list strategy for capacitated arc routing problem
H Fu, Y Mei, K Tang, Y Zhu
IEEE Congress on Evolutionary Computation, 1-8, 2010
162010
Greedy when sure and conservative when uncertain about the opponents
H Fu, Y Tian, H Yu, W Liu, S Wu, J Xiong, Y Wen, K Li, J Xing, Q Fu, ...
International Conference on Machine Learning, 6829-6848, 2022
152022
Automatic grouping for efficient cooperative multi-agent reinforcement learning
Y Zang, J He, K Li, H Fu, Q Fu, J Xing, J Cheng
Advances in Neural Information Processing Systems 36, 46105-46121, 2023
142023
Policy space diversity for non-transitive games
J Yao, W Liu, H Fu, Y Yang, S McAleer, Q Fu, W Yang
Advances in Neural Information Processing Systems 36, 67771-67793, 2023
142023
Maximum entropy heterogeneous-agent reinforcement learning
J Liu, Y Zhong, S Hu, H Fu, Q Fu, X Chang, Y Yang
arXiv preprint arXiv:2306.10715, 2023
142023
Autocfr: Learning to design counterfactual regret minimization algorithms
H Xu, K Li, H Fu, Q Fu, J Xing
Proceedings of the AAAI Conference on Artificial Intelligence 36 (5), 5244-5251, 2022
132022
Sequential cooperative multi-agent reinforcement learning
Y Zang, J He, K Li, H Fu, Q Fu, J Xing
Proceedings of the 2023 International Conference on Autonomous Agents and …, 2023
102023
Curriculum-based co-design of morphology and control of voxel-based soft robots
Y Wang, S Wu, H Fu, Q Fu, T Zhang, Y Chang, X Wang
The Eleventh International Conference on Learning Representations, 2022
102022
Dynamic discounted counterfactual regret minimization
H Xu, K Li, H Fu, Q Fu, J Xing, J Cheng
The Twelfth International Conference on Learning Representations, 2024
72024
Sistema negali atlikti operacijos. Bandykite vėliau dar kartą.
Straipsniai 1–20