Learning Diverse Risk Preferences in Population-Based Self-Play. Y Jiang, Q Liu, X Ma, C Li, Y Yang, J Yang, B Liang, Q Zhao. Proceedings of the AAAI Conference on Artificial Intelligence 38 (11), 12910 …, 2024 | 4 | 2024 |
Episodic Novelty Through Temporal Distance. Y Jiang, Q Liu, Y Yang, X Ma, D Zhong, H Hu, J Yang, B Liang, B Xu, ... arXiv preprint arXiv:2501.15418, 2025 | | 2025 |