Large language models play StarCraft II: Benchmarks and a chain of summarization approach W Ma, Q Mi, Y Zeng, X Yan, Y Wu, R Lin, H Zhang, J Wang arXiv preprint arXiv:2312.11865, 2023 | 36 | 2023 |
Estimating α-Rank from A Few Entries with Low Rank Matrix Completion Y Du, X Yan, X Chen, J Wang, H Zhang International Conference on Machine Learning, 2870-2879, 2021 | 12 | 2021 |
An efficient end-to-end training approach for zero-shot human-AI coordination X Yan, J Guo, X Lou, J Wang, H Zhang, Y Du Advances in Neural Information Processing Systems 36, 2024 | 10 | 2024 |
Learning to identify top Elo ratings: A dueling bandits approach X Yan, Y Du, B Ru, J Wang, H Zhang, X Chen Proceedings of the AAAI Conference on Artificial Intelligence 36 (8), 8797-8805, 2022 | 9 | 2022 |
Ask more, know better: Reinforce-Learned Prompt Questions for Decision Making with Large Language Models X Yan, Y Song, X Cui, F Christianos, H Zhang, DH Mguni, J Wang arXiv preprint arXiv:2310.18127, 2023 | 6 | 2023 |
Efficient Reinforcement Learning with Large Language Model Priors X Yan, Y Song, X Feng, M Yang, H Zhang, HB Ammar, J Wang arXiv preprint arXiv:2410.07927, 2024 | 1 | 2024 |