Principle-driven self-alignment of language models from scratch with minimal human supervision | Z Sun, Y Shen, Q Zhou, H Zhang, Z Chen, D Cox, Y Yang, C Gan | Advances in Neural Information Processing Systems 36, 2024 | 318 | 2024 |
Building cooperative embodied agents modularly with large language models | H Zhang, W Du, J Shan, Q Zhou, Y Du, JB Tenenbaum, T Shu, C Gan | arXiv preprint arXiv:2307.02485, 2023 | 191 | 2023 |
On second thought, let's not think step by step! Bias and toxicity in zero-shot reasoning | O Shaikh, H Zhang, W Held, M Bernstein, D Yang | arXiv preprint arXiv:2212.08061, 2022 | 166 | 2022 |
SALMON: Self-Alignment with Instructable Reward Models | Z Sun, Y Shen, H Zhang, Q Zhou, Z Chen, DD Cox, Y Yang, C Gan | The Twelfth International Conference on Learning Representations, 2024 | 61* | 2024 |
Bounding the capabilities of large language models in open text generation with prompt constraints | A Lu, H Zhang, Y Zhang, X Wang, D Yang | arXiv preprint arXiv:2302.09185, 2023 | 27 | 2023 |
Robustness of demonstration-based learning under limited data scenario | H Zhang, Y Zhang, R Zhang, D Yang | arXiv preprint arXiv:2210.10693, 2022 | 14 | 2022 |
COMBO: Compositional World Models for Embodied Multi-Agent Cooperation | H Zhang, Z Wang, Q Lyu, Z Zhang, S Chen, T Shu, Y Du, C Gan | arXiv preprint arXiv:2404.10775, 2024 | 10 | 2024 |
HAZARD Challenge: Embodied Decision Making in Dynamically Changing Environments | Q Zhou, S Chen, Y Wang, H Xu, W Du, H Zhang, Y Du, JB Tenenbaum, ... | arXiv preprint arXiv:2401.12975, 2024 | 8 | 2024 |
Werewolf among us: A multimodal dataset for modeling persuasion behaviors in social deduction games | B Lai, H Zhang, M Liu, A Pariani, F Ryan, W Jia, SA Hayati, JM Rehg, ... | arXiv preprint arXiv:2212.08279, 2022 | 5 | 2022 |
SnapMem: Snapshot-based 3D Scene Memory for Embodied Exploration and Reasoning | Y Yang, H Yang, J Zhou, P Chen, H Zhang, Y Du, C Gan | arXiv preprint arXiv:2411.17735, 2024 | | 2024 |
Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge | W Du, Q Lyu, J Shan, Z Qi, H Zhang, S Chen, A Peng, T Shu, K Lee, ... | arXiv preprint arXiv:2411.01796, 2024 | | 2024 |