Trustllm: Trustworthiness in large language models Y Huang, L Sun, H Wang, S Wu, Q Zhang, Y Li, C Gao, Y Huang, W Lyu, ... [ICML 2024] 41st International Conference on Machine Learning, 2024 | 345* | 2024 |
TrustGPT: Benchmark for Trustworthy and Responsible Large Language Models Y Huang, Q Zhang, PS Yu, L Sun arXiv preprint arXiv:2306.11507, 2023 | 81 | 2023 |
Metatool benchmark for large language models: Deciding whether to use tools and which to use Y Huang, J Shi, Y Li, C Fan, S Wu, Q Zhang, Y Liu, P Zhou, Y Wan, ... [ICLR 2024] 12th International Conference on Learning Representations, 2023 | 70 | 2023 |
MLLM-as-a-Judge: Assessing Multimodal LLM-as-a-Judge with Vision-Language Benchmark D Chen, R Chen, S Zhang, Y Liu, Y Wang, H Zhou, Q Zhang, P Zhou, ... [ICML 2024 (Oral)] 41st International Conference on Machine Learning, 2024 | 57 | 2024 |
LLM-as-a-Coauthor: Can Mixed Human-Written and Machine-Generated Text Be Detected? Q Zhang, C Gao, D Chen, Y Huang, Y Huang, Z Sun, S Zhang, W Li, Z Fu, ... [NAACL 2024 Findings] 2024 Annual Conference of the North American Chapter …, 2024 | 34* | 2024 |
GUI-WORLD: A Dataset for GUI-oriented Multimodal LLM-based Agents D Chen, Y Huang, S Wu, J Tang, L Chen, Y Bai, Z He, C Wang, H Zhou, ... [ICLR 2025] 13th International Conference on Learning Representations, 2025 | 29 | 2025 |
Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge J Ye, Y Wang, Y Huang, D Chen, Q Zhang, N Moniz, T Gao, W Geyer, ... [ICLR 2025] 13th International Conference on Learning Representations, 2025 | 26 | 2025 |
DataGen: Unified Synthetic Dataset Generation via Large Language Models Y Huang, S Wu, C Gao, D Chen, Q Zhang, Y Wan, T Zhou, C Xiao, J Gao, ... The Thirteenth International Conference on Learning Representations, 0 | 15* | |
HonestLLM: Toward an Honest and Helpful Large Language Model C Gao, Q Zhang, S Wu, Y Huang, D Chen, Z Fu, Y Wan, L Sun, X Zhang [NeurIPS 2024] 38th Annual Conference on Neural Information Processing Systems, 2024 | 11* | 2024 |
Triplet-constraint Transformer with Multi-scale Refinement for Dose Prediction in Radiotherapy L Wen, Q Zhang, Z Feng, Y Xu, X Chen, J Zhou, Y Wang [ISBI 2024] IEEE 21st International Symposium on Biomedical Imaging, 2024 | 1 | 2024 |
On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective Y Huang, C Gao, S Wu, H Wang, X Wang, Y Zhou, Y Wang, J Ye, J Shi, ... arXiv preprint arXiv:2502.14296, 2025 | | 2025 |
Cliff: Leveraging Ambiguous Samples for Enhanced Test-Time Adaptation X Chen, Q Zhang, Y Wang [ECAI 2024] 27th European Conference on Artificial Intelligence 392, 642 - 649, 2024 | | 2024 |