TrustLLM: Trustworthiness in Large Language Models Y Huang, L Sun, H Wang, S Wu, Q Zhang, Y Li, C Gao, Y Huang, W Lyu, ... [ICML 2024] 41st International Conference on Machine Learning, 2024 | 318* | 2024 |
TrustGPT: Benchmark for Trustworthy and Responsible Large Language Models Y Huang, Q Zhang, PS Yu, L Sun arXiv preprint arXiv:2306.11507, 2023 | 72 | 2023 |
MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use Y Huang, J Shi, Y Li, C Fan, S Wu, Q Zhang, Y Liu, P Zhou, Y Wan, ... [ICLR 2024] 12th International Conference on Learning Representations, 2023 | 65 | 2023 |
MLLM-as-a-Judge: Assessing Multimodal LLM-as-a-Judge with Vision-Language Benchmark D Chen, R Chen, S Zhang, Y Liu, Y Wang, H Zhou, Q Zhang, P Zhou, ... [ICML 2024 (Oral)] 41st International Conference on Machine Learning, 2024 | 47 | 2024 |
LLM-as-a-Coauthor: Can Mixed Human-Written and Machine-Generated Text Be Detected? Q Zhang, C Gao, D Chen, Y Huang, Y Huang, Z Sun, S Zhang, W Li, Z Fu, ... [NAACL 2024 Findings] 2024 Annual Conference of the North American Chapter …, 2024 | 28* | 2024 |
GUI-WORLD: A Dataset for GUI-oriented Multimodal LLM-based Agents D Chen, Y Huang, S Wu, J Tang, L Chen, Y Bai, Z He, C Wang, H Zhou, ... [ICLR 2025] 13th International Conference on Learning Representations, 2025 | 23 | 2025 |
Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge J Ye, Y Wang, Y Huang, D Chen, Q Zhang, N Moniz, T Gao, W Geyer, ... [ICLR 2025] 13th International Conference on Learning Representations, 2025 | 15 | 2025 |
UniGen: A Unified Framework for Textual Dataset Generation Using Large Language Models S Wu, Y Huang, C Gao, D Chen, Q Zhang, Y Wan, T Zhou, X Zhang, ... [ICLR 2025] 13th International Conference on Learning Representations, 2025 | 12 | 2025 |
HonestLLM: Toward an Honest and Helpful Large Language Model C Gao, Q Zhang, S Wu, Y Huang, D Chen, Z Fu, Y Wan, L Sun, X Zhang [NeurIPS 2024] 38th Annual Conference on Neural Information Processing Systems, 2024 | 8* | 2024 |
Triplet-constraint Transformer with Multi-scale Refinement for Dose Prediction in Radiotherapy L Wen, Q Zhang, Z Feng, Y Xu, X Chen, J Zhou, Y Wang [ISBI 2024] IEEE 21st International Symposium on Biomedical Imaging, 2024 | 1 | 2024 |
Cliff: Leveraging Ambiguous Samples for Enhanced Test-Time Adaptation X Chen, Q Zhang, Y Wang [ECAI 2024] 27th European Conference on Artificial Intelligence 392, 642-649, 2024 | | 2024 |