FacTool: Factuality Detection in Generative AI--A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios IC Chern, S Chern, S Chen, W Yuan, K Feng, C Zhou, J He, G Neubig, ... arXiv preprint arXiv:2307.13528, 2023 | 177 | 2023 |
Felm: Benchmarking factuality evaluation of large language models S Chen, Y Zhao, J Zhang, I Chern, S Gao, P Liu, J He Advances in Neural Information Processing Systems 36, 44502-44523, 2023 | 75 | 2023 |
Alignment for honesty Y Yang, E Chern, X Qiu, G Neubig, P Liu NeurIPS 2024, 2023 | 62 | 2023 |
Generative ai for math: Abel E Chern*, H Zou*, X Li*, J Hu*, K Feng, J Li, P Liu https://github.com/GAIR-NLP/abel, 2023 | 26 | 2023 |
O1 Replication Journey--Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? Z Huang*, H Zou*, X Li*, Y Liu*, Y Zheng*, E Chern*, S Xia*, Y Qin, ... arXiv preprint arXiv:2411.16489, 2024 | 22* | 2024 |
Decoding of quantum data-syndrome codes via belief propagation KY Kuo, IC Chern, CY Lai ISIT 2021, 2021 | 20 | 2021 |
Olympicarena: Benchmarking multi-discipline cognitive reasoning for superintelligent ai Z Huang, Z Wang, S Xia, X Li, H Zou, R Xu, RZ Fan, L Ye, E Chern, Y Ye, ... Advances in Neural Information Processing Systems 37, 19209-19253, 2025 | 19* | 2025 |
ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation E Chern*, J Su*, Y Ma*, P Liu arXiv preprint arXiv:2407.06135, 2024 | 19 | 2024 |
Can Large Language Models be Trusted for Evaluation? Scalable Meta-Evaluation of LLMs as Evaluators via Agent Debate S Chern, E Chern, G Neubig, P Liu arXiv preprint arXiv:2401.16788, 2024 | 17 | 2024 |
Audio-visual speech enhancement and separation by utilizing multi-modal self-supervised embeddings IC Chern, KH Hung, YT Chen, T Hussain, M Gogate, A Hussain, Y Tsao, ... ICASSPW 2023, 2023 | 17* | 2023 |
Reformatted Alignment RZ Fan, X Li, H Zou, J Li, S He, E Chern, J Hu, P Liu EMNLP 2024, 2024 | 16 | 2024 |
Align on the Fly: Adapting Chatbot Behavior to Established Norms C Xu, S Chern, E Chern, G Zhang, Z Wang, R Liu, J Li, J Fu, P Liu arXiv preprint arXiv:2312.15907, 2023 | 16 | 2023 |
Improving Factuality of Abstractive Summarization via Contrastive Reward Learning IC Chern, Z Wang, S Das, B Sharma, P Liu, G Neubig The Third Workshop on Trustworthy Natural Language Processing @ ACL 2023, 2023 | 13* | 2023 |
BeHonest: Benchmarking Honesty of Large Language Models S Chern, Z Hu, Y Yang, E Chern, Y Guo, J Jin, B Wang, P Liu arXiv preprint arXiv:2406.13261, 2024 | 5 | 2024 |
LIMO: Less is More for Reasoning Y Ye, Z Huang, Y Xiao, E Chern, S Xia, P Liu arXiv preprint arXiv:2502.03387, 2025 | 4 | 2025 |
Halu-J: Critique-Based Hallucination Judge B Wang, S Chern, E Chern, P Liu arXiv preprint arXiv:2407.12943, 2024 | 4 | 2024 |
Chinesefacteval: A factuality benchmark for chinese llms B Wang, E Chern, P Liu https://gair-nlp.github.io/ChineseFactEval/, 2023 | 4 | 2023 |
Voice Direction-Of-Arrival Conversion IC Chern, S Chern, HC Kuo, HH Tseng, KH Hung, Y Tsao MLSP 2023, 2023 | | 2023 |