Human-like summarization evaluation with ChatGPT M Gao, J Ruan, R Sun, X Yin, S Yang, X Wan arXiv preprint arXiv:2304.02554, 2023 | 117 | 2023 |
How do seq2seq models perform on end-to-end data-to-text generation? X Yin, X Wan ACL 2022 (Volume 1: Long Papers), 7701-7710, 2022 | 20 | 2022 |
ALCUNA: large language models meet new knowledge X Yin, B Huang, X Wan EMNLP 2023, 2023 | 18 | 2023 |
History matters: Temporal knowledge editing in large language model X Yin, J Jiang, L Yang, X Wan AAAI 2024 38 (17), 19413-19421, 2024 | 15 | 2024 |
Benchmarking knowledge boundary for large language model: A different perspective on model evaluation X Yin, X Zhang, J Ruan, X Wan ACL 2024, 2024 | 6 | 2024 |
Themis: A reference-free NLG evaluation language model with flexibility and interpretability X Hu, L Lin, M Gao, X Yin, X Wan EMNLP 2024, 2024 | 4* | 2024 |
MC-MKE: A Fine-Grained Multimodal Knowledge Editing Benchmark Emphasizing Modality Consistency J Zhang, H Zhang, X Yin, B Huang, X Zhang, X Hu, X Wan arXiv preprint arXiv:2406.13219, 2024 | 3 | 2024 |
Error-Robust Retrieval for Chinese Spelling Check X Yin, X Hu, J Jiang, X Wan COLING 2024, 6257-6267, 2022 | 3* | 2022 |
Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement X Yin, X Wang, L Pan, X Wan, WY Wang arXiv preprint arXiv:2410.04444, 2024 | 2 | 2024 |
Exploring Context-Aware Evaluation Metrics for Machine Translation X Hu, X Yin, X Wan Findings of EMNLP 2023, 15291-15298, 2023 | 2 | 2023 |
COrAL: Order-Agnostic Language Modeling for Efficient Iterative Refinement Y Xie, A Goyal, X Wu, X Yin, X Xu, MY Kan, L Pan, WY Wang arXiv preprint arXiv:2410.09675, 2024 | 1 | 2024 |
Understanding the interplay between parametric and contextual knowledge for large language models S Cheng, L Pan, X Yin, X Wang, WY Wang arXiv preprint arXiv:2410.08414, 2024 | 1 | 2024 |
ContraSolver: Self-Alignment of Language Models by Resolving Internal Preference Contradictions X Zhang, X Yin, X Wan arXiv preprint arXiv:2406.08842, 2024 | 1 | 2024 |
A Comprehensive Evaluation and Analysis Study for Chinese Spelling Check X Yin, X Wan arXiv preprint arXiv:2307.13655, 2023 | 1 | 2023 |
Evaluating Self-Generated Documents for Enhancing Retrieval-Augmented Generation with Large Language Models J Li, X Hu, X Yin, X Wan Findings of NAACL 2025, 2025 | | 2025 |
DSGram: Dynamic Weighting Sub-Metrics for Grammatical Error Correction in the Era of Large Language Models J Xie, Y Li, X Yin, X Wan AAAI 2025, 2024 | | 2024 |
Contextual Modeling for Document-level ASR Error Correction J Jiang, X Yin, X Wan, W Peng, R Li, J Yang, Y Zhou COLING 2024, 3855-3867, 2022 | | 2022 |