Folgen
Peiwen Yuan
Peiwen Yuan
Bestätigte E-Mail-Adresse bei bit.edu.cn
Titel
Zitiert von
Zitiert von
Jahr
Turning Dust into Gold: Distilling Complex Reasoning Capabilities from LLMs by Leveraging Negative Data
Y Li*, P Yuan*, S Feng, B Pan, B Sun, X Wang, H Wang, K Li
AAAI 2024 oral, 2023
162023
Escape Sky-high Cost: Early-stopping Self-Consistency for Multi-step Reasoning
Y Li*, P Yuan*, S Feng, B Pan, X Wang, B Sun, H Wang, K Li
ICLR 2024 poster, 2024
92024
BatchEval: Towards Human-like Text Evaluation
P Yuan, S Feng, Y Li, X Wang, B Pan, H Wang, K Li
ACL 2024 oral, 2023
92023
Generative Dense Retrieval: Memory Can Be a Burden
P Yuan*, X Wang*, S Feng, B Pan, Y Li, H Wang, X Miao, K Li
EACL 2024 oral, 2024
62024
Make every penny count: Difficulty-adaptive self-consistency for cost-efficient reasoning
X Wang, S Feng, Y Li, P Yuan, Y Zhang, B Pan, H Wang, Y Hu, K Li
arXiv preprint arXiv:2408.13457, 2024
52024
Better correlation and robustness: a distribution-balanced self-supervised learning framework for automatic dialogue evaluation
P Yuan, X Wang, J Shi, B Sun, Y Li
NeurIPS 2023 poster, 2024
32024
Integrate the Essence and Eliminate the Dross: Fine-Grained Self-Consistency for Free-Form Language Generation
X Wang, Y Li, S Feng, P Yuan, B Pan, H Wang, Y Hu, K Li
ACL 2024 main, 2024
22024
Instruction Embedding: Latent Representations of Instructions Towards Task Identification
Y Li, J Shi, S Feng, P Yuan, X Wang, B Pan, H Wang, Y Hu, K Li
NeurIPS 2024 DB poster, 2024
12024
Focused Large Language Models are Stable Many-Shot Learners
P Yuan, S Feng, Y Li, X Wang, Y Zhang, C Tan, B Pan, H Wang, Y Hu, ...
EMNLP 2024 main, 2024
12024
Poor-Supervised Evaluation for SuperLLM via Mutual Consistency
P Yuan, S Feng, Y Li, X Wang, B Pan, H Wang, Y Hu, K Li
ACL 2024 findings, 2024
12024
CogLM: Tracking Cognitive Development of Large Language Models
X Wang, P Yuan, S Feng, Y Li, B Pan, H Wang, Y Hu, K Li
arXiv preprint arXiv:2408.09150, 2024
12024
Parallel Corpora Alignment Framework for Multilingual and Robust Automatic Dialogue Evaluation
X Wang*, J Shi*, P Yuan*, K Li
Proceedings of The Eleventh Dialog System Technology Challenge, 123-132, 2023
2023
InsBank: Evolving Instruction Subset for Ongoing Alignment
J Shi, Y Li, S Feng, P Yuan, X Wang, Y Zhang, C Tan, B Pan, H Ren, Y Hu, ...
Beyond One-Size-Fits-All: Tailored Benchmarks for Efficient Model Evaluation
P Yuan, Y Zhang, S Feng, Y Li, X Wang, J Shi, C Tan, B Pan, Y Hu, K Li
Mode: A Benchmark and a Probe into Multimodal Open-Domain Dialogue Evaluation
H Yin, X Wang, Y Zhang, P Lu, B Sun, P Yuan, K Li
Available at SSRN 4888542, 0
Tracking Cognitive Development of Large Language Models
X Wang*, P Yuan*, S Feng, B Pan, Y Li, B Sun, H Wang, K Li
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–16