Sledovat
Kaiyan Zhang
Kaiyan Zhang
PhD Student at Tsinghua University
E-mailová adresa ověřena na: mails.tsinghua.edu.cn - Domovská stránka
Název
Citace
Citace
Rok
BoB: BERT over BERT for training persona-based dialogue models from limited personalized data
H Song, Y Wang, K Zhang, WN Zhang, T Liu
ACL 2021, 2021
1362021
PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuning
X Zhu, B Qi, K Zhang, X Long, Z Lin, B Zhou
NAACL 2024, 2023
41*2023
Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation
B Qi*, K Zhang*, K Tian, H Li, ZR Chen, S Zeng, E Hua, H Jinfang, B Zhou
COLM 2024, 2024
33*2024
Generative Multi-Modal Knowledge Retrieval with Large Language Models
X Long, J Zeng, F Meng, Z Ma, K Zhang, B Zhou, J Zhou
AAAI 2024, 2024
262024
Ultramedical: Building specialized generalists in biomedicine
K Zhang, S Zeng, E Hua, N Ding, ZR Chen, Z Ma, H Li, G Cui, B Qi, X Zhu, ...
NeurIPS 2024 D&B Track, 2024
172024
CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Following
K Zhang, J Wang, E Hua, B Qi, N Ding, B Zhou
ACL 2024, 2024
92024
Free Process Rewards without Process Labels
L Yuan, W Li, H Chen, G Cui, N Ding, K Zhang, B Zhou, Z Liu, H Peng
Preprint, 2024
82024
A static and dynamic attention framework for multi turn dialogue generation
W Zhang, Y Cui, K Zhang, Y Wang, Q Zhu, L Li, T Liu
ACM Transactions on Information Systems 41 (1), 1-30, 2023
82023
Process reinforcement through implicit rewards
G Cui, L Yuan, Z Wang, H Wang, W Li, B He, Y Fan, T Yu, Q Xu, W Chen, ...
Preprint, 2025
62025
Online DPO: Online Direct Preference Optimization with Fast-Slow Chasing
B Qi, P Li, F Li, J Gao, K Zhang, B Zhou
Preprint, 2024
62024
Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices
Z Ma, Y Zhang, G Jia, L Zhao, Y Ma, M Ma, G Liu, K Zhang, J Li, B Zhou
Preprint, 2024
42024
Towards Building Specialized Generalist AI with System 1 and System 2 Fusion
K Zhang, B Qi, B Zhou
Preprint, 2024
42024
Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding
K Zhang, J Wang, N Ding, B Qi, E Hua, X Lv, B Zhou
Preprint, 2024
42024
Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process
E Hua, B Qi, K Zhang, Y Yu, N Ding, X Lv, K Tian, B Zhou
Preprint, 2024
4*2024
A stack-propagation framework for low-resource personalized dialogue generation
H Song, WN Zhang, K Zhang, T Liu
ACM Transactions on Information Systems 41 (3), 1-36, 2023
42023
Automating Exploratory Proteomics Research via Language Models
N Ding, S Qu, L Xie, Y Li, Z Liu, K Zhang, Y Xiong, Y Zuo, Z Chen, E Hua, ...
Preprint, 2024
32024
SMR: State Memory Replay for Long Sequence Modeling
B Qi, J Gao, K Zhang, D Li, J Liu, L Wu, B Zhou
ACL 2024 findings, 2024
3*2024
CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model
K Zhang, N Ding, B Qi, X Zhu, X Long, B Zhou
EMNLP 2023, 2023
32023
A survey of multi-party dialogue research based on deep learning
K Zhang, WN Zhang, T Liu
SCIENTIA SINICA Informationis 51 (8), 1217-1232, 2021
3*2021
MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding
Y Zuo, S Qu, Y Li, Z Chen, X Zhu, E Hua, K Zhang, N Ding, B Zhou
Preprint, 2025
22025
Systém momentálně nemůže danou operaci provést. Zkuste to znovu později.
Články 1–20