RotatE: Knowledge Graph Embedding by Relational Rotation in Complex Space Z Sun, ZH Deng, JY Nie, J Tang International Conference on Learning Representations (ICLR), 2019 | 2871 | 2019 |
Bloom: A 176b-parameter open-access multilingual language model TL Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ... arXiv preprint arXiv:2211.05100, 2022 | 1776 | 2022 |
MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices Z Sun, H Yu, X Song, R Liu, Y Yang, D Zhou 2020 Annual Conference of the Association for Computational Linguistics, 2020 | 874 | 2020 |
Active Retrieval Augmented Generation Z Jiang, FF Xu, L Gao, Z Sun, Q Liu, J Dwivedi-Yu, Y Yang, J Callan, ... Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023 | 447 | 2023 |
Rethinking Transformer-based Set Prediction for Object Detection Z Sun, S Cao, Y Yang, K Kitani International Conference on Computer Vision (ICCV) 2021, 2021 | 412 | 2021 |
Promptsource: An integrated development environment and repository for natural language prompts SH Bach, V Sanh, ZX Yong, A Webson, C Raffel, NV Nayak, A Sharma, ... arXiv preprint arXiv:2202.01279, 2022 | 327 | 2022 |
Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision Z Sun, Y Shen, Q Zhou, H Zhang, Z Chen, D Cox, Y Yang, C Gan Advances in Neural Information Processing Systems (NeurIPS), 2023 | 321 | 2023 |
Aligning Large Multimodal Models with Factually Augmented RLHF Z Sun, S Shen, S Cao, H Liu, C Li, Y Shen, C Gan, LY Gui, YX Wang, ... 2024 Annual Conference of the Association for Computational Linguistics …, 2024 | 251 | 2024 |
Understanding and Improving Transformer From a Multi-Particle Dynamic System Point of View Y Lu, Z Li, D He, Z Sun, B Dong, T Qin, L Wang, TY Liu arXiv preprint arXiv:1906.02762, 2019 | 215 | 2019 |
A Re-evaluation of Knowledge Graph Completion Methods Z Sun, S Vashishth, S Sanyal, P Talukdar, Y Yang 2020 Annual Conference of the Association for Computational Linguistics, 2020 | 197 | 2020 |
Fast Structured Decoding for Sequence Models Z Sun, Z Li, H Wang, D He, Z Lin, Z Deng Advances in Neural Information Processing Systems, 3011-3020, 2019 | 128 | 2019 |
DIFUSCO: Graph-based Diffusion Solvers for Combinatorial Optimization Z Sun, Y Yang Advances in Neural Information Processing Systems (NeurIPS), 2023 | 120 | 2023 |
Recitation-Augmented Language Models Z Sun, X Wang, Y Tay, Y Yang, D Zhou International Conference on Learning Representations (ICLR), 2023 | 117 | 2023 |
DIMES: A Differentiable Meta Solver for Combinatorial Optimization Problems R Qiu, Z Sun, Y Yang Advances in Neural Information Processing Systems (NeurIPS), 2022 | 96 | 2022 |
Dynamically Pruned Message Passing Networks for Large-Scale Knowledge Graph Reasoning X Xu, W Feng, Y Jiang, X Xie, Z Sun, ZH Deng International Conference on Learning Representations (ICLR), 2020 | 86 | 2020 |
Self-Play Preference Optimization for Language Model Alignment Y Wu, Z Sun, H Yuan, K Ji, Y Yang, Q Gu arXiv preprint arXiv:2405.00675, 2024 | 74 | 2024 |
SALMON: Self-Alignment with Instructable Reward Models Z Sun, Y Shen, H Zhang, Q Zhou, Z Chen, DD Cox, Y Yang, C Gan The Twelfth International Conference on Learning Representations, 2024 | 65* | 2024 |
An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models Y Wu, Z Sun, S Li, S Welleck, Y Yang | 64* | 2024 |
DivGraphPointer: A Graph Pointer Network for Extracting Diverse Keyphrases Z Sun, J Tang, P Du, ZH Deng, JY Nie International ACM SIGIR Conference on Research and Development in …, 2019 | 60 | 2019 |
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision Z Sun, L Yu, Y Shen, W Liu, Y Yang, S Welleck, C Gan arXiv preprint arXiv:2403.09472, 2024 | 42 | 2024 |