Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ... arXiv preprint arXiv:2312.11805, 2023 | 3253 | 2023 |
Palm 2 technical report R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... arXiv preprint arXiv:2305.10403, 2023 | 1583 | 2023 |
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ... arXiv preprint arXiv:2403.05530, 2024 | 1195 | 2024 |
Minimum risk training for neural machine translation S Shen, Y Cheng, Z He, W He, H Wu, M Sun, Y Liu ACL, 2015 | 522 | 2015 |
Semi-supervised learning for neural machine translation Y Cheng, W Xu, Z He, W He, H Wu, M Sun, Y Liu ACL, 2016 | 320 | 2016 |
Robust Neural Machine Translation with Doubly Adversarial Inputs Y Cheng, L Jiang, W Macherey ACL, 2019 | 289 | 2019 |
Language Model Beats Diffusion--Tokenizer is Key to Visual Generation L Yu, J Lezama, NB Gundavarapu, L Versari, K Sohn, D Minnen, Y Cheng, ... arXiv preprint arXiv:2310.05737, 2023 | 222 | 2023 |
Magvit: Masked generative video transformer L Yu, Y Cheng, K Sohn, J Lezama, H Zhang, H Chang, AG Hauptmann, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 220 | 2023 |
Videopoet: A large language model for zero-shot video generation D Kondratyuk, L Yu, X Gu, J Lezama, J Huang, G Schindler, R Hornung, ... arXiv preprint arXiv:2312.14125, 2023 | 198 | 2023 |
Towards robust neural machine translation Y Cheng, Z Tu, F Meng, J Zhai, Y Liu ACL, 2018 | 181 | 2018 |
A teacher-student framework for zero-resource neural machine translation Y Chen, Y Liu, Y Cheng, VOK Li ACL, 2017 | 158 | 2017 |
Towards conversational diagnostic AI T Tu, A Palepu, M Schaekermann, K Saab, J Freyberg, R Tanno, A Wang, ... arXiv preprint arXiv:2401.05654, 2024 | 157 | 2024 |
Capabilities of gemini models in medicine K Saab, T Tu, WH Weng, R Tanno, D Stutz, E Wulczyn, F Zhang, ... arXiv preprint arXiv:2404.18416, 2024 | 147 | 2024 |
mslam: Massively multilingual joint pre-training for speech and text A Bapna, C Cherry, Y Zhang, Y Jia, M Johnson, Y Cheng, S Khanuja, ... arXiv preprint arXiv:2202.01374, 2022 | 127 | 2022 |
Advaug: Robust adversarial augmentation for neural machine translation Y Cheng, L Jiang, W Macherey, J Eisenstein ACL, 2020 | 117 | 2020 |
Joint Training for Pivot-based Neural Machine Translation Y Cheng, Q Yang, Y Liu, M Sun, W Xu IJCAI, 3974-3980, 2017 | 115 | 2017 |
Agreement-based joint training for bidirectional attention-based neural machine translation Y Cheng, Q Yang, Y Liu, M Sun, W Xu IJCAI, 2016 | 92 | 2016 |
Palm 2 technical report. arXiv 2023 R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... arXiv preprint arXiv:2305.10403, 0 | 84 | |
Towards accurate differential diagnosis with large language models D McDuff, M Schaekermann, T Tu, A Palepu, A Wang, J Garrison, ... arXiv preprint arXiv:2312.00164, 2023 | 79 | 2023 |
Reducing Word Omission Errors in Neural Machine Translation: A Contrastive Learning Approach Z Yang, Y Cheng, Y Liu, M Sun ACL, 2019 | 79 | 2019 |