Efficient training of bert by progressively stacking L Gong, D He, Z Li, T Qin, L Wang, T Liu International conference on machine learning, 2337-2346, 2019 | 172 | 2019 |
Joint language semantic and structure embedding for knowledge graph completion J Shen, C Wang, L Gong, D Song arXiv preprint arXiv:2209.08721, 2022 | 56 | 2022 |
Microsoft Research Asia's systems for WMT19 Y Xia, X Tan, F Tian, F Gao, W Chen, Y Fan, L Gong, Y Leng, R Luo, ... arXiv preprint arXiv:1911.06191, 2019 | 27 | 2019 |
Plotcoder: Hierarchical decoding for synthesizing visualization code in programmatic context X Chen, L Gong, A Cheung, D Song Proceedings of the 59th Annual Meeting of the Association for Computational …, 2021 | 23 | 2021 |
Anytime sampling for autoregressive models via ordered autoencoding Y Xu, Y Song, S Garg, L Gong, R Shu, A Grover, S Ermon arXiv preprint arXiv:2102.11495, 2021 | 21 | 2021 |
Ast-t5: Structure-aware pretraining for code generation and understanding L Gong, M Elhoushi, A Cheung arXiv preprint arXiv:2401.03003, 2024 | 19 | 2024 |
Mc-bert: Efficient language pre-training via a meta controller Z Xu, L Gong, G Ke, D He, S Zheng, L Wang, J Bian, TY Liu arXiv preprint arXiv:2006.05744, 2020 | 18 | 2020 |
Improved clinical abbreviation expansion via non-sense-based approaches J Kim, L Gong, J Khim, JC Weiss, P Ravikumar Machine Learning for Health, 161-178, 2020 | 9 | 2020 |
Evaluation of llms on syntax-aware code fill-in-the-middle tasks L Gong, S Wang, M Elhoushi, A Cheung arXiv preprint arXiv:2403.04814, 2024 | 7 | 2024 |
ADELT: Transpilation between deep learning frameworks L Gong, J Wang, A Cheung arXiv preprint arXiv:2303.03593, 2023 | 3 | 2023 |
Model-generated pretraining signals improves zero-shot generalization of text-to-text transformers L Gong, C Xiong, X Liu, P Bajaj, Y Xie, A Cheung, J Gao, X Song arXiv preprint arXiv:2305.12567, 2023 | 2 | 2023 |