No language left behind: Scaling human-centered machine translation MR Costa-jussà, J Cross, O Çelebi, M Elbayad, K Heafield, K Heffernan, ... arXiv preprint arXiv:2207.04672, 2022 | 800 | 2022 |
Deep encoder, shallow decoder: Reevaluating non-autoregressive machine translation J Kasai, N Pappas, H Peng, J Cross, NA Smith arXiv preprint arXiv:2006.10369, 2020 | 181 | 2020 |
Monotonic multihead attention X Ma, J Pino, J Cross, L Puzon, J Gu arXiv preprint arXiv:1909.12406, 2019 | 150 | 2019 |
Span-Based Constituency Parsing with a Structure-Label System and Provably Optimal Dynamic Oracles J Cross, L Huang EMNLP, 2016 | 134 | 2016 |
Lifting the curse of multilinguality by pre-training modular transformers J Pfeiffer, N Goyal, XV Lin, X Li, J Cross, S Riedel, M Artetxe arXiv preprint arXiv:2205.06266, 2022 | 117 | 2022 |
Incremental Parsing with Minimal Features Using Bi-Directional LSTM J Cross, L Huang ACL, 2016 | 102 | 2016 |
Non-autoregressive machine translation with disentangled context transformer J Kasai, J Cross, M Ghazvininejad, J Gu International conference on machine learning, 5144-5155, 2020 | 100 | 2020 |
Facebook AI WMT21 news translation task submission C Tran, S Bhosale, J Cross, P Koehn, S Edunov, A Fan arXiv preprint arXiv:2108.03265, 2021 | 99 | 2021 |
Simple Fusion: Return of the Language Model F Stahlberg, J Cross, V Stoyanov WMT, 2018 | 84 | 2018 |
Improving zero-shot translation by disentangling positional information D Liu, J Niehues, J Cross, F Guzmán, X Li arXiv preprint arXiv:2012.15127, 2020 | 46 | 2020 |
On the evaluation of machine translation for terminology consistency A Anastasopoulos, L Besacier, J Cross, M Gallé, P Koehn, V Nikoulina arXiv preprint arXiv:2106.11891, 2021 | 34 | 2021 |
Parallel machine translation with disentangled context transformer J Kasai, J Cross, M Ghazvininejad, J Gu arXiv preprint arXiv:2001.05136, 2020 | 32 | 2020 |
Multilingual neural machine translation with deep encoder and multiple shallow decoders X Kong, A Renduchintala, J Cross, Y Tang, J Gu, X Li arXiv preprint arXiv:2206.02079, 2022 | 30 | 2022 |
Multilingual machine translation with hyper-adapters C Baziotis, M Artetxe, J Cross, S Bhosale arXiv preprint arXiv:2205.10835, 2022 | 25 | 2022 |
Tricks for training sparse translation models D Dua, S Bhosale, V Goswami, J Cross, M Lewis, A Fan arXiv preprint arXiv:2110.08246, 2021 | 22 | 2021 |
No language left behind: Scaling human-centered machine translation (2022) NLLB Team, MR Costa-jussà, J Cross, O Çelebi, M Elbayad, K Heafield, ... URL https://arxiv.org/abs/2207.04672, 2022 | 20 | 2022 |
Data selection curriculum for neural machine translation T Mohiuddin, P Koehn, V Chaudhary, J Cross, S Bhosale, S Joty arXiv preprint arXiv:2203.13867, 2022 | 19 | 2022 |
Generating distributed word embeddings using structured information JH Cross, JJ Fan, B Xiang, B Zhou US Patent 9,892,113, 2018 | 16 | 2018 |
How Robust is Neural Machine Translation to Language Imbalance in Multilingual Tokenizer Training? S Zhang, V Chaudhary, N Goyal, J Cross, G Wenzek, M Bansal, ... arXiv preprint arXiv:2204.14268, 2022 | 15 | 2022 |
XLEnt: Mining a large cross-lingual entity dataset with lexical-semantic-phonetic word alignment A El-Kishky, A Renduchintala, J Cross, F Guzmán, P Koehn arXiv preprint arXiv:2104.08597, 2021 | 14 | 2021 |