Llama 2: Open foundation and fine-tuned chat models H Touvron, L Martin, K Stone, P Albert, A Almahairi, Y Babaei, ... arXiv preprint arXiv:2307.09288, 2023 | 12176 | 2023 |
The llama 3 herd of models A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, A Letman, A Mathur, ... arXiv preprint arXiv:2407.21783, 2024 | 2281 | 2024 |
Lima: Less is more for alignment C Zhou, P Liu, P Xu, S Iyer, J Sun, Y Mao, X Ma, A Efrat, P Yu, L Yu, ... Advances in Neural Information Processing Systems 36, 2024 | 907 | 2024 |
Llama 2: open foundation and fine-tuned chat models. arXiv H Touvron, L Martin, K Stone, P Albert, A Almahairi, Y Babaei, ... arXiv preprint arXiv:2307.09288, 2023 | 151 | 2023 |
Llama 2: Open foundation and fine-tuned chat models, 2023b H Touvron, L Martin, K Stone, P Albert, A Almahairi, Y Babaei, ... URL https://arxiv.org/abs/2307.09288, 2023 | 144 | 2023 |
Llama 2: Open foundation and fine-tuned chat models. arXiv 2023 H Touvron, L Martin, K Stone, P Albert, A Almahairi, Y Babaei, ... arXiv preprint arXiv:2307.09288, 0 | 142 | |
Scaling autoregressive multi-modal models: Pretraining and instruction tuning L Yu, B Shi, R Pasunuru, B Muller, O Golovneva, T Wang, A Babu, B Tang, ... arXiv preprint arXiv:2309.02591 2 (3), 2023 | 130 | 2023 |
The llama 3 herd of models A Grattafiori, A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, ... arXiv e-prints, arXiv: 2407.21783, 2024 | 69 | 2024 |
Llama 2: open foundation and fine-tuned chat models. CoRR abs/2307.09288 (2023) H Touvron, L Martin, K Stone, P Albert, A Almahairi, Y Babaei, ... arXiv preprint arXiv:2307.09288 10, 2023 | 62 | 2023 |
A theory on adam instability in large-scale machine learning I Molybog, P Albert, M Chen, Z DeVito, D Esiobu, N Goyal, PS Koura, ... arXiv preprint arXiv:2304.09871, 2023 | 26 | 2023 |
LIMA: Less Is More for Alignment. CoRR abs/2305.11206 (2023) C Zhou, P Liu, P Xu, S Iyer, J Sun, Y Mao, X Ma, A Efrat, P Yu, L Yu, ... arXiv preprint arXiv:2305.11206 10, 2023 | 21 | 2023 |
LIMA: less is more for alignment (2023) C Zhou, P Liu, P Xu, S Iyer, J Sun, Y Mao, X Ma, A Efrat, P Yu, L Yu, ... WARNING: APPENDIX C contains EXAMPLES OF TOXIC USER INPUTS, WHICH MAY INCLUDE …, 0 | 15 | |
Llama 2: Open foundation and fine-tuned chat models. arXiv [Preprint] (2023) H Touvron, L Martin, K Stone, P Albert, A Almahairi, Y Babaei, ... URL https://arxiv.org/abs/2307.09288, 12, 0 | 14 | |
LIMA: Less Is More for Alignment. arXiv 2023 C Zhou, P Liu, P Xu, S Iyer, J Sun, Y Mao, X Ma, A Efrat, P Yu, L Yu arXiv preprint arXiv:2305.11206, 0 | 6 | |