The llama 3 herd of models A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, A Letman, A Mathur, ... arXiv preprint arXiv:2407.21783, 2024 | 2867 | 2024 |
The llama 3 herd of models A Grattafiori, A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, ... arXiv e-prints, arXiv: 2407.21783, 2024 | 96 | 2024 |
Multi-objective gflownets M Jain, SC Raparthy, A Hernández-Garcıa, J Rector-Brooks, Y Bengio, ... ICML 2023, 14631-14653, 2023 | 75 | 2023 |
Rainbow teaming: Open-ended generation of diverse adversarial prompts M Samvelyan*, SC Raparthy*, A Lupu*, E Hambro, AH Markosyan, ... NeurIPS 2024, 2024 | 52 | 2024 |
Teaching large language models to reason with reinforcement learning A Havrilla, Y Du, SC Raparthy, C Nalmpantis, J Dwivedi-Yu, ... arXiv preprint arXiv:2403.04642, 2024 | 50 | 2024 |
Glore: When, where, and how to improve llm reasoning via global and local refinements A Havrilla, S Raparthy, C Nalmpantis, J Dwivedi-Yu, M Zhuravinskyi, ... ICML 2024, 2024 | 32 | 2024 |
Compositional Attention: Disentangling Search and Retrieval S Mittal, SC Raparthy, I Rish, Y Bengio, G Lajoie ICLR 2022, 2021 | 25 | 2021 |
Curriculum in Gradient-Based Meta-Reinforcement Learning B Mehta, T Deleu, SC Raparthy, CJ Pal, L Paull arXiv preprint arXiv:2002.07956, 2020 | 23 | 2020 |
Generalization to New Sequential Decision Making Tasks with In-Context Learning SC Raparthy, E Hambro, R Kirk, M Henaff, R Raileanu ICML 2024, 2023 | 15 | 2023 |
ML reproducibility challenge 2021 K Sinha, J Dodge, S Luccioni, J Forde, SC Raparthy, J Pineau, R Stojnic ReScience C 8 (2), 48, 2022 | 12 | 2022 |
Llama 3 Model Card AI Meta https://github.com/meta-llama/llama3/blob/main/MODEL_CARD.md, 2024 | 10 | 2024 |
Continual learning in environments with polynomial mixing times M Riemer*, SC Raparthy*, I Cases, G Subbaraj, M Puelma Touzel, I Rish Advances in Neural Information Processing Systems (NeurIPS) 2022 35, 21961-21973, 2022 | 9 | 2022 |
Generating automatic curricula via self-supervised active domain randomization SC Raparthy, B Mehta, F Golemo, L Paull arXiv preprint arXiv:2002.07911, 2020 | 9 | 2020 |
Data Efficient Stagewise Knowledge Distillation A Kulkarni, N Panchi, SC Raparthy, S Chiddarwar arXiv preprint arXiv:1911.06786, 2019 | 8 | 2019 |
A game-theoretic perspective on risk-sensitive reinforcement learning. M Godbout, M Heuillet, SC Raparthy, R Bhati, A Durand SafeAI@ AAAI, 2022 | 3 | 2022 |
Yoshua 448 Bengio, Santiago Miret, and Emmanuel Bengio M Jain, SC Raparthy, A Hernandez-Garcia, J Rector-Brooks Multi-objective gflownets 449, 0 | 2 | |
Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models A Havrilla, A Dai, L O'Mahony, K Oostermeijer, V Zisler, A Albalak, F Milo, ... arXiv preprint arXiv:2412.02980, 2024 | 1 | 2024 |
CuNAS Curiosity-driven Neural-Augmented Simulator SC Raparthy, M Mozifian, L Paull, F Golemo 2nd Workshop on Closing the Reality Gap in Sim2Real Transfer for Robotics. RSS, 2020 | 1 | 2020 |
On impact of mixing times in continual reinforcement learning SC Raparthy | | 2023 |
Explicit Sequence Proximity Models for Hidden State Identification TD Anil Kota, SC Raparthy, Parag Khanna Thirty-second Conference on Neural Information Processing Systems (NeurIPS …, 2018 | | 2018 |