Sharath Chandra Raparthy
Member of Technical Staff at Reka AI
No verified email - Homepage
Title
Cited by
Year
The llama 3 herd of models
A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, A Letman, A Mathur, ...
arXiv preprint arXiv:2407.21783, 2024
Cited by: 2867; Year: 2024
The llama 3 herd of models
A Grattafiori, A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, ...
arXiv e-prints, arXiv: 2407.21783, 2024
Cited by: 96; Year: 2024
Multi-objective gflownets
M Jain, SC Raparthy, A Hernández-García, J Rector-Brooks, Y Bengio, ...
ICML 2023, 14631-14653, 2023
Cited by: 75; Year: 2023
Rainbow teaming: Open-ended generation of diverse adversarial prompts
M Samvelyan*, SC Raparthy*, A Lupu*, E Hambro, AH Markosyan, ...
NeurIPS 2024, 2024
Cited by: 52; Year: 2024
Teaching large language models to reason with reinforcement learning
A Havrilla, Y Du, SC Raparthy, C Nalmpantis, J Dwivedi-Yu, ...
arXiv preprint arXiv:2403.04642, 2024
Cited by: 50; Year: 2024
Glore: When, where, and how to improve llm reasoning via global and local refinements
A Havrilla, S Raparthy, C Nalmpantis, J Dwivedi-Yu, M Zhuravinskyi, ...
ICML 2024, 2024
Cited by: 32; Year: 2024
Compositional Attention: Disentangling Search and Retrieval
S Mittal, SC Raparthy, I Rish, Y Bengio, G Lajoie
ICLR 2022, 2021
Cited by: 25; Year: 2021
Curriculum in Gradient-Based Meta-Reinforcement Learning
B Mehta, T Deleu, SC Raparthy, CJ Pal, L Paull
arXiv preprint arXiv:2002.07956, 2020
Cited by: 23; Year: 2020
Generalization to New Sequential Decision Making Tasks with In-Context Learning
SC Raparthy, E Hambro, R Kirk, M Henaff, R Raileanu
ICML 2024, 2023
Cited by: 15; Year: 2023
ML reproducibility challenge 2021
K Sinha, J Dodge, S Luccioni, J Forde, SC Raparthy, J Pineau, R Stojnic
ReScience C 8 (2), 48, 2022
Cited by: 12; Year: 2022
Llama 3 Model Card
AI Meta
https://github.com/meta-llama/llama3/blob/main/MODEL_CARD.md, 2024
Cited by: 10; Year: 2024
Continual learning in environments with polynomial mixing times
M Riemer*, SC Raparthy*, I Cases, G Subbaraj, M Puelma Touzel, I Rish
Advances in Neural Information Processing Systems (NeurIPS) 2022 35, 21961-21973, 2022
Cited by: 9; Year: 2022
Generating automatic curricula via self-supervised active domain randomization
SC Raparthy, B Mehta, F Golemo, L Paull
arXiv preprint arXiv:2002.07911, 2020
Cited by: 9; Year: 2020
Data Efficient Stagewise Knowledge Distillation
A Kulkarni, N Panchi, SC Raparthy, S Chiddarwar
arXiv preprint arXiv:1911.06786, 2019
Cited by: 8; Year: 2019
A game-theoretic perspective on risk-sensitive reinforcement learning.
M Godbout, M Heuillet, SC Raparthy, R Bhati, A Durand
SafeAI@AAAI, 2022
Cited by: 3; Year: 2022
Multi-objective gflownets
M Jain, SC Raparthy, A Hernandez-Garcia, J Rector-Brooks, Y Bengio, S Miret, E Bengio
Cited by: 2
Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models
A Havrilla, A Dai, L O'Mahony, K Oostermeijer, V Zisler, A Albalak, F Milo, ...
arXiv preprint arXiv:2412.02980, 2024
Cited by: 1; Year: 2024
CuNAS Curiosity-driven Neural-Augmented Simulator
SC Raparthy, M Mozifian, L Paull, F Golemo
2nd Workshop on Closing the Reality Gap in Sim2Real Transfer for Robotics. RSS, 2020
Cited by: 1; Year: 2020
On impact of mixing times in continual reinforcement learning
SC Raparthy
2023
Explicit Sequence Proximity Models for Hidden State Identification
TD Anil Kota, SC Raparthy, Parag Khanna
Thirty-second Conference on Neural Information Processing Systems (NeurIPS …, 2018
2018