Sharath Chandra Raparthy

Посилання

	Усі	З 2020
Цитування	3290	3290
h-індекс	10	10
i10-індекс	11	11

2200

1100

550

1650

2021202220232024202510 21 63 2191 997

Співавтори

Eric HambroAnthropicПідтверджена електронна адреса в anthropic.com
Yoshua BengioProfessor of computer science, University of Montreal, Mila, IVADO, CIFARПідтверджена електронна адреса в umontreal.ca
Irina RishUniversity of Montreal / Mila -Quebec AI InstituteПідтверджена електронна адреса в mila.quebec
Emmanuel BengioMcGill University; Recursion/Valence LabsПідтверджена електронна адреса в mail.mcgill.ca
Mikayel SamvelyanGoogle DeepMindПідтверджена електронна адреса в google.com
Mikael HenaffMetaПідтверджена електронна адреса в nyu.edu
Guillaume LajoieAssistant Professor, Applied Mathematics, Université de MontréalПідтверджена електронна адреса в umontreal.ca
Matthew RiemerIBM, MilaПідтверджена електронна адреса в us.ibm.com
Roberta RaileanuResearch Scientist at Meta, Honorary Lecturer at UCL

Підписатись

Sharath Chandra Raparthy

Member of Technical Staff at Reka AI

Немає підтвердженої електронної адреси - Домашня сторінка

Reinforcement Learning Deep Learning


Назва Сортувати за цитуваннями Сортувати за роком Сортувати за назвою	Посилання Посилання	Рік
The llama 3 herd of models A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, A Letman, A Mathur, ... arXiv preprint arXiv:2407.21783, 2024	2867	2024
The llama 3 herd of models A Grattafiori, A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, ... arXiv e-prints, arXiv: 2407.21783, 2024	96	2024
Multi-objective gflownets M Jain, SC Raparthy, A Hernández-Garcıa, J Rector-Brooks, Y Bengio, ... ICML 2023, 14631-14653, 2023	75	2023
Rainbow teaming: Open-ended generation of diverse adversarial prompts M Samvelyan, SC Raparthy, A Lupu*, E Hambro, AH Markosyan, ... NeurIPS 2024, 2024	52	2024
Teaching large language models to reason with reinforcement learning A Havrilla, Y Du, SC Raparthy, C Nalmpantis, J Dwivedi-Yu, ... arXiv preprint arXiv:2403.04642, 2024	50	2024
Glore: When, where, and how to improve llm reasoning via global and local refinements A Havrilla, S Raparthy, C Nalmpantis, J Dwivedi-Yu, M Zhuravinskyi, ... ICML 2024, 2024	32	2024
Compositional Attention: Disentangling Search and Retrieval S Mittal, SC Raparthy, I Rish, Y Bengio, G Lajoie ICLR 2022, 2021	25	2021
Curriculum in Gradient-Based Meta-Reinforcement Learning B Mehta, T Deleu, SC Raparthy, CJ Pal, L Paull arXiv preprint arXiv:2002.07956, 2020	23	2020
Generalization to New Sequential Decision Making Tasks with In-Context Learning SC Raparthy, E Hambro, R Kirk, M Henaff, R Raileanu ICML 2024, 2023	15	2023
ML reproducibility challenge 2021 K Sinha, J Dodge, S Luccioni, J Forde, SC Raparthy, J Pineau, R Stojnic ReScience C 8 (2), 48, 2022	12	2022
Llama 3 Model Card AI Meta https://github.com/meta-llama/llama3/blob/main/MODEL_CARD.md, 2024	10	2024
Continual learning in environments with polynomial mixing times M Riemer, SC Raparthy, I Cases, G Subbaraj, M Puelma Touzel, I Rish Advances in Neural Information Processing Systems (NeurIPS) 2022 35, 21961-21973, 2022	9	2022
Generating automatic curricula via self-supervised active domain randomization SC Raparthy, B Mehta, F Golemo, L Paull arXiv preprint arXiv:2002.07911, 2020	9	2020
Data Efficient Stagewise Knowledge Distillation A Kulkarni, N Panchi, SC Raparthy, S Chiddarwar arXiv preprint arXiv:1911.06786, 2019	8	2019
A game-theoretic perspective on risk-sensitive reinforcement learning. M Godbout, M Heuillet, SC Raparthy, R Bhati, A Durand SafeAI@ AAAI, 2022	3	2022
Yoshua 448 Bengio, Santiago Miret, and Emmanuel Bengio M Jain, SC Raparthy, A Hernandez-Garcia, J Rector-Brooks Multi-objective gflownets 449, 0	2
Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models A Havrilla, A Dai, L O'Mahony, K Oostermeijer, V Zisler, A Albalak, F Milo, ... arXiv preprint arXiv:2412.02980, 2024	1	2024
CuNAS Curiosity-driven Neural-Augmented Simulator SC Raparthy, M Mozifian, L Paull, F Golemo 2nd Workshop on Closing the Reality Gap in Sim2Real Transfer for Robotics. RSS, 2020	1	2020
On impact of mixing times in continual reinforcement learning SC Raparthy		2023
Explicit Sequence Proximity Models for Hidden State Identification TD Anil Kota, SC Raparthy, Parag Khanna Thirty-second Conference on Neural Information Processing Systems (NeurIPS …, 2018		2018

У даний момент система не може виконати операцію. Спробуйте пізніше.

Статті 1–20

Кількість бібліографічних посилань на рік

Повторювані посилання

Об’єднані посилання

Додати співавторівСпівавтори

Підписатись

Посилання

Співавтори