Bridging offline reinforcement learning and imitation learning: A tale of pessimism | P Rashidinejad, B Zhu, C Ma, J Jiao, S Russell | Advances in Neural Information Processing Systems (NeurIPS), 2021 | 337 | 2021 |
MADE: Exploration via maximizing deviation from explored regions | T Zhang*, P Rashidinejad*, J Jiao, Y Tian, J Gonzalez, S Russell | Advances in Neural Information Processing Systems (NeurIPS), 2021 | 49 | 2021 |
Optimal conservative offline RL with general function approximation via augmented Lagrangian | P Rashidinejad, H Zhu, K Yang, S Russell, J Jiao | International Conference on Learning Representations (ICLR) --- Spotlight, 2023 | 42 | 2023 |
Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning | H Zhu, P Rashidinejad, J Jiao | Advances in Neural Information Processing Systems (NeurIPS), 2023 | 19 | 2023 |
SLIP: Learning to predict in unknown dynamical systems with long-term memory | P Rashidinejad, J Jiao, S Russell | Advances in Neural Information Processing Systems (NeurIPS) --- Oral, 2020 | 14 | 2020 |
Patient-adaptable intracranial pressure morphology analysis using a probabilistic model-based approach | P Rashidinejad, X Hu, S Russell | Physiological Measurement 41 (10), 104003, 2020 | 7 | 2020 |
Sail into the Headwind: Alignment via Robust Rewards and Dynamic Labels against Reward Hacking | P Rashidinejad, Y Tian | International Conference on Learning Representations (ICLR), 2025 | | 2025 |
Reliable Prediction and Decision-Making in Sequential Environments | P Rashidinejad | University of California, Berkeley, 2022 | | 2022 |
Techniques for accurately estimating the reliability of storage systems | P Rashidinejad, N Jamadagni, A Raghavan, C Schelp, C Gordon | US Patent 11,416,324, 2020 | | 2020 |