ติดตาม
Yecheng Jason Ma
Yecheng Jason Ma
ชื่ออื่นๆYecheng Ma, Jason Ma, Jason Yecheng Ma
ยืนยันอีเมลแล้วที่ seas.upenn.edu - หน้าแรก
ชื่อ
อ้างโดย
อ้างโดย
ปี
Open x-embodiment: Robotic learning datasets and rt-x models
A O'Neill, A Rehman, A Gupta, A Maddukuri, A Gupta, A Padalkar, A Lee, ...
ICRA 2024; arXiv preprint arXiv:2310.08864, 2023
473*2023
Eureka: Human-level reward design via coding large language models
YJ Ma, W Liang, G Wang, DA Huang, O Bastani, D Jayaraman, Y Zhu, ...
ICLR 2024; arXiv preprint arXiv:2310.12931, 2023
3142023
VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training
YJ Ma, S Sodhani, D Jayaraman, O Bastani, V Kumar, A Zhang
ICLR 2023; arXiv preprint arXiv:2210.00030, 2022
2642022
Where are we in the search for an Artificial Visual Cortex for Embodied Intelligence?
A Majumdar, K Yadav, S Arnaud, YJ Ma, C Chen, S Silwal, A Jain, ...
NeurIPS 2023; arXiv preprint arXiv:2303.18240, 2023
1372023
Droid: A large-scale in-the-wild robot manipulation dataset
A Khazatsky, K Pertsch, S Nair, A Balakrishna, S Dasari, S Karamcheti, ...
RSS 2024; arXiv preprint arXiv:2403.12945, 2024
1202024
LIV: Language-Image Representations and Rewards for Robotic Control
YJ Ma, W Liang, V Som, V Kumar, A Zhang, O Bastani, D Jayaraman
ICML 2023; arXiv preprint arXiv:2306.00958, 2023
1162023
Conservative offline distributional reinforcement learning
Y Ma, D Jayaraman, O Bastani
NeurIPS 2021; Advances in Neural Information Processing Systems 34, 2021
1012021
Versatile Offline Imitation from Observations and Examples
YJ Ma, A Shen, D Jayaraman, O Bastani
ICML 2022; arXiv preprint arXiv:2202.02433, 2022
63*2022
How Far I'll Go: Offline Goal-Conditioned Reinforcement Learning via -Advantage Regression
YJ Ma, J Yan, D Jayaraman, O Bastani
NeurIPS 2022; arXiv preprint arXiv:2206.03023, 2022
62*2022
Likelihood-Based Diverse Sampling for Trajectory Forecasting
YJ Ma, JP Inala, D Jayaraman, O Bastani
ICCV 2021; Proceedings of the IEEE/CVF International Conference on Computer …, 2021
40*2021
DrEureka: Language Model Guided Sim-To-Real Transfer
YJ Ma, W Liang, HJ Wang, S Wang, Y Zhu, L Fan, O Bastani, ...
RSS 2024; arXiv preprint arXiv:2406.01967, 2024
352024
Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning
YJ Ma, A Shen, O Bastani, D Jayaraman
AAAI 2022; arXiv preprint arXiv:2112.07701, 2022
242022
Regret Bounds for Risk-Sensitive Reinforcement Learning
O Bastani, YJ Ma, E Shen, W Xu
NeurIPS 2022; arXiv preprint arXiv:2210.05650, 2022
222022
Safely bridging offline and online reinforcement learning
W Xu, YJ Ma, K Xu, H Bastani, O Bastani
AISTAT 2023; arXiv preprint arXiv:2110.13060, 2021
17*2021
Universal Visual Decomposer: Long-Horizon Manipulation Made Easy
Z Zhang, Y Li, O Bastani, A Gupta, D Jayaraman, YJ Ma, L Weihs
ICRA 2024; arXiv preprint arXiv:2310.08581, 2023
152023
Safe Human-Interactive Control via Shielding
J Priya Inala, YJ Ma, O Bastani, X Zhang, A Solar-Lezama
arXiv e-prints, arXiv: 2110.05440, 2021
7*2021
TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching
YJ Ma, K Sivakumar, J Yan, O Bastani, D Jayaraman
L4DC 2023; arXiv preprint arXiv:2305.12663, 2023
62023
State Relevance for Off-Policy Evaluation
SP Shen, Y Ma, O Gottesman, F Doshi-Velez
ICML 2021; International Conference on Machine Learning (ICML), 9537-9546, 2021
42021
Environment curriculum generation via large language models
W Liang, S Wang, HJ Wang, O Bastani, D Jayaraman, YJ Ma
CORL 2024; arXiv preprint arXiv:2411.01775, 2024, 2024
3*2024
Articulate-Anything: Automatic Modeling of Articulated Objects via a Vision-Language Foundation Model
L Le, J Xie, W Liang, HJ Wang, Y Yang, YJ Ma, K Vedder, A Krishna, ...
arXiv preprint arXiv:2410.13882, 2024
22024
ระบบไม่สามารถดำเนินการได้ในขณะนี้ โปรดลองใหม่อีกครั้งในภายหลัง
บทความ 1–20