Hao Sun

Citeras av

	Alla	Sedan 2020
Citat	743	742
h-index	14	14
i10-index	17	17

320

160

240

2020202120222023202420259 40 99 232 309 52

Offentlig åtkomst

Visa alla

4 artiklar

0 artiklar

tillgänglig

inte tillgänglig

Enligt krav från finansiärer

Medförfattare

Mihaela van der SchaarUniversity of Cambridge, The Alan Turing InstituteVerifierad e-postadress på ee.ucla.edu
Bolei ZhouAssistant Professor at UCLAVerifierad e-postadress på ucla.edu
Qianggang DingUniversity of Montreal / Mila - Quebec AI InstituteVerifierad e-postadress på umontreal.ca
Rui YangUniversity of Illinois Urbana-ChampaignVerifierad e-postadress på illinois.edu
Alihan HüyükHarvard UniversityVerifierad e-postadress på seas.harvard.edu
Meng FangUniversity of LiverpoolVerifierad e-postadress på liverpool.ac.uk
Bo DaiThe University of Hong KongVerifierad e-postadress på hku.hk
Ziping XuPostdoc Fellow at Harvard UniversityVerifierad e-postadress på fas.harvard.edu
Zhenghao PengUniversity of California, Los AngelesVerifierad e-postadress på cs.ucla.edu
Dahua LinThe Chinese University of Hong KongVerifierad e-postadress på ie.cuhk.edu.hk
Boris van BreugelSenior ML Researcher, Qualcomm | PhD candidate, University of CambridgeVerifierad e-postadress på cam.ac.uk
Samuel HoltUniversity of CambridgeVerifierad e-postadress på cam.ac.uk
Xiaoteng Ma（马骁腾）Dept. Automation, Tsinghua University, Beijing, ChinaVerifierad e-postadress på mails.tsinghua.edu.cn
Alex J. ChanConvergenceVerifierad e-postadress på convergence.ai
Nabeel SeedatUniversity of CambridgeVerifierad e-postadress på cam.ac.uk
Thomas POUPLINPh.D. researcher, University of CambridgeVerifierad e-postadress på cam.ac.uk
Yunyi ShenMITVerifierad e-postadress på mit.edu
Jean-Francois TonByteDance ResearchVerifierad e-postadress på bytedance.com
Daniel JarrettResearch Scientist at DeepMindVerifierad e-postadress på deepmind.com

Följ

Hao Sun

PhD Candidate, DAMTP, University of Cambridge

Verifierad e-postadress på cam.ac.uk - Startsida

Reinforcement Learning Inverse RL RLHF Large Language Models


Titel Sortera efter citat Sortera efter år Sortera efter titel	Citeras av Citeras av	År
Hierarchical Multi-Scale Gaussian Transformer for Stock Movement Prediction Q Ding, S Wu, H Sun, J Guo, J Guo IJCAI 2020 (Proceedings of the Twenty-Ninth International Joint Conference …, 2020	223	2020
Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL R Yang, Y Lu, W Li, H Sun, M Fang, Y Du, X Li, L Han, C Zhang ICLR 2022 (The Tenth International Conference on Learning Representations), 2022	77	2022
Membership Inference Attacks against Synthetic Data through Overfitting Detection B van Breugel, H Sun, Z Qian, M van der Schaar AISTATS 2023 (The 26th International Conference on Artificial Intelligence …, 2023	47	2023
Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL H Sun, A Hüyük, M van der Schaar ICLR 2024 (The Twelfth International Conference on Learning Representations), 2024	43*	2024
Exploit Reward Shifting in Value-Based Deep-RL: Optimistic Curiosity-Based Exploration and Conservative Exploitation via Linear Reward Shaping H Sun, L Han, R Yang, X Ma, J Guo, B Zhou NeurIPS 2022 (Advances in Neural Information Processing Systems) 35, 37719-37734, 2022	43*	2022
Policy Continuation with Hindsight Inverse Dynamics H Sun, Z Li, X Liu, D Lin, B Zhou NeurIPS 2019 (Advances in Neural Information Processing Systems 33), 2019	42	2019
Adaptive regularization of labels Q Ding, S Wu, H Sun, J Guo, ST Xia arXiv preprint arXiv:1908.05474, 2019	42	2019
Dense reward for free in reinforcement learning from human feedback AJ Chan, H Sun, S Holt, M van der Schaar ICML 2024 (The Forty-first International Conference on Machine Learning), 2024	29	2024
Safe exploration by solving early terminated mdp H Sun, Z Xu, M Fang, Z Peng, J Guo, B Dai, B Zhou arXiv preprint arXiv:2107.04200, 2021	20*	2021
Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond H Sun arXiv preprint arXiv:2310.06147, 2023	19	2023
Inverse-RLignment: Inverse Reinforcement Learning from Demonstrations for LLM Alignment H Sun, M van der Schaar arXiv preprint arXiv:2405.15624, 2024	18*	2024
Adaptive regularization of labels Q Ding, S Wu, H Sun, J Guo, ST Xia AAAI 2021 (The Thirty-Fifth AAAI Conference on Artificial Intelligence), 2021	18*	2021
Novel policy seeking with constrained optimization H Sun, Z Peng, B Dai, D Lin, B Zhou NeurIPS 2022 Deep RL Workshop, 2022	16	2022
Supervised Q-Learning can be a Strong Baseline for Continuous Control H Sun, Z Xu, M Fang, B Zhou NeurIPS 2022 Foundation Models for Decision Making Workshop, 2022	16*	2022
Non-local policy optimization via diversity-regularized collaborative exploration Z Peng, H Sun, B Zhou arXiv preprint arXiv:2006.07781, 2020	14	2020
Neural Laplace Control for Continuous-time Delayed Systems S Holt, A Hüyük, Z Qian, H Sun, M van der Schaar AISTATS 2023 (The 26th International Conference on Artificial Intelligence …, 2023	12	2023
Accountability in offline reinforcement learning: Explaining decisions with a corpus of examples H Sun, A Hüyük, D Jarrett, M van der Schaar Advances in Neural Information Processing Systems 36, 2023	10*	2023
What is Flagged in Uncertainty Quantification? Latent Density Models for Uncertainty Categorization H Sun, B van Breugel, J Crabbe, N Seedat, M van der Schaar NeurIPS 2023, 2023	9*	2023
Toward causal-aware RL: State-wise action-refined temporal difference H Sun, T Wang arXiv preprint arXiv:2201.00354, 2022	9	2022
On the guaranteed almost equivalence between imitation learning from observation and demonstration Z Cheng, L Liu, A Liu, H Sun, M Fang, D Tao TNNLS (IEEE Transactions on Neural Networks and Learning Systems), 2021	9	2021

Systemet kan inte utföra åtgärden just nu. Försök igen senare.

Artiklar 1–20

Citat per år

Dubblettcitat

Sammanfogade citat

Lägg till medförfattareMedförfattare

Följ

Citeras av

Medförfattare