Följ
Hao Sun
Hao Sun
PhD Candidate, DAMTP, University of Cambridge
Verifierad e-postadress på cam.ac.uk - Startsida
Titel
Citeras av
Citeras av
År
Hierarchical Multi-Scale Gaussian Transformer for Stock Movement Prediction
Q Ding, S Wu, H Sun, J Guo, J Guo
IJCAI 2020 (Proceedings of the Twenty-Ninth International Joint Conference …, 2020
2232020
Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL
R Yang, Y Lu, W Li, H Sun, M Fang, Y Du, X Li, L Han, C Zhang
ICLR 2022 (The Tenth International Conference on Learning Representations), 2022
772022
Membership Inference Attacks against Synthetic Data through Overfitting Detection
B van Breugel, H Sun, Z Qian, M van der Schaar
AISTATS 2023 (The 26th International Conference on Artificial Intelligence …, 2023
472023
Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL
H Sun, A Hüyük, M van der Schaar
ICLR 2024 (The Twelfth International Conference on Learning Representations), 2024
43*2024
Exploit Reward Shifting in Value-Based Deep-RL: Optimistic Curiosity-Based Exploration and Conservative Exploitation via Linear Reward Shaping
H Sun, L Han, R Yang, X Ma, J Guo, B Zhou
NeurIPS 2022 (Advances in Neural Information Processing Systems) 35, 37719-37734, 2022
43*2022
Policy Continuation with Hindsight Inverse Dynamics
H Sun, Z Li, X Liu, D Lin, B Zhou
NeurIPS 2019 (Advances in Neural Information Processing Systems 33), 2019
422019
Adaptive regularization of labels
Q Ding, S Wu, H Sun, J Guo, ST Xia
arXiv preprint arXiv:1908.05474, 2019
422019
Dense reward for free in reinforcement learning from human feedback
AJ Chan, H Sun, S Holt, M van der Schaar
ICML 2024 (The Forty-first International Conference on Machine Learning), 2024
292024
Safe exploration by solving early terminated mdp
H Sun, Z Xu, M Fang, Z Peng, J Guo, B Dai, B Zhou
arXiv preprint arXiv:2107.04200, 2021
20*2021
Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond
H Sun
arXiv preprint arXiv:2310.06147, 2023
192023
Inverse-RLignment: Inverse Reinforcement Learning from Demonstrations for LLM Alignment
H Sun, M van der Schaar
arXiv preprint arXiv:2405.15624, 2024
18*2024
Adaptive regularization of labels
Q Ding, S Wu, H Sun, J Guo, ST Xia
AAAI 2021 (The Thirty-Fifth AAAI Conference on Artificial Intelligence), 2021
18*2021
Novel policy seeking with constrained optimization
H Sun, Z Peng, B Dai, D Lin, B Zhou
NeurIPS 2022 Deep RL Workshop, 2022
162022
Supervised Q-Learning can be a Strong Baseline for Continuous Control
H Sun, Z Xu, M Fang, B Zhou
NeurIPS 2022 Foundation Models for Decision Making Workshop, 2022
16*2022
Non-local policy optimization via diversity-regularized collaborative exploration
Z Peng, H Sun, B Zhou
arXiv preprint arXiv:2006.07781, 2020
142020
Neural Laplace Control for Continuous-time Delayed Systems
S Holt, A Hüyük, Z Qian, H Sun, M van der Schaar
AISTATS 2023 (The 26th International Conference on Artificial Intelligence …, 2023
122023
Accountability in offline reinforcement learning: Explaining decisions with a corpus of examples
H Sun, A Hüyük, D Jarrett, M van der Schaar
Advances in Neural Information Processing Systems 36, 2023
10*2023
What is Flagged in Uncertainty Quantification? Latent Density Models for Uncertainty Categorization
H Sun, B van Breugel, J Crabbe, N Seedat, M van der Schaar
NeurIPS 2023, 2023
9*2023
Toward causal-aware RL: State-wise action-refined temporal difference
H Sun, T Wang
arXiv preprint arXiv:2201.00354, 2022
92022
On the guaranteed almost equivalence between imitation learning from observation and demonstration
Z Cheng, L Liu, A Liu, H Sun, M Fang, D Tao
TNNLS (IEEE Transactions on Neural Networks and Learning Systems), 2021
92021
Systemet kan inte utföra åtgärden just nu. Försök igen senare.
Artiklar 1–20