Follow
Han Shen
Title
Cited by
Cited by
Year
On penalty-based bilevel gradient descent method
H Shen, T Chen
Mathematical Programming 210 (1-38), 2024
512024
Mitigating gradient bias in multi-objective learning: A provably convergent approach
H Fernando, H Shen, M Liu, S Chaudhury, K Murugesan, T Chen
International Conference on Learning Representations, 2023
51*2023
Towards understanding asynchronous advantage actor-critic: Convergence and linear speedup
H Shen, K Zhang, M Hong, T Chen
IEEE Transactions on Signal Processing 71, 2579-2594, 2023
47*2023
Adaptive temporal difference learning with linear function approximation
T Sun, H Shen, T Chen, D Li
IEEE Transactions on Pattern Analysis and Machine Intelligence 44 (12), 8812 …, 2021
352021
Byzantine-resilient decentralized policy evaluation with linear function approximation
Z Wu, H Shen, T Chen, Q Ling
IEEE Transactions on Signal Processing 69, 3839-3853, 2021
282021
Alternating projected sgd for equality-constrained bilevel optimization
Q Xiao, H Shen, W Yin, T Chen
International Conference on Artificial Intelligence and Statistics, 987-1023, 2023
252023
A single-timescale analysis for stochastic approximation with multiple coupled sequences
H Shen, T Chen
Advances in Neural Information Processing Systems 35, 17415-17429, 2022
162022
Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF
H Shen, Z Yang, T Chen
International Conference on Machine Learning, 2024
92024
Seal: Safety-enhanced aligned llm fine-tuning via bilevel data selection
H Shen, PY Chen, P Das, T Chen
International Conference on Learning Representations, 2025
52025
Joint Unsupervised and Supervised Training for Automatic Speech Recognition via Bilevel Optimization
AFM Saif, X Cui, H Shen, S Lu, B Kingsbury, T Chen
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
32024
Alternating implicit projected sgd and its efficient variants for equality-constrained bilevel optimization
Q Xiao, H Shen, W Yin, T Chen
arXiv preprint arXiv:2211.07096, 2022
32022
Mitigating forgetting in llm supervised fine-tuning and preference learning
H Fernando, H Shen, P Ram, Y Zhou, H Samulowitz, N Baracaldo, ...
arXiv preprint arXiv:2410.15483, 2024
22024
Distributed offline policy optimization over batch data
H Shen, S Lu, X Cui, T Chen
International Conference on Artificial Intelligence and Statistics, 4443-4472, 2023
22023
A Method for Bilevel Optimization with Convex Lower-Level Problem
H Shen, S Paternain, G Liu, R Kompella, T Chen
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
12024
The system can't perform the operation now. Try again later.
Articles 1–14