팔로우
QIWEI DI
QIWEI DI
Phd student, Department of Computer Science , University of California, Los Angeles
cs.ucla.edu의 이메일 확인됨 - 홈페이지
제목
인용
인용
연도
Variance-Aware Regret Bounds for Stochastic Contextual Dueling Bandits
Q Di, T Jin, Y Wu, H Zhao, F Farnoud, Q Gu
International Conference on Learning Representations 2024, 2023
112023
Borda regret minimization for generalized linear dueling bandits
Y Wu, T Jin, H Lou, F Farnoud, Q Gu
ICML2024, 2023
92023
Pessimistic nonlinear least-squares value iteration for offline reinforcement learning
Q Di, H Zhao, J He, Q Gu
International Conference on Learning Representations 2024, 2023
52023
Unified convergence analysis for score-based diffusion models with deterministic samplers
R Li, Q Di, Q Gu
arXiv preprint arXiv:2410.14237, 2024
22024
Nearly optimal algorithms for contextual dueling bandits from adversarial feedback
Q Di, J He, Q Gu
arXiv preprint arXiv:2404.10776, 2024
12024
Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path
Q Di, J He, D Zhou, Q Gu
International Conference on Machine Learning, 2023
12023
Relative-Translation Invariant Wasserstein Distance
B Wang, Q Di, M Yin, M Wang, Q Gu, P Wei
arXiv preprint arXiv:2409.02416, 2024
2024
현재 시스템이 작동되지 않습니다. 나중에 다시 시도해 주세요.
학술자료 1–7