Seuraa
Guojun Xiong
Guojun Xiong
Vahvistettu sähköpostiosoite verkkotunnuksessa seas.harvard.edu - Kotisivu
Nimike
Viittaukset
Viittaukset
Vuosi
The finben: An holistic financial benchmark for large language models
Q Xie, W Han, Z Chen, R Xiang, X Zhang, Y He, M Xiao, D Li, Y Dai, ...
Advances in Neural Information Processing Systems (Neurips) 37, 2024
522024
Reinforcement Learning Augmented Asymptotically Optimal Index Policy for Finite-Horizon Restless Bandits
G Xiong, J Li, R Singh
Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022, 2022
28*2022
Whittle Index-Based Q-Learning for Wireless Edge Caching With Linear Function Approximation
G Xiong, S Wang, J Li, R Singh
IEEE/ACM Transactions on Networking, 2024
26*2024
Reinforcement learning for dynamic dimensioning of cloud caches: A restless bandit approach
G Xiong, S Wang, G Yan, J Li
IEEE/ACM Transactions on Networking 31 (5), 2147-2161, 2023
212023
Index-aware reinforcement learning for adaptive video streaming at the wireless edge
G Xiong, X Qin, B Li, R Singh, J Li
Proceedings of the Twenty-Third International Symposium of ACM MobiHoc, 81-90, 2022
212022
Straggler-resilient decentralized learning via adaptive asynchronous updates
G Xiong, G Yan, S Wang, J Li
Proceedings of the Twenty-fifth International Symposium on Theory …, 2024
19*2024
FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making
Y Yu, Z Yao, H Li, Z Deng, Y Cao, Z Chen, JW Suchow, R Liu, Z Cui, ...
Advances in Neural Information Processing Systems (Neurips) 37, 2024
182024
Finite-time analysis of whittle index based Q-learning for restless multi-armed bandits with neural network function approximation
G Xiong, J Li
Advances in Neural Information Processing Systems 36, 29048-29073, 2023
152023
Learning infinite-horizon average-reward restless multi-action bandits via index awareness
G Xiong, S Wang, J Li
Advances in Neural Information Processing Systems (Neurips) 35, 17911-17925, 2022
152022
Open-finllms: Open multimodal large language models for financial applications
Q Xie, D Li, M Xiao, Z Jiang, R Xiang, X Zhang, Z Chen, Y He, W Han, ...
arXiv preprint arXiv:2408.11878, 2024
142024
Online restless multi-armed bandits with long-term fairness constraints
S Wang, G Xiong, J Li
Proceedings of the AAAI Conference on Artificial Intelligence 38 (14), 15616 …, 2024
62024
Optimality conditions of performance-guaranteed power minimization in MIMO networks: A distributed algorithm and its feasibility
G Xiong, T Kim, DJ Love, E Perrins
IEEE Transactions on Signal Processing 69, 119-135, 2020
62020
Leveraging subspace information for low-rank matrix reconstruction
W Zhang, T Kim, G Xiong, SH Leung
Signal processing 163, 123-131, 2019
62019
DePRL: Achieving Linear Convergence Speedup in Personalized Decentralized Learning with Shared Representations
G Xiong, G Yan, S Wang, J Li
Proceedings of the AAAI Conference on Artificial Intelligence 38 (14), 16103 …, 2024
42024
Decorrelation deep learning for fingerprint-based indoor localization
G Xiong, T Kim, E Perrins
arXiv preprint arXiv:1908.02014, 2019
32019
Dopl: Direct online preference learning for restless bandits with preference feedback
G Xiong, U Dinesha, D Mukherjee, J Li, S Shakkottai
arXiv preprint arXiv:2410.05527, 2024
22024
Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback
G Xiong, J Li
International Conference on Machine Learning (ICML) 2024, 2024
22024
Optimizing vital sign monitoring in resource-constrained maternal care: An rl-based restless bandit approach
N Boehmer, Y Zhao, G Xiong, P Rodriguez-Diaz, PDC Cibrian, J Ngonzi, ...
arXiv preprint arXiv:2410.08377, 2024
12024
Personalized federated reinforcement learning with shared representations
G Xiong, S Wang, D Jiang, J Li
Deployable RL: From Research to Practice@ Reinforcement Learning Conference 2024, 2024
12024
Finite-Horizon Single-Pull Restless Bandits: An Efficient Index Policy For Scarce Resource Allocation
G Xiong, H Wang, Y Pan, S Mandal, S Shah, N Boehmer, M Tambe
arXiv preprint arXiv:2501.06103, 2025
2025
Järjestelmä ei voi suorittaa toimenpidettä nyt. Yritä myöhemmin uudelleen.
Artikkelit 1–20