The FinBen: An holistic financial benchmark for large language models Q Xie, W Han, Z Chen, R Xiang, X Zhang, Y He, M Xiao, D Li, Y Dai, ... Advances in Neural Information Processing Systems (NeurIPS) 37, 2024 | 52 | 2024 |
Reinforcement Learning Augmented Asymptotically Optimal Index Policy for Finite-Horizon Restless Bandits G Xiong, J Li, R Singh Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022 | 28* | 2022 |
Whittle Index-Based Q-Learning for Wireless Edge Caching With Linear Function Approximation G Xiong, S Wang, J Li, R Singh IEEE/ACM Transactions on Networking, 2024 | 26* | 2024 |
Reinforcement learning for dynamic dimensioning of cloud caches: A restless bandit approach G Xiong, S Wang, G Yan, J Li IEEE/ACM Transactions on Networking 31 (5), 2147-2161, 2023 | 21 | 2023 |
Index-aware reinforcement learning for adaptive video streaming at the wireless edge G Xiong, X Qin, B Li, R Singh, J Li Proceedings of the Twenty-Third International Symposium of ACM MobiHoc, 81-90, 2022 | 21 | 2022 |
Straggler-resilient decentralized learning via adaptive asynchronous updates G Xiong, G Yan, S Wang, J Li Proceedings of the Twenty-fifth International Symposium on Theory …, 2024 | 19* | 2024 |
FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making Y Yu, Z Yao, H Li, Z Deng, Y Cao, Z Chen, JW Suchow, R Liu, Z Cui, ... Advances in Neural Information Processing Systems (NeurIPS) 37, 2024 | 18 | 2024 |
Finite-time analysis of Whittle index based Q-learning for restless multi-armed bandits with neural network function approximation G Xiong, J Li Advances in Neural Information Processing Systems (NeurIPS) 36, 29048-29073, 2023 | 15 | 2023 |
Learning infinite-horizon average-reward restless multi-action bandits via index awareness G Xiong, S Wang, J Li Advances in Neural Information Processing Systems (NeurIPS) 35, 17911-17925, 2022 | 15 | 2022 |
Open-FinLLMs: Open multimodal large language models for financial applications Q Xie, D Li, M Xiao, Z Jiang, R Xiang, X Zhang, Z Chen, Y He, W Han, ... arXiv preprint arXiv:2408.11878, 2024 | 14 | 2024 |
Online restless multi-armed bandits with long-term fairness constraints S Wang, G Xiong, J Li Proceedings of the AAAI Conference on Artificial Intelligence 38 (14), 15616 …, 2024 | 6 | 2024 |
Optimality conditions of performance-guaranteed power minimization in MIMO networks: A distributed algorithm and its feasibility G Xiong, T Kim, DJ Love, E Perrins IEEE Transactions on Signal Processing 69, 119-135, 2020 | 6 | 2020 |
Leveraging subspace information for low-rank matrix reconstruction W Zhang, T Kim, G Xiong, SH Leung Signal Processing 163, 123-131, 2019 | 6 | 2019 |
DePRL: Achieving Linear Convergence Speedup in Personalized Decentralized Learning with Shared Representations G Xiong, G Yan, S Wang, J Li Proceedings of the AAAI Conference on Artificial Intelligence 38 (14), 16103 …, 2024 | 4 | 2024 |
Decorrelation deep learning for fingerprint-based indoor localization G Xiong, T Kim, E Perrins arXiv preprint arXiv:1908.02014, 2019 | 3 | 2019 |
DOPL: Direct online preference learning for restless bandits with preference feedback G Xiong, U Dinesha, D Mukherjee, J Li, S Shakkottai arXiv preprint arXiv:2410.05527, 2024 | 2 | 2024 |
Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback G Xiong, J Li International Conference on Machine Learning (ICML), 2024 | 2 | 2024 |
Optimizing vital sign monitoring in resource-constrained maternal care: An RL-based restless bandit approach N Boehmer, Y Zhao, G Xiong, P Rodriguez-Diaz, PDC Cibrian, J Ngonzi, ... arXiv preprint arXiv:2410.08377, 2024 | 1 | 2024 |
Personalized federated reinforcement learning with shared representations G Xiong, S Wang, D Jiang, J Li Deployable RL: From Research to Practice @ Reinforcement Learning Conference, 2024 | 1 | 2024 |
Finite-Horizon Single-Pull Restless Bandits: An Efficient Index Policy For Scarce Resource Allocation G Xiong, H Wang, Y Pan, S Mandal, S Shah, N Boehmer, M Tambe arXiv preprint arXiv:2501.06103, 2025 | | 2025 |