The Impact of Batch Learning in Stochastic Linear Bandits D Provodin, P Gajane, M Pechenizkiy, M Kaptein 2022 IEEE International Conference on Data Mining (ICDM), 1149-1154, 2022 | 5 | 2022 |
Bandits for Sponsored Search Auctions under Unknown Valuation Model: Case Study in E-Commerce Advertising D Provodin, J Joudioux, E Duryev arXiv preprint arXiv:2304.00999, 2023 | 2* | 2023 |
The impact of batch learning in stochastic bandits D Provodin, P Gajane, M Pechenizkiy, M Kaptein Ecological Theory of Reinforcement Learning at NeurIPS 2021, 2021 | 2 | 2021 |
Rethinking Knowledge Transfer in Learning Using Privileged Information D Provodin, B Akker, C Katsimerou, M Kaptein, M Pechenizkiy arXiv preprint arXiv:2408.14319, 2024 | 1 | 2024 |
Efficient exploration in average-reward constrained reinforcement learning: achieving near-optimal regret with posterior sampling D Provodin, M Kaptein, M Pechenizkiy arXiv preprint arXiv:2405.19017, 2024 | | 2024 |
An Empirical Evaluation of Posterior Sampling for Constrained Reinforcement Learning D Provodin, P Gajane, M Pechenizkiy, M Kaptein arXiv preprint arXiv:2209.03596, 2022 | | 2022 |