A survey on quantum reinforcement learning

N Meyer, C Ufrecht, M Periyasamy, DD Scherer… - arxiv preprint arxiv …, 2022 - arxiv.org
Quantum reinforcement learning is an emerging field at the intersection of quantum
computing and machine learning. While we intend to provide a broad overview of the …

Quantum bayesian optimization

Z Dai, GKR Lau, A Verma, Y Shu… - Advances in Neural …, 2023 - proceedings.neurips.cc
Kernelized bandits, also known as Bayesian optimization (BO), has been a prevalent
method for optimizing complicated black-box reward functions. Various BO algorithms have …

Efficient Pauli channel estimation with logarithmic quantum memory

S Chen, W Gong - arxiv preprint arxiv:2309.14326, 2023 - arxiv.org
Here we revisit one of the prototypical tasks for characterizing the structure of noise in
quantum devices: estimating every eigenvalue of an $ n $-qubit Pauli noise channel to error …

Quantum multi-armed bandits and stochastic linear bandits enjoy logarithmic regrets

Z Wan, Z Zhang, T Li, J Zhang, X Sun - Proceedings of the AAAI …, 2023 - ojs.aaai.org
Multi-arm bandit (MAB) and stochastic linear bandit (SLB) are important models in
reinforcement learning, and it is well-known that classical algorithms for bandits with time …

Adaptive online learning of quantum states

X Chen, E Hazan, T Li, Z Lu, X Wang, R Yang - Quantum, 2024 - quantum-journal.org
The problem of efficient quantum state learning, also called shadow tomography, aims to
comprehend an unknown $ d $-dimensional quantum state through POVMs. Yet, these …

Linear bandits with polylogarithmic minimax regret

J Lumbreras, M Tomamichel - The Thirty Seventh Annual …, 2024 - proceedings.mlr.press
We study a noise model for linear stochastic bandits for which the subgaussian noise
parameter vanishes linearly as we select actions on the unit sphere closer and closer to the …

On adaptivity in quantum testing

O Fawzi, N Flammarion, A Garivier… - Transactions on Machine …, 2023 - hal.science
Can adaptive strategies outperform non-adaptive ones for quantum hypothesis selection?
We exhibit problems where adaptive strategies provably reduce the number of required …

Photon-Atom Hybrid Decision-Framework with Concurrent Exploration Acceleration

F Lu, JP Dou, H Tang, XY Xu, CN Zhang… - ACS …, 2025 - ACS Publications
Decision-making enables artificial intelligence to dynamically adjust and acquire knowledge
from experiences, distinguishing it from traditional computing intelligence based on …

Online learning of a panoply of quantum objects

A Bansal, I George, S Ghosh, J Sikora… - arxiv preprint arxiv …, 2024 - arxiv.org
In many quantum tasks, there is an unknown quantum object that one wishes to learn. An
online strategy for this task involves adaptively refining a hypothesis to reproduce such an …

Optimization Techniques in Reinforcement Learning for Healthcare: A Review

DA Aliyu, EAP Akhir, NA Osman… - 2024 8th …, 2024 - ieeexplore.ieee.org
One paradigm for machine learning that is transforming is reinforcement learning, or RL,
promising significant healthcare improvements through personalized treatment …