Contextual combinatorial bandits with probabilistically triggered arms

X Liu, J Zuo, S Wang, JCS Lui… - International …, 2023 - proceedings.mlr.press
We study contextual combinatorial bandits with probabilistically triggered arms (C $^ 2$
MAB-T) under a variety of smoothness conditions that capture a wide range of applications …

Batch-size independent regret bounds for combinatorial semi-bandits with probabilistically triggered arms or independent arms

X Liu, J Zuo, S Wang, C Joe-Wong… - Advances in Neural …, 2022 - proceedings.neurips.cc
In this paper, we study the combinatorial semi-bandits (CMAB) and focus on reducing the
dependency of the batch-size $ K $ in the regret bound, where $ K $ is the total number of …

Learning Context-Aware Probabilistic Maximum Coverage Bandits: A Variance-Adaptive Approach

X Liu, J Zuo, J Wang, Z Wang, Y Xu… - IEEE INFOCOM 2024 …, 2024 - ieeexplore.ieee.org
Probabilistic maximum coverage (PMC) is an important framework that can model many
network applications, including mobile crowdsensing, content delivery, and task repli¬ …

Variance-adaptive algorithm for probabilistic maximum coverage bandits with general feedback

X Liu, J Zuo, H **e, C Joe-Wong… - IEEE INFOCOM 2023 …, 2023 - ieeexplore.ieee.org
Probabilistic maximum coverage (PMC) is an important problem that can model many
network applications, including mobile crowdsensing, network content delivery, and …

Linkselfie: Link selection and fidelity estimation in quantum networks

M Liu, Z Li, X Wang, JCS Lui - IEEE INFOCOM 2024-IEEE …, 2024 - ieeexplore.ieee.org
Reliable transmission of fragile quantum information requires one to efficiently select and
utilize high-fidelity links among multiple noisy quantum links. However, the fidelity, a quality …

Online competitive influence maximization

J Zuo, X Liu, C Joe-Wong, JCS Lui… - International …, 2022 - proceedings.mlr.press
Online influence maximization has attracted much attention as a way to maximize influence
spread through a social network while learning the values of unknown network parameters …

Variance-Aware Bandit Framework for Dynamic Probabilistic Maximum Coverage Problem With Triggered or Self-Reliant Arms

X Dai, X Liu, J Zuo, H **e, C Joe-Wong… - IEEE Transactions on …, 2025 - ieeexplore.ieee.org
The Probabilistic Maximum Coverage (PMC) problem plays a pivotal role in modeling
various network applications, such as mobile crowdsensing, which involves selecting nodes …

Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond

X Liu, S Wang, J Zuo, H Zhong, X Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
We introduce a novel framework of combinatorial multi-armed bandits (CMAB) with
multivariant and probabilistically triggering arms (CMAB-MT), where the outcome of each …

Cost-Effective Online Multi-LLM Selection with Versatile Reward Models

X Dai, J Li, X Liu, A Yu, J Lui - arxiv preprint arxiv:2405.16587, 2024 - arxiv.org
With the rapid advancement of large language models (LLMs), the diversity of multi-LLM
tasks and the variability in their pricing structures have become increasingly important, as …

Fixed confidence community mode estimation

M Pai, N Karamchandani, J Nair - Performance Evaluation, 2023 - Elsevier
Our aim is to estimate the largest community (aka, mode) in a population composed of
multiple disjoint communities. This estimation is performed in a fixed confidence setting via …