Pervasive AI for IoT applications: A survey on resource-efficient distributed artificial intelligence

E Baccour, N Mhaisen, AA Abdellatif… - … Surveys & Tutorials, 2022 - ieeexplore.ieee.org
Artificial intelligence (AI) has witnessed a substantial breakthrough in a variety of Internet of
Things (IoT) applications and services, spanning from recommendation systems and speech …

Methods for robot behavior adaptation for cognitive neurorehabilitation

A Kubota, LD Riek - Annual review of control, robotics, and …, 2022 - annualreviews.org
An estimated 11% of adults report experiencing some form of cognitive decline, which may
be associated with conditions such as stroke or dementia and can impact their memory …

Hierarchical bayesian bandits

J Hong, B Kveton, M Zaheer… - International …, 2022 - proceedings.mlr.press
Abstract Meta-, multi-task, and federated learning can be all viewed as solving similar tasks,
drawn from a distribution that reflects task similarities. We provide a unified view of all these …

Online clustering of bandits with misspecified user models

Z Wang, J **e, X Liu, S Li, J Lui - Advances in Neural …, 2024 - proceedings.neurips.cc
The contextual linear bandit is an important online learning problem where given arm
features, a learning agent selects an arm at each round to maximize the cumulative rewards …

Conversational recommendation with online learning and clustering on misspecified users

X Dai, Z Wang, J **e, X Liu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
In the domain of conversational recommendation systems (CRSs), the development of
recommenders capable of eliciting user preferences through conversation has marked a …

Metadata-based multi-task bandits with bayesian hierarchical models

R Wan, L Ge, R Song - Advances in Neural Information …, 2021 - proceedings.neurips.cc
How to explore efficiently is a central problem in multi-armed bandits. In this paper, we
introduce the metadata-based multi-task bandit problem, where the agent needs to solve a …

Provably efficient multi-task reinforcement learning with model transfer

C Zhang, Z Wang - Advances in Neural Information …, 2021 - proceedings.neurips.cc
We study multi-task reinforcement learning (RL) in tabular episodic Markov decision
processes (MDPs). We formulate a heterogeneous multi-player RL problem, in which a …

A gang of adversarial bandits

M Herbster, S Pasteris, F Vitale… - Advances in Neural …, 2021 - proceedings.neurips.cc
We consider running multiple instances of multi-armed bandit (MAB) problems in parallel. A
main motivation for this study are online recommendation systems, in which each of $ N …

Decentralized randomly distributed multi-agent multi-armed bandit with heterogeneous rewards

M Xu, D Klabjan - Advances in Neural Information …, 2024 - proceedings.neurips.cc
We study a decentralized multi-agent multi-armed bandit problem in which multiple clients
are connected by time dependent random graphs provided by an environment. The reward …

Multi-agent learning with heterogeneous linear contextual bandits

A Do, T Nguyen-Tang, R Arora - Advances in Neural …, 2023 - proceedings.neurips.cc
As trained intelligent systems become increasingly pervasive, multiagent learning has
emerged as a popular framework for studying complex interactions between autonomous …