A survey of machine learning techniques applied to software defined networking (SDN): Research issues and challenges

J **e, FR Yu, T Huang, R **e, J Liu… - … Surveys & Tutorials, 2018 - ieeexplore.ieee.org
In recent years, with the rapid development of current Internet and mobile communication
technologies, the infrastructure, devices and resources in networking systems are becoming …

Dual mirror descent for online allocation problems

S Balseiro, H Lu, V Mirrokni - International Conference on …, 2020 - proceedings.mlr.press
We consider online allocation problems with concave revenue functions and resource
constraints, which are central problems in revenue management and online advertising. In …

Reward is enough for convex mdps

T Zahavy, B O'Donoghue… - Advances in Neural …, 2021 - proceedings.neurips.cc
Maximising a cumulative reward function that is Markov and stationary, ie, defined over state-
action pairs and independent of time, is sufficient to capture many kinds of goals in a Markov …

Adaptive algorithms for online convex optimization with long-term constraints

R Jenatton, J Huang… - … Conference on Machine …, 2016 - proceedings.mlr.press
We present an adaptive online gradient descent algorithm to solve online convex
optimization problems with long-term constraints, which are constraints that need to be …

Linear contextual bandits with knapsacks

S Agrawal, N Devanur - Advances in neural information …, 2016 - proceedings.neurips.cc
We consider the linear contextual bandit problem with resource consumption, in addition to
reward generation. In each round, the outcome of pulling an arm is a reward as well as a …

Network revenue management with nonparametric demand learning:\sqrt {T}-regret and polynomial dimension dependency

S Miao, Y Wang - Available at SSRN 3948140, 2021 - papers.ssrn.com
This paper studies the classic price-based network revenue management (NRM) problem
with demand learning. The retailer dynamically decides prices of n products over a finite …

Autobidding with constraints

G Aggarwal, A Badanidiyuru, A Mehta - … , WINE 2019, New York, NY, USA …, 2019 - Springer
Autobidding is becoming increasingly important in the domain of online advertising, and has
become a critical tool used by many advertisers for optimizing their ad campaigns. We …

Fair dynamic rationing

V Manshadi, R Niazadeh, S Rodilitz - … of the 22nd ACM Conference on …, 2021 - dl.acm.org
We study the allocative challenges that governmental and nonprofit organizations face when
tasked with equitable and efficient rationing of a social good among agents whose needs …

The bayesian prophet: A low-regret framework for online decision making

A Vera, S Banerjee - ACM SIGMETRICS Performance Evaluation …, 2019 - dl.acm.org
Motivated by the success of using black-box predictive algorithms as subroutines for online
decision-making, we develop a new framework for designing online policies given access to …

Regularized online allocation problems: Fairness and beyond

S Balseiro, H Lu, V Mirrokni - International Conference on …, 2021 - proceedings.mlr.press
Online allocation problems with resource constraints have a rich history in computer science
and operations research. In this paper, we introduce the regularized online allocation …