[HTML][HTML] Deep reinforcement learning for inventory control: A roadmap

RN Boute, J Gijsbrechts, W Van Jaarsveld… - European Journal of …, 2022 - Elsevier
Deep reinforcement learning (DRL) has shown great potential for sequential decision-
making, including early developments in inventory control. Yet, the abundance of choices …

A state‐of‐the‐art review of optimal reservoir control for managing conflicting demands in a changing world

M Giuliani, JR Lamontagne, PM Reed… - Water Resources …, 2021 - Wiley Online Library
The state of the art for optimal water reservoir operations is rapidly evolving, driven by
emerging societal challenges. Changing values for balancing environmental resources …

Legged locomotion in challenging terrains using egocentric vision

A Agarwal, A Kumar, J Malik… - Conference on robot …, 2023 - proceedings.mlr.press
Animals are capable of precise and agile locomotion using vision. Replicating this ability
has been a long-standing goal in robotics. The traditional approach has been to decompose …

Provable benefits of actor-critic methods for offline reinforcement learning

A Zanette, MJ Wainwright… - Advances in neural …, 2021 - proceedings.neurips.cc
Actor-critic methods are widely used in offline reinforcement learningpractice, but are not so
well-understood theoretically. We propose a newoffline actor-critic algorithm that naturally …

[LIVRE][B] Mathematical control theory for stochastic partial differential equations

Q Lü, X Zhang - 2021 - Springer
It is well-known that Control Theory was founded by N. Wiener in 1948 ([349]). After that, this
theory was greatly extended to various complicated setting and widely used in sciences and …

[PDF][PDF] Foundations of machine learning

M Mohri - 2018 - dlib.hust.edu.vn
A new edition of a graduate-level machine learning textbook that focuses on the analysis
and theory of algorithms. This book is a general introduction to machine learning that can …

Neuromodulated spike-timing-dependent plasticity, and theory of three-factor learning rules

N Frémaux, W Gerstner - Frontiers in neural circuits, 2016 - frontiersin.org
Classical Hebbian learning puts the emphasis on joint pre-and postsynaptic activity, but
neglects the potential role of neuromodulators. Since neuromodulators convey information …

Model-based reinforcement learning with a generative model is minimax optimal

A Agarwal, S Kakade, LF Yang - Conference on Learning …, 2020 - proceedings.mlr.press
This work considers the sample and computational complexity of obtaining an $\epsilon $-
optimal policy in a discounted Markov Decision Process (MDP), given only access to a …

[LIVRE][B] Scheduling

ML Pinedo - 2012 - Springer
Michael L. Pinedo Theory, Algorithms, and Systems Sixth Edition Page 1 Scheduling Michael L.
Pinedo Theory, Algorithms, and Systems Sixth Edition Page 2 Scheduling Page 3 Michael L …

[LIVRE][B] Planning algorithms

SM LaValle - 2006 - books.google.com
Planning algorithms are impacting technical disciplines and industries around the world,
including robotics, computer-aided design, manufacturing, computer graphics, aerospace …