Μελετητής Google

R Degenne, WM Koolen - Advances in Neural Information …, 2019 - proceedings.neurips.cc

We determine the sample complexity of pure exploration bandit problems with multiple good
answers. We derive a lower bound using a new game equilibrium argument. We show how …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 89 Σχετικά άρθρα Όλες οι 14 εκδοχές Προβολή ως HTML

[Free GPT-4]

[PDF] arxiv.org

Partially observable total-cost Markov decision processes with weakly continuous transition probabilities

EA Feinberg, PO Kasyanov… - … of Operations Research, 2016 - pubsonline.informs.org

This paper describes sufficient conditions for the existence of optimal policies for partially
observable Markov decision processes (POMDPs) with Borel state, observation, and action …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 99 Σχετικά άρθρα Όλες οι 9 εκδοχές

[Free GPT-4]

[PDF] nsf.gov

On the feasibility and continuity of feedback controllers defined by multiple control barrier functions

A Isaly, M Ghanbarpour, RG Sanfelice… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Control barrier functions are a popular method for encoding safety specifications for
dynamical systems. In this paper, a notion of control barrier function is defined that permits …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 19 Σχετικά άρθρα Όλες οι 7 εκδοχές

[Free GPT-4]

[PDF] arxiv.org

Optimality conditions for inventory control

EA Feinberg - … Challenges in Complex, Networked and Risky …, 2016 - pubsonline.informs.org

This tutorial describes recently developed general optimality conditions for Markov decision
processes that have significant applications to inventory control. In particular, these …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 52 Σχετικά άρθρα Όλες οι 3 εκδοχές

[Free GPT-4]

[PDF] arxiv.org

Learning optimal antenna tilt control policies: A contextual linear bandits approach

F Vannella, A Proutiere, Y Jedra… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Controlling antenna tilts in cellular networks is critical to achieve a good trade-off between
network coverage and capacity. We devise algorithms learning optimal tilt control policies …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 14 Σχετικά άρθρα Όλες οι 5 εκδοχές

[Free GPT-4]

[PDF] arxiv.org

Continuity of discounted values and the structure of optimal policies for periodic‐review inventory systems with setup costs

EA Feinberg, DN Kraemer - Naval Research Logistics (NRL), 2023 - Wiley Online Library

This paper proves continuity of value functions in discounted periodic‐review single‐
commodity total‐cost inventory control problems with continuous inventory levels, fixed …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 7 Σχετικά άρθρα Όλες οι 4 εκδοχές

[Free GPT-4]

[PDF] arxiv.org

Convergence of probability measures and Markov decision models with incomplete information

EA Feinberg, PO Kasyanov, MZ Zgurovsky - Proceedings of the Steklov …, 2014 - Springer

This paper deals with three major types of convergence of probability measures on metric
spaces: weak convergence, setwise convergence, and convergence in total variation. First, it …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 38 Σχετικά άρθρα Όλες οι 10 εκδοχές

[Free GPT-4]

[PDF] arxiv.org

Saturated total-population dependent branching process and viral markets

K Agarwal, V Kavitha - 2022 IEEE 61st Conference on Decision …, 2022 - ieeexplore.ieee.org

Interesting posts are continually forwarded by the users of the online social network (OSN).
Such propagation leads to re-forwarding of the post to some of the previous recipients …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 8 Σχετικά άρθρα Όλες οι 3 εκδοχές

[Free GPT-4]

[PDF] nsf.gov

Structure of optimal policies to periodic-review inventory models with convex costs and backorders for all values of discount factors

EA Feinberg, Y Liang - Annals of Operations Research, 2022 - Springer

This paper describes the structure of optimal policies for discounted periodic-review single-
commodity total-cost inventory control problems with fixed ordering costs for finite and …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 13 Σχετικά άρθρα Όλες οι 7 εκδοχές

[Free GPT-4]

[PDF] wiley.com

On the convergence of optimal actions for Markov decision processes and the optimality of (s, S) inventory policies

EA Feinberg, ME Lewis - Naval Research Logistics (NRL), 2018 - Wiley Online Library

This article studies convergence properties of optimal values and actions for discounted and
average‐cost Markov decision processes (MDPs) with weakly continuous transition …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 31 Σχετικά άρθρα Όλες οι 10 εκδοχές

Δημιουργία ειδοποίησης

Παράθεση

Σύνθετη αναζήτηση

Αποθηκεύτηκε στη Βιβλιοθήκη μου

Bergeʼs maximum theorem for noncompact image sets

Pure exploration with multiple correct answers

Partially observable total-cost Markov decision processes with weakly continuous transition probabilities

On the feasibility and continuity of feedback controllers defined by multiple control barrier functions

Optimality conditions for inventory control

Learning optimal antenna tilt control policies: A contextual linear bandits approach

Continuity of discounted values and the structure of optimal policies for periodic‐review inventory systems with setup costs

Convergence of probability measures and Markov decision models with incomplete information

Saturated total-population dependent branching process and viral markets

Structure of optimal policies to periodic-review inventory models with convex costs and backorders for all values of discount factors

On the convergence of optimal actions for Markov decision processes and the optimality of (s, S) inventory policies