Google Scholar

Recent advances in reinforcement learning in finance

B Hambly, R Xu, H Yang - Mathematical Finance, 2023 - Wiley Online Library

The rapid changes in the finance industry due to the increasing amount of data have
revolutionized the techniques on data processing and data analysis and brought new …

Cited by 219; 14 versions

[PDF] springer.com

Advances of machine learning in materials science: Ideas and techniques

SS Chong, YS Ng, HQ Wang, JC Zheng - Frontiers of Physics, 2024 - Springer

In this big data era, the use of large dataset in conjunction with machine learning (ML) has
been increasingly popular in both industry and academia. In recent times, the field of …

Cited by 29; 6 versions

[PDF] nature.com

Efficient and targeted COVID-19 border testing via reinforcement learning

H Bastani, K Drakopoulos, V Gupta, I Vlachogiannis… - Nature, 2021 - nature.com

Throughout the coronavirus disease 2019 (COVID-19) pandemic, countries have relied on a
variety of ad hoc border control protocols to allow for non-essential travel while safeguarding …

Cited by 130; 12 versions

[PDF] neurips.cc

Federated linear contextual bandits

R Huang, W Wu, J Yang… - Advances in neural …, 2021 - proceedings.neurips.cc

This paper presents a novel federated linear contextual bandits model, where individual
clients face different $K$-armed stochastic bandits coupled through common global …

Cited by 87; 10 versions

[PDF] arxiv.org

Feedback efficient online fine-tuning of diffusion models

M Uehara, Y Zhao, K Black, E Hajiramezanali… - arXiv preprint arXiv …, 2024 - arxiv.org

Diffusion models excel at modeling complex data distributions, including those of images,
proteins, and small molecules. However, in many cases, our goal is to model parts of the …

Cited by 24; 7 versions

[PDF] arxiv.org

Bypassing the monster: A faster and simpler optimal algorithm for contextual bandits under realizability

D Simchi-Levi, Y Xu - Mathematics of Operations Research, 2022 - pubsonline.informs.org

We consider the general (stochastic) contextual bandit problem under the realizability
assumption, that is, the expected reward, as a function of contexts and actions, belongs to a …

Cited by 140; 10 versions

[PDF] arxiv.org

The sample complexity of online contract design

B Zhu, S Bates, Z Yang, Y Wang, J Jiao… - arXiv preprint arXiv …, 2022 - arxiv.org

We study the hidden-action principal-agent problem in an online setting. In each round, the
principal posts a contract that specifies the payment to the agent based on each outcome …

Cited by 57; 7 versions

[PDF] mlr.press

Multi-armed bandit experimental design: Online decision-making and adaptive inference

D Simchi-Levi, C Wang - International Conference on …, 2023 - proceedings.mlr.press

Multi-armed bandit has been well-known for its efficiency in online decision-making in terms
of minimizing the loss of the participants' welfare during experiments (i.e., the regret). In …

Cited by 38; 3 versions

[PDF] neurips.cc

Provably efficient Q-learning with low switching cost

Y Bai, T Xie, N Jiang, YX Wang - Advances in Neural …, 2019 - proceedings.neurips.cc

We take initial steps in studying PAC-MDP algorithms with limited adaptivity, that is,
algorithms that change its exploration policy as infrequently as possible during regret …

Cited by 120; 10 versions

[PDF] neurips.cc

Inference for batched bandits

K Zhang, L Janson, S Murphy - Advances in neural …, 2020 - proceedings.neurips.cc

As bandit algorithms are increasingly utilized in scientific studies and industrial applications,
there is an associated increasing need for reliable inference methods based on the resulting …

Cited by 111; 10 versions

Batched multi-armed bandits problem

Recent advances in reinforcement learning in finance