- Academic Search

S Gu, L Yang, Y Du, G Chen, F Walter, J Wang… - arxiv preprint arxiv …, 2022 - arxiv.org

Reinforcement Learning (RL) has achieved tremendous success in many complex decision-
making tasks. However, safety concerns are raised during deploying RL in real-world …

Save Cite Cited by 297 Related articles All 2 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] kcl.ac.uk

A review of safe reinforcement learning: Methods, theories and applications

S Gu, L Yang, Y Du, G Chen, F Walter… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org

Reinforcement Learning (RL) has achieved tremendous success in many complex decision-
making tasks. However, safety concerns are raised during deploying RL in real-world …

Save Cite Cited by 13 Related articles All 8 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Multi-task learning as a bargaining game

A Navon, A Shamsian, I Achituve, H Maron… - arxiv preprint arxiv …, 2022 - arxiv.org

In Multi-task learning (MTL), a joint model is trained to simultaneously make predictions for
several tasks. Joint training reduces computation costs and improves data efficiency; …

Save Cite Cited by 137 Related articles All 4 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] neurips.cc

On the almost sure convergence of stochastic gradient descent in non-convex problems

P Mertikopoulos, N Hallak, A Kavis… - Advances in Neural …, 2020 - proceedings.neurips.cc

In this paper, we analyze the trajectories of stochastic gradient descent (SGD) with the aim of
understanding their convergence properties in non-convex problems. We first show that the …

Save Cite Cited by 113 Related articles All 19 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] mlr.press

AdaGrad avoids saddle points

K Antonakopoulos, P Mertikopoulos… - International …, 2022 - proceedings.mlr.press

Adaptive first-order methods in optimization have widespread ML applications due to their
ability to adapt to non-convex landscapes. However, their convergence guarantees are …

Save Cite Cited by 21 Related articles All 13 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] aps.org

Gradient-descent quantum process tomography by learning Kraus operators

S Ahmed, F Quijandría, AF Kockum - Physical Review Letters, 2023 - APS

We perform quantum process tomography (QPT) for both discrete-and continuous-variable
quantum systems by learning a process representation using Kraus operators. The Kraus …

Save Cite Cited by 32 Related articles All 7 versions Free GPT-4

[Free GPT-4]

[PDF] neurips.cc

Riemannian stochastic optimization methods avoid strict saddle points

YP Hsieh, MR Karimi Jaghargh… - Advances in …, 2024 - proceedings.neurips.cc

Many modern machine learning applications-from online principal component analysis to
covariance matrix identification and dictionary learning-can be formulated as minimization …

Save Cite Cited by 6 Related articles All 15 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] neurips.cc

Robust reinforcement learning via adversarial training with langevin dynamics

P Kamalaruban, YT Huang, YP Hsieh… - Advances in …, 2020 - proceedings.neurips.cc

We introduce a\emph {sampling} perspective to tackle the challenging task of training robust
Reinforcement Learning (RL) agents. Leveraging the powerful Stochastic Gradient Langevin …

Save Cite Cited by 70 Related articles All 9 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] aaai.org

Evaluating model-free reinforcement learning toward safety-critical tasks

L Zhang, Q Zhang, L Shen, B Yuan, X Wang… - Proceedings of the AAAI …, 2023 - ojs.aaai.org

Safety comes first in many real-world applications involving autonomous agents. Despite a
large number of reinforcement learning (RL) methods focusing on safety-critical tasks, there …

Save Cite Cited by 29 Related articles All 5 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Mathematical introduction to deep learning: methods, implementations, and theory

A Jentzen, B Kuckuck, P von Wurstemberger - arxiv preprint arxiv …, 2023 - arxiv.org

This book aims to provide an introduction to the topic of deep learning algorithms. We review
essential components of deep learning algorithms in full mathematical detail including …

Save Cite Cited by 24 Related articles All 3 versions Free GPT-4 View as HTML

Create alert

Cite

Advanced search

Saved to My library

First-order methods almost always avoid saddle points: The case of vanishing step-sizes

A review of safe reinforcement learning: Methods, theory and applications

A review of safe reinforcement learning: Methods, theories and applications

Multi-task learning as a bargaining game

On the almost sure convergence of stochastic gradient descent in non-convex problems

AdaGrad avoids saddle points

Gradient-descent quantum process tomography by learning Kraus operators

Riemannian stochastic optimization methods avoid strict saddle points

Robust reinforcement learning via adversarial training with langevin dynamics

Evaluating model-free reinforcement learning toward safety-critical tasks

Mathematical introduction to deep learning: methods, implementations, and theory