- Academic Search

Magnetic control of tokamak plasmas through deep reinforcement learning

J Degrave, F Felici, J Buchli, M Neunert, B Tracey… - Nature, 2022 - nature.com

Nuclear fusion using magnetic confinement, in particular in the tokamak configuration, is a
promising path towards sustainable energy. A core challenge is to shape and maintain a …‏

Cited by 939 · All 13 versions

Outracing champion Gran Turismo drivers with deep reinforcement learning‏

PR Wurman, S Barrett, K Kawamoto, J MacGlashan… - Nature, 2022‏ - nature.com‏

Many potential applications of artificial intelligence involve making real-time decisions in
physical systems while interacting with humans. Automobile racing represents an extreme …‏

Cited by 484 · All 9 versions

[PDF] arxiv.org

Acme: A research framework for distributed reinforcement learning‏

MW Hoffman, B Shahriari, J Aslanides… - arxiv preprint arxiv …, 2020‏ - arxiv.org‏

Deep reinforcement learning (RL) has led to many recent and groundbreaking advances.
However, these advances have often come at the cost of both increased scale in the …‏

Cited by 273 · All 2 versions

[PDF] arxiv.org

Beyond supervised continual learning: a review‏

B Bagus, A Gepperth, T Lesort - arxiv preprint arxiv:2208.14307, 2022‏ - arxiv.org‏

Continual Learning (CL, sometimes also termed incremental learning) is a flavor of machine
learning where the usual assumption of stationary data distribution is relaxed or omitted …‏

Cited by 10 · All 3 versions

[PDF] mlr.press

Open source vizier: Distributed infrastructure and api for reliable and flexible blackbox optimization‏

X Song, S Perel, C Lee, G Kochanski… - International …, 2022‏ - proceedings.mlr.press‏

Vizier is the de-facto blackbox optimization service across Google, having optimized some of
Google's largest products and research efforts. To operate at the scale of tuning thousands …‏

Cited by 34 · All 5 versions

[PDF] biorxiv.org

Whole-body simulation of realistic fruit fly locomotion with deep reinforcement learning‏

R Vaxenburg, I Siwanowicz, J Merel, AA Robie… - bioRxiv, 2024‏ - biorxiv.org‏

The body of an animal influences how the nervous system produces behavior. Therefore,
detailed modeling of the neural control of sensorimotor behavior requires a detailed model …‏

Cited by 12 · All 4 versions

[PDF] neurips.cc

Active offline policy selection‏

K Konyushova, Y Chen, T Paine… - Advances in …, 2021‏ - proceedings.neurips.cc‏

This paper addresses the problem of policy selection in domains with abundant logged data,
but with a restricted interaction budget. Solving this problem would enable safe evaluation …‏

Cited by 28 · All 7 versions

[PDF] arxiv.org

Phantom--A RL-driven multi-agent framework to model complex systems‏

L Ardon, J Vann, D Garg, T Spooner… - arxiv preprint arxiv …, 2022‏ - arxiv.org‏

Agent based modelling (ABM) is a computational approach to modelling complex systems
by specifying the behaviour of autonomous decision-making components or agents in the …‏

Cited by 10 · All 5 versions

[PDF] mdpi.com

Increasing the safety of adaptive cruise control using physics-guided reinforcement learning‏

SL Jurj, D Grundt, T Werner, P Borchers, K Rothemann… - Energies, 2021‏ - mdpi.com‏

This paper presents a novel approach for improving the safety of vehicles equipped with
Adaptive Cruise Control (ACC) by making use of Machine Learning (ML) and physical …‏

Cited by 17 · All 6 versions

[PDF] mlr.press

GEAR: a GPU-centric experience replay system for large reinforcement learning models‏

H Wang, MK Sit, C He, Y Wen… - International …, 2023‏ - proceedings.mlr.press‏

This paper introduces a distributed, GPU-centric experience replay system, GEAR, designed
to perform scalable reinforcement learning (RL) with large sequence models (such as …‏

Cited by 1 · All 10 versions

Reverb: a framework for experience replay

Magnetic control of tokamak plasmas through deep reinforcement learning‏
