
Continuous control with deep reinforcement learning. CoRR abs/1509.02971 (2015)


Classic meets modern: A pragmatic learning-based congestion control for the internet

S Abbasloo, CY Yen, HJ Chao - … of the Annual conference of the ACM …, 2020 - dl.acm.org

These days, taking the revolutionary approach of using clean-slate learning-based designs
to completely replace the classic congestion control schemes for the Internet is gaining …

Cited by 247 · All 3 versions


Coarse-to-fine Q-attention: Efficient learning for visual robotic manipulation via discretisation

S James, K Wada, T Laidlow… - Proceedings of the …, 2022 - openaccess.thecvf.com

We present a coarse-to-fine discretisation method that enables the use of discrete
reinforcement learning approaches in place of unstable and data-inefficient actor-critic …

Cited by 130 · All 5 versions


AuTO: Scaling deep reinforcement learning for datacenter-scale automatic traffic optimization

L Chen, J Lingys, K Chen, F Liu - Proceedings of the 2018 conference of …, 2018 - dl.acm.org

Traffic optimizations (TO, e.g., flow scheduling, load balancing) in datacenters are difficult
online decision-making problems. Previously, they are done with heuristics relying on …

Cited by 334 · All 9 versions


On the effectiveness of fine-tuning versus meta-reinforcement learning

M Zhao, P Abbeel, S James - Advances in neural …, 2022 - proceedings.neurips.cc

Intelligent agents should have the ability to leverage knowledge from previously learned
tasks in order to learn new ones quickly and efficiently. Meta-learning approaches have …

Cited by 55 · All 6 versions


Learning basketball dribbling skills using trajectory optimization and deep reinforcement learning

L Liu, J Hodgins - ACM Transactions on Graphics (TOG), 2018 - dl.acm.org

Basketball is one of the world's most popular sports because of the agility and speed
demonstrated by the players. This agility and speed makes designing controllers to realize …

Cited by 194 · All 2 versions


Learning to schedule control fragments for physics-based characters using deep Q-learning

L Liu, J Hodgins - ACM Transactions on Graphics (TOG), 2017 - dl.acm.org

Given a robust control system, physical simulation offers the potential for interactive human
characters that move in realistic and responsive ways. In this article, we describe how to …

Cited by 178 · All 6 versions


SustainGym: Reinforcement learning environments for sustainable energy systems

C Yeh, V Li, R Datta, J Arroyo… - Advances in …, 2023 - proceedings.neurips.cc

The lack of standardized benchmarks for reinforcement learning (RL) in sustainability
applications has made it difficult to both track progress on specific domains and identify …

Cited by 17 · All 5 versions


Physics-based motion capture imitation with deep reinforcement learning

N Chentanez, M Müller, M Macklin… - Proceedings of the 11th …, 2018 - dl.acm.org

We introduce a deep reinforcement learning method that learns to control articulated
humanoid bodies to imitate given target motions closely when simulated in a physics …

Cited by 105 · All 2 versions


Urban driving with multi-objective deep reinforcement learning

C Li, K Czarnecki - arXiv preprint arXiv:1811.08586, 2018 - arxiv.org

Autonomous driving is a challenging domain that entails multiple aspects: a vehicle should
be able to drive to its destination as fast as possible while avoiding collision, obeying traffic …

Cited by 102 · All 5 versions


Learning hierarchical teaching policies for cooperative agents

DK Kim, M Liu, S Omidshafiei, S Lopez-Cot… - arXiv preprint arXiv …, 2019 - arxiv.org

Collective learning can be greatly enhanced when agents effectively exchange knowledge
with their peers. In particular, recent work studying agents that learn to teach other …

Cited by 39 · All 9 versions

