- Academic Search

T Osa - The International Journal of Robotics Research, 2020 - journals.sagepub.com

Existing motion planning methods often have two drawbacks:(1) goal configurations need to
be specified by a user, and (2) only a single solution is generated under a given condition. In …

Enregistrer Citer Cité 72 fois Autres articles Les 5 versions Free GPT-4

Hierarchical reinforcement learning with adaptive scheduling for robot control

Z Huang, Q Liu, F Zhu - Engineering Applications of Artificial Intelligence, 2023 - Elsevier

Conventional hierarchical reinforcement learning (HRL) relies on discrete options to
represent explicitly distinguishable knowledge, which may lead to severe performance …

Enregistrer Citer Cité 5 fois Autres articles Les 2 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Hierarchical reinforcement learning for quadruped locomotion

D Jain, A Iscen, K Caluwaerts - 2019 IEEE/RSJ International …, 2019 - ieeexplore.ieee.org

Legged locomotion is a challenging task for learning algorithms, especially when the task
requires a diverse set of primitive behaviors. To solve these problems, we introduce a …

Enregistrer Citer Cité 56 fois Autres articles Les 6 versions Free GPT-4

[Free GPT-4]

[PDF] mlr.press

Reparameterized policy learning for multimodal trajectory optimization

Z Huang, L Liang, Z Ling, X Li… - … on Machine Learning, 2023 - proceedings.mlr.press

We investigate the challenge of parametrizing policies for reinforcement learning (RL) in
high-dimensional continuous action spaces. Our objective is to develop a multimodal policy …

Enregistrer Citer Cité 9 fois Autres articles Les 8 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] sciencedirect.com

Discovering diverse solutions in deep reinforcement learning by maximizing state–action-based mutual information

T Osa, V Tangkaratt, M Sugiyama - Neural Networks, 2022 - Elsevier

Reinforcement learning algorithms are typically limited to learning a single solution for a
specified task, even though diverse solutions often exist. Recent studies showed that …

Enregistrer Citer Cité 28 fois Autres articles Les 8 versions Free GPT-4

[Free GPT-4]

[PDF] neurips.cc

Learning compositional neural programs with recursive tree search and planning

T Pierrot, G Ligner, SE Reed… - Advances in …, 2019 - proceedings.neurips.cc

We propose a novel reinforcement learning algorithm, AlphaNPI, that incorpo-rates the
strengths of Neural Programmer-Interpreters (NPI) and AlphaZero. NPI contributes structural …

Enregistrer Citer Cité 47 fois Autres articles Les 9 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

Motion planning by learning the solution manifold in trajectory optimization

T Osa - The International Journal of Robotics Research, 2022 - journals.sagepub.com

The objective function used in trajectory optimization is often non-convex and can have an
infinite set of local optima. In such cases, there are diverse solutions to perform a given task …

Enregistrer Citer Cité 22 fois Autres articles Les 5 versions Free GPT-4

Spatial memory-augmented visual navigation based on hierarchical deep reinforcement learning in unknown environments

S **, X Wang, Q Meng - Knowledge-Based Systems, 2024 - Elsevier

Visual navigation in unknown environments poses significant challenges due to the
presence of many obstacles and low-texture scenes. These factors may cause frequent …

Enregistrer Citer Cité 20 fois Autres articles Les 2 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Multipolar: Multi-source policy aggregation for transfer reinforcement learning between diverse environmental dynamics

M Barekatain, R Yonetani, M Hamaya - arxiv preprint arxiv:1909.13111, 2019 - arxiv.org

Transfer reinforcement learning (RL) aims at improving the learning efficiency of an agent by
exploiting knowledge from other source agents trained on relevant tasks. However, it …

Enregistrer Citer Cité 32 fois Autres articles Les 8 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

Reinforcement learning from hierarchical critics

Z Cao, CT Lin - IEEE Transactions on Neural Networks and …, 2021 - ieeexplore.ieee.org

In this study, we investigate the use of global information to speed up the learning process
and increase the cumulative rewards of reinforcement learning (RL) in competition tasks …

Enregistrer Citer Cité 28 fois Autres articles Les 9 versions Free GPT-4

Créer l'alerte

Citer

Recherche avancée

Enregistré dans Ma bibliothèque

Hierarchical reinforcement learning via advantage-weighted information maximization

Multimodal trajectory optimization for motion planning

Hierarchical reinforcement learning with adaptive scheduling for robot control

Hierarchical reinforcement learning for quadruped locomotion

Reparameterized policy learning for multimodal trajectory optimization

Discovering diverse solutions in deep reinforcement learning by maximizing state–action-based mutual information

Learning compositional neural programs with recursive tree search and planning

Motion planning by learning the solution manifold in trajectory optimization

Spatial memory-augmented visual navigation based on hierarchical deep reinforcement learning in unknown environments

Multipolar: Multi-source policy aggregation for transfer reinforcement learning between diverse environmental dynamics

Reinforcement learning from hierarchical critics