Reinforcement learning algorithms: A brief survey

AK Shakya, G Pillai, S Chakrabarty - Expert Systems with Applications, 2023 - Elsevier
Reinforcement Learning (RL) is a machine learning (ML) technique to learn sequential
decision-making in complex problems. RL is inspired by trial-and-error based human/animal …

Intelligent control of multilegged robot smooth motion: a review

Y Zhao, J Wang, G Cao, Y Yuan, X Yao, L Qi - IEEE Access, 2023 - ieeexplore.ieee.org
Motion control is crucial for multilegged robot locomotion and task completion. This study
aims to address the fundamental challenges of inadequate foot tracking and weak leg …

Genie: Generative interactive environments

J Bruce, MD Dennis, A Edwards… - … on Machine Learning, 2024 - openreview.net
We introduce Genie, the first* generative interactive environment* trained in an
unsupervised manner from unlabelled Internet videos. The model can be prompted to …

Look before you leap: Unveiling the power of gpt-4v in robotic vision-language planning

Y Hu, F Lin, T Zhang, L Yi, Y Gao - arxiv preprint arxiv:2311.17842, 2023 - arxiv.org
In this study, we are interested in imbuing robots with the capability of physically-grounded
task planning. Recent advancements have shown that large language models (LLMs) …

Driving into the future: Multiview visual forecasting and planning with world model for autonomous driving

Y Wang, J He, L Fan, H Li, Y Chen… - Proceedings of the …, 2024 - openaccess.thecvf.com
In autonomous driving predicting future events in advance and evaluating the foreseeable
risks empowers autonomous vehicles to plan their actions enhancing safety and efficiency …

Toward causal representation learning

B Schölkopf, F Locatello, S Bauer, NR Ke… - Proceedings of the …, 2021 - ieeexplore.ieee.org
The two fields of machine learning and graphical causality arose and are developed
separately. However, there is, now, cross-pollination and increasing interest in both fields to …

Predrnn: A recurrent neural network for spatiotemporal predictive learning

Y Wang, H Wu, J Zhang, Z Gao, J Wang… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org
The predictive learning of spatiotemporal sequences aims to generate future images by
learning from the historical context, where the visual dynamics are believed to have modular …

Mastering atari with discrete world models

D Hafner, T Lillicrap, M Norouzi, J Ba - arxiv preprint arxiv:2010.02193, 2020 - arxiv.org
Intelligent agents need to generalize from past experience to achieve goals in complex
environments. World models facilitate such generalization and allow learning behaviors …

Offline reinforcement learning: Tutorial, review, and perspectives on open problems

S Levine, A Kumar, G Tucker, J Fu - arxiv preprint arxiv:2005.01643, 2020 - arxiv.org
In this tutorial article, we aim to provide the reader with the conceptual tools needed to get
started on research on offline reinforcement learning algorithms: reinforcement learning …

Can deep learning beat numerical weather prediction?

MG Schultz, C Betancourt, B Gong… - … of the Royal …, 2021 - royalsocietypublishing.org
The recent hype about artificial intelligence has sparked renewed interest in applying the
successful deep learning (DL) methods for image recognition, speech recognition, robotics …