Lvlm-ehub: A comprehensive evaluation benchmark for large vision-language models

P Xu, W Shao, K Zhang, P Gao, S Liu… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Large Vision-Language Models (LVLMs) have recently played a dominant role in
multimodal vision-language learning. Despite the great success, it lacks a holistic evaluation …

Efficient online reinforcement learning with offline data

PJ Ball, L Smith, I Kostrikov… - … Conference on Machine …, 2023 - proceedings.mlr.press
Sample efficiency and exploration remain major challenges in online reinforcement learning
(RL). A powerful approach that can be applied to address these issues is the inclusion of …

Transformers in reinforcement learning: a survey

P Agarwal, AA Rahman, PL St-Charles… - arxiv preprint arxiv …, 2023 - arxiv.org
Transformers have significantly impacted domains like natural language processing,
computer vision, and robotics, where they improve performance compared to other neural …

Advancing 3D bioprinting through machine learning and artificial intelligence

S Ramesh, A Deep, A Tamayol, A Kamaraj, C Mahajan… - Bioprinting, 2024 - Elsevier
Abstract 3D bioprinting, a vital tool in tissue engineering, drug testing, and disease
modeling, is increasingly integrated with machine learning (ML) and artificial intelligence …

Synthetic experience replay

C Lu, P Ball, YW Teh… - Advances in Neural …, 2024 - proceedings.neurips.cc
A key theme in the past decade has been that when large neural networks and large
datasets combine they can produce remarkable results. In deep reinforcement learning (RL) …

Diffusion reward: Learning rewards via conditional video diffusion

T Huang, G Jiang, Y Ze, H Xu - European Conference on Computer Vision, 2024 - Springer
Learning rewards from expert videos offers an affordable and effective solution to specify the
intended behaviors for reinforcement learning (RL) tasks. In this work, we propose Diffusion …

Revisiting the minimalist approach to offline reinforcement learning

D Tarasov, V Kurenkov, A Nikulin… - Advances in Neural …, 2024 - proceedings.neurips.cc
Recent years have witnessed significant advancements in offline reinforcement learning
(RL), resulting in the development of numerous algorithms with varying degrees of …

Policy expansion for bridging offline-to-online reinforcement learning

H Zhang, W Xu, H Yu - arxiv preprint arxiv:2302.00935, 2023 - arxiv.org
Pre-training with offline data and online fine-tuning using reinforcement learning is a
promising strategy for learning control policies by leveraging the best of both worlds in terms …

Ignorance is bliss: Robust control via information gating

M Tomar, R Islam, M Taylor… - Advances in Neural …, 2023 - proceedings.neurips.cc
Informational parsimony provides a useful inductive bias for learning representations that
achieve better generalization by being robust to noise and spurious correlations. We …

Understanding and addressing the pitfalls of bisimulation-based representations in offline reinforcement learning

H Zang, X Li, L Zhang, Y Liu, B Sun… - Advances in …, 2024 - proceedings.neurips.cc
While bisimulation-based approaches hold promise for learning robust state representations
for Reinforcement Learning (RL) tasks, their efficacy in offline RL tasks has not been up to …