A comprehensive survey: Evaluating the efficiency of artificial intelligence and machine learning techniques on cyber security solutions

M Ozkan-Okay, E Akin, Ö Aslan, S Kosunalp… - IEEe …, 2024 - ieeexplore.ieee.org
Given the continually rising frequency of cyberattacks, the adoption of artificial intelligence
methods, particularly Machine Learning (ML), Deep Learning (DL), and Reinforcement …

Planning with diffusion for flexible behavior synthesis

M Janner, Y Du, JB Tenenbaum, S Levine - arxiv preprint arxiv …, 2022 - arxiv.org
Model-based reinforcement learning methods often use learning only for the purpose of
estimating an approximate dynamics model, offloading the rest of the decision-making work …

Contrastive learning as goal-conditioned reinforcement learning

B Eysenbach, T Zhang, S Levine… - Advances in Neural …, 2022 - proceedings.neurips.cc
In reinforcement learning (RL), it is easier to solve a task if given a good representation.
While deep RL should automatically acquire such good representations, prior work often …

Masked world models for visual control

Y Seo, D Hafner, H Liu, F Liu, S James… - … on Robot Learning, 2023 - proceedings.mlr.press
Visual model-based reinforcement learning (RL) has the potential to enable sample-efficient
robot learning from visual observations. Yet the current approaches typically train a single …

Reinforcement learning with action-free pre-training from videos

Y Seo, K Lee, SL James… - … Conference on Machine …, 2022 - proceedings.mlr.press
Recent unsupervised pre-training methods have shown to be effective on language and
vision domains by learning useful representations for multiple downstream tasks. In this …

Latent plans for task-agnostic offline reinforcement learning

E Rosete-Beas, O Mees, G Kalweit… - … on Robot Learning, 2023 - proceedings.mlr.press
Everyday tasks of long-horizon and comprising a sequence of multiple implicit subtasks still
impose a major challenge in offline robot control. While a number of prior methods aimed to …

Repo: Resilient model-based reinforcement learning by regularizing posterior predictability

C Zhu, M Simchowitz, S Gadipudi… - Advances in Neural …, 2023 - proceedings.neurips.cc
Visual model-based RL methods typically encode image observations into low-dimensional
representations in a manner that does not eliminate redundant information. This leaves them …

Visuo-tactile transformers for manipulation

Y Chen, A Sipos, M Van der Merwe, N Fazeli - arxiv preprint arxiv …, 2022 - arxiv.org
Learning representations in the joint domain of vision and touch can improve manipulation
dexterity, robustness, and sample-complexity by exploiting mutual information and …

Off-policy evaluation for human feedback

Q Gao, G Gao, J Dong, V Tarokh… - Advances in Neural …, 2023 - proceedings.neurips.cc
Off-policy evaluation (OPE) is important for closing the gap between offline training and
evaluation of reinforcement learning (RL), by estimating performance and/or rank of target …

Planning goals for exploration

ES Hu, R Chang, O Rybkin, D Jayaraman - arxiv preprint arxiv …, 2023 - arxiv.org
Dropped into an unknown environment, what should an agent do to quickly learn about the
environment and how to accomplish diverse tasks within it? We address this question within …