Domain generalization: A survey

K Zhou, Z Liu, Y Qiao, T **ang… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Generalization to out-of-distribution (OOD) data is a capability natural to humans yet
challenging for machines to reproduce. This is because most learning algorithms strongly …

A survey on semantic communications for intelligent wireless networks

S Iyer, R Khanai, D Torse, RJ Pandya… - Wireless Personal …, 2023 - Springer
Research on intelligent wireless network aims at the development of a human society which
is ubiquitous and mobile, simultaneously providing solutions to the coverage, capacity, and …

The dormant neuron phenomenon in deep reinforcement learning

G Sokar, R Agarwal, PS Castro… - … Conference on Machine …, 2023 - proceedings.mlr.press
In this work we identify the dormant neuron phenomenon in deep reinforcement learning,
where an agent's network suffers from an increasing number of inactive neurons, thereby …

A taxonomy and review of generalization research in NLP

D Hupkes, M Giulianelli, V Dankers, M Artetxe… - Nature Machine …, 2023 - nature.com
The ability to generalize well is one of the primary desiderata for models of natural language
processing (NLP), but what 'good generalization'entails and how it should be evaluated is …

A survey on safety-critical driving scenario generation—a methodological perspective

W Ding, C Xu, M Arief, H Lin, B Li… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Autonomous driving systems have witnessed significant development during the past years
thanks to the advance in machine learning-enabled sensing and decision-making …

Evolving curricula with regret-based environment design

J Parker-Holder, M Jiang, M Dennis… - International …, 2022 - proceedings.mlr.press
Training generally-capable agents with reinforcement learning (RL) remains a significant
challenge. A promising avenue for improving the robustness of RL agents is through the use …

Human-timescale adaptation in an open-ended task space

AA Team, J Bauer, K Baumli, S Baveja… - arxiv preprint arxiv …, 2023 - arxiv.org
Foundation models have shown impressive adaptation and scalability in supervised and self-
supervised learning problems, but so far these successes have not fully translated to …

Human-timescale adaptation in an open-ended task space

J Bauer, K Baumli, F Behbahani… - International …, 2023 - proceedings.mlr.press
Foundation models have shown impressive adaptation and scalability in supervised and self-
supervised learning problems, but so far these successes have not fully translated to …

Recurrent model-free rl can be a strong baseline for many pomdps

T Ni, B Eysenbach, R Salakhutdinov - arxiv preprint arxiv:2110.05038, 2021 - arxiv.org
Many problems in RL, such as meta-RL, robust RL, generalization in RL, and temporal credit
assignment, can be cast as POMDPs. In theory, simply augmenting model-free RL with …

Goal misgeneralization in deep reinforcement learning

LL Di Langosco, J Koch, LD Sharkey… - International …, 2022 - proceedings.mlr.press
We study goal misgeneralization, a type of out-of-distribution robustness failure in
reinforcement learning (RL). Goal misgeneralization occurs when an RL agent retains its …