Recurrent independent mechanisms

A Goyal, A Lamb, J Hoffmann, S Sodhani… - arxiv preprint arxiv …, 2019 - arxiv.org
Learning modular structures which reflect the dynamics of the environment can lead to better
generalization and robustness to changes which only affect a few of the underlying causes …

Graph information bottleneck for subgraph recognition

J Yu, T Xu, Y Rong, Y Bian, J Huang, R He - arxiv preprint arxiv …, 2020 - arxiv.org
Given the input graph and its label/property, several key problems of graph learning, such as
finding interpretable subgraphs, graph denoising and graph compression, can be attributed …

Significance-aware information bottleneck for domain adaptive semantic segmentation

Y Luo, P Liu, T Guan, J Yu… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
For unsupervised domain adaptation problems, the strategy of aligning the two domains in
latent feature space through adversarial learning has achieved much progress in image …

Multimodal information bottleneck: Learning minimal sufficient unimodal and multimodal representations

S Mai, Y Zeng, H Hu - IEEE Transactions on Multimedia, 2022 - ieeexplore.ieee.org
Learning effective joint embedding for cross-modal data has always been a focus in the field
of multimodal machine learning. We argue that during multimodal fusion, the generated …

Generalization in reinforcement learning with selective noise injection and information bottleneck

M Igl, K Ciosek, Y Li, S Tschiatschek… - Advances in neural …, 2019 - proceedings.neurips.cc
The ability for policies to generalize to new environments is key to the broad application of
RL agents. A promising approach to prevent an agent's policy from overfitting to a limited set …

Causalworld: A robotic manipulation benchmark for causal structure and transfer learning

O Ahmed, F Träuble, A Goyal, A Neitz, Y Bengio… - arxiv preprint arxiv …, 2020 - arxiv.org
Despite recent successes of reinforcement learning (RL), it remains a challenge for agents
to transfer learned skills to related environments. To facilitate research addressing this …

Noveld: A simple yet effective exploration criterion

T Zhang, H Xu, X Wang, Y Wu… - Advances in …, 2021 - proceedings.neurips.cc
Efficient exploration under sparse rewards remains a key challenge in deep reinforcement
learning. Previous exploration methods (eg, RND) have achieved strong results in multiple …

Efficient knowledge distillation from model checkpoints

C Wang, Q Yang, R Huang, S Song… - Advances in Neural …, 2022 - proceedings.neurips.cc
Abstract Knowledge distillation is an effective approach to learn compact models (students)
with the supervision of large and strong models (teachers). As empirically there exists a …

Learning nearly decomposable value functions via communication minimization

T Wang, J Wang, C Zheng, C Zhang - arxiv preprint arxiv:1910.05366, 2019 - arxiv.org
Reinforcement learning encounters major challenges in multi-agent settings, such as
scalability and non-stationarity. Recently, value function factorization learning emerges as a …

Improving subgraph recognition with variational graph information bottleneck

J Yu, J Cao, R He - … of the IEEE/CVF Conference on …, 2022 - openaccess.thecvf.com
Subgraph recognition aims at discovering a compressed substructure of a graph that is most
informative to the graph property. It can be formulated by optimizing Graph Information …