Ai alignment: A comprehensive survey

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arxiv preprint arxiv …, 2023 - arxiv.org
AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, so do risks from misalignment. To provide a comprehensive …

Reconstructing computational system dynamics from neural data with recurrent neural networks

D Durstewitz, G Koppe, MI Thurm - Nature Reviews Neuroscience, 2023 - nature.com
Computational models in neuroscience usually take the form of systems of differential
equations. The behaviour of such systems is the subject of dynamical systems theory …

Foundational challenges in assuring alignment and safety of large language models

U Anwar, A Saparov, J Rando, D Paleka… - arxiv preprint arxiv …, 2024 - arxiv.org
This work identifies 18 foundational challenges in assuring the alignment and safety of large
language models (LLMs). These challenges are organized into three different categories …

Deep long-tailed learning: A survey

Y Zhang, B Kang, B Hooi, S Yan… - IEEE transactions on …, 2023 - ieeexplore.ieee.org
Deep long-tailed learning, one of the most challenging problems in visual recognition, aims
to train well-performing deep models from a large number of images that follow a long-tailed …

Sharpness-aware gradient matching for domain generalization

P Wang, Z Zhang, Z Lei… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
The goal of domain generalization (DG) is to enhance the generalization capability of the
model learned from a source domain to other unseen domains. The recently developed …

Last layer re-training is sufficient for robustness to spurious correlations

P Kirichenko, P Izmailov, AG Wilson - arxiv preprint arxiv:2204.02937, 2022 - arxiv.org
Neural network classifiers can largely rely on simple spurious features, such as
backgrounds, to make predictions. However, even in these cases, we show that they still …

Towards out-of-distribution generalization: A survey

J Liu, Z Shen, Y He, X Zhang, R Xu, H Yu… - arxiv preprint arxiv …, 2021 - arxiv.org
Traditional machine learning paradigms are based on the assumption that both training and
test data follow the same statistical pattern, which is mathematically referred to as …

Federated learning for generalization, robustness, fairness: A survey and benchmark

W Huang, M Ye, Z Shi, G Wan, H Li… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Federated learning has emerged as a promising paradigm for privacy-preserving
collaboration among different parties. Recently, with the popularity of federated learning, an …

Federated domain generalization with generalization adjustment

R Zhang, Q Xu, J Yao, Y Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract Federated Domain Generalization (FedDG) attempts to learn a global model in a
privacy-preserving manner that generalizes well to new clients possibly with domain shift …

Domain generalization: A survey

K Zhou, Z Liu, Y Qiao, T **ang… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Generalization to out-of-distribution (OOD) data is a capability natural to humans yet
challenging for machines to reproduce. This is because most learning algorithms strongly …