Ai alignment: A comprehensive survey

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arxiv preprint arxiv …, 2023 - arxiv.org
AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, so do risks from misalignment. To provide a comprehensive …

Frameworks and results in distributionally robust optimization

H Rahimian, S Mehrotra - Open Journal of Mathematical Optimization, 2022 - numdam.org
The concepts of risk aversion, chance-constrained optimization, and robust optimization
have developed significantly over the last decade. The statistical learning community has …

Towards out-of-distribution generalization: A survey

J Liu, Z Shen, Y He, X Zhang, R Xu, H Yu… - arxiv preprint arxiv …, 2021 - arxiv.org
Traditional machine learning paradigms are based on the assumption that both training and
test data follow the same statistical pattern, which is mathematically referred to as …

Environment inference for invariant learning

E Creager, JH Jacobsen… - … Conference on Machine …, 2021 - proceedings.mlr.press
Learning models that gracefully handle distribution shifts is central to research on domain
generalization, robust optimization, and fairness. A promising formulation is domain …

Fairness without demographics in repeated loss minimization

T Hashimoto, M Srivastava… - International …, 2018 - proceedings.mlr.press
Abstract Machine learning models (eg, speech recognizers) trained on average loss suffer
from representation disparity—minority groups (eg, non-native speakers) carry less weight in …

Certifying some distributional robustness with principled adversarial training

A Sinha, H Namkoong, R Volpi, J Duchi - arxiv preprint arxiv:1710.10571, 2017 - arxiv.org
Neural networks are vulnerable to adversarial examples and researchers have proposed
many heuristic attack and defense mechanisms. We address this problem through the …

Learning models with uniform performance via distributionally robust optimization

JC Duchi, H Namkoong - The Annals of Statistics, 2021 - projecteuclid.org
Learning models with uniform performance via distributionally robust optimization Page 1 The
Annals of Statistics 2021, Vol. 49, No. 3, 1378–1406 https://doi.org/10.1214/20-AOS2004 © …

Measure and improve robustness in NLP models: A survey

X Wang, H Wang, D Yang - arxiv preprint arxiv:2112.08313, 2021 - arxiv.org
As NLP models achieved state-of-the-art performances over benchmarks and gained wide
applications, it has been increasingly important to ensure the safe deployment of these …

Invariant risk minimization games

K Ahuja, K Shanmugam, K Varshney… - International …, 2020 - proceedings.mlr.press
The standard risk minimization paradigm of machine learning is brittle when operating in
environments whose test distributions are different from the training distribution due to …

Variance-based regularization with convex objectives

J Duchi, H Namkoong - Journal of Machine Learning Research, 2019 - jmlr.org
We develop an approach to risk minimization and stochastic optimization that provides a
convex surrogate for variance, allowing near-optimal and computationally efficient trading …