Boosting Test Performance with Importance Sampling--a Subpopulation Perspective

H Shen, Z Zhao - arxiv preprint arxiv:2412.13003, 2024‏ - arxiv.org
Despite empirical risk minimization (ERM) is widely applied in the machine learning
community, its performance is limited on data with spurious correlation or subpopulation that …

Optimizing importance weighting in the presence of sub-population shifts

F Holstege, B Wouters, N van Giersbergen… - arxiv preprint arxiv …, 2024‏ - arxiv.org
A distribution shift between the training and test data can severely harm performance of
machine learning models. Importance weighting addresses this issue by assigning different …