TY - JOUR
T1 - Model Agnostic Sample Reweighting for Out-of-Distribution Learning
AU - Zhou, Xiao
AU - Lin, Yong
AU - Pi, Renjie
AU - Zhang, Weizhong
AU - Xu, Renzhe
AU - Cui, Peng
AU - Zhang, Tong
N1 - XZ, YL, RP, WZ and TZ acknowledge the funding supported by GRF 16201320. RX and PC acknowledge the funding supported by National Key R&D Program of China (No. 2018AAA0102004), National Natural Science Foundation of China (No. 62141607, U1936219).
PY - 2022
Y1 - 2022
AB - Distributionally robust optimization (DRO) and invariant risk minimization (IRM) are two popular methods proposed to improve the out-of-distribution (OOD) generalization performance of machine learning models. While effective for small models, these methods have been observed to be vulnerable to overfitting with large overparameterized models. This work proposes a principled method, Model Agnostic samPLe rEweighting (MAPLE), to effectively address the OOD problem, especially in overparameterized scenarios. Our key idea is to find an effective reweighting of the training samples so that standard empirical risk minimization training of a large model on the weighted training data leads to superior OOD generalization performance. The overfitting issue is addressed by considering a bilevel formulation to search for the sample reweighting, in which the generalization complexity depends on the search space of sample weights instead of the model size. We present a theoretical analysis in the linear case to prove the insensitivity of MAPLE to model size, and empirically verify that it surpasses state-of-the-art methods by a large margin. Code is available at https://github.com/x-zho14/MAPLE.
UR - http://www.scopus.com/inward/record.url?scp=85163105461&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85163105461&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85163105461
SN - 2640-3498
VL - 162
SP - 27203
EP - 27221
JO - Proceedings of Machine Learning Research
JF - Proceedings of Machine Learning Research
T2 - 39th International Conference on Machine Learning, ICML 2022
Y2 - 17 July 2022 through 23 July 2022
ER -