TY - JOUR
T1 - Interpolation for Robust Learning
T2 - 40th International Conference on Machine Learning, ICML 2023
AU - Zhu, Jiacheng
AU - Qiu, Jielin
AU - Guha, Aritra
AU - Yang, Zhuolin
AU - Nguyen, Xuan Long
AU - Li, Bo
AU - Zhao, Ding
N1 - This work is partially supported by the Defense Advanced Research Projects Agency (DARPA) under Contract No. HR00112320012. We would like to thank Kilian Fatras, Lin Gui, and anonymous reviewers for their valuable feedback and comments on our manuscript.
PY - 2023
Y1 - 2023
N2 - We propose to study and promote the robustness of a model as per its performance through the interpolation of training data distributions. Specifically, (1) we augment the data by finding the worst-case Wasserstein barycenter on the geodesic connecting subpopulation distributions of different categories. (2) We regularize the model for smoother performance on the continuous geodesic path connecting subpopulation distributions. (3) Additionally, we provide a theoretical guarantee of robustness improvement and investigate how the geodesic location and the sample size contribute, respectively. Experimental validations of the proposed strategy on four datasets, including CIFAR-100 and ImageNet, establish the efficacy of our method, e.g., our method improves the baselines' certifiable robustness on CIFAR10 up to 7.7%, with 16.8% on empirical robustness on CIFAR-100. Our work provides a new perspective of model robustness through the lens of Wasserstein geodesic-based interpolation with a practical off-the-shelf strategy that can be combined with existing robust training methods.
AB - We propose to study and promote the robustness of a model as per its performance through the interpolation of training data distributions. Specifically, (1) we augment the data by finding the worst-case Wasserstein barycenter on the geodesic connecting subpopulation distributions of different categories. (2) We regularize the model for smoother performance on the continuous geodesic path connecting subpopulation distributions. (3) Additionally, we provide a theoretical guarantee of robustness improvement and investigate how the geodesic location and the sample size contribute, respectively. Experimental validations of the proposed strategy on four datasets, including CIFAR-100 and ImageNet, establish the efficacy of our method, e.g., our method improves the baselines' certifiable robustness on CIFAR10 up to 7.7%, with 16.8% on empirical robustness on CIFAR-100. Our work provides a new perspective of model robustness through the lens of Wasserstein geodesic-based interpolation with a practical off-the-shelf strategy that can be combined with existing robust training methods.
UR - http://www.scopus.com/inward/record.url?scp=85174417844&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85174417844&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85174417844
SN - 2640-3498
VL - 202
SP - 43129
EP - 43157
JO - Proceedings of Machine Learning Research
JF - Proceedings of Machine Learning Research
Y2 - 23 July 2023 through 29 July 2023
ER -