TY - GEN
T1 - Toward Learning Human-aligned Cross-domain Robust Models by Countering Misaligned Features
AU - Wang, Haohan
AU - Huang, Zeyi
AU - Zhang, Hanlin
AU - Lee, Yong Jae
AU - Xing, Eric P.
N1 - This work was supported in part by NIH R01GM114311, NIH P30DA035778, NSF IIS1617583, NSF CAREER IIS-2150012 and IIS-2204808.
PY - 2022
Y1 - 2022
N2 - Machine learning has demonstrated remarkable prediction accuracy over i.i.d data, but the accuracy often drops when tested with data from another distribution. In this paper, we aim to offer another view of this problem in a perspective assuming the reason behind this accuracy drop is the reliance of models on the features that are not aligned well with how a data annotator considers similar across these two datasets. We refer to these features as misaligned features. We extend the conventional generalization error bound to a new one for this setup with the knowledge of how the misaligned features are associated with the label. Our analysis offers a set of techniques for this problem, and these techniques are naturally linked to many previous methods in robust machine learning literature. We also compared the empirical strength of these methods demonstrated the performance when these previous techniques are combined, with implementation available here.
AB - Machine learning has demonstrated remarkable prediction accuracy over i.i.d data, but the accuracy often drops when tested with data from another distribution. In this paper, we aim to offer another view of this problem in a perspective assuming the reason behind this accuracy drop is the reliance of models on the features that are not aligned well with how a data annotator considers similar across these two datasets. We refer to these features as misaligned features. We extend the conventional generalization error bound to a new one for this setup with the knowledge of how the misaligned features are associated with the label. Our analysis offers a set of techniques for this problem, and these techniques are naturally linked to many previous methods in robust machine learning literature. We also compared the empirical strength of these methods demonstrated the performance when these previous techniques are combined, with implementation available here.
UR - http://www.scopus.com/inward/record.url?scp=85146150490&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85146150490&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85146150490
T3 - Proceedings of the 38th Conference on Uncertainty in Artificial Intelligence, UAI 2022
SP - 2075
EP - 2084
BT - Proceedings of the 38th Conference on Uncertainty in Artificial Intelligence, UAI 2022
PB - Association For Uncertainty in Artificial Intelligence (AUAI)
T2 - 38th Conference on Uncertainty in Artificial Intelligence, UAI 2022
Y2 - 1 August 2022 through 5 August 2022
ER -