TY - JOUR
T1 - Graph Mixup on Approximate Gromov-Wasserstein Geodesics
AU - Zeng, Zhichen
AU - Qiu, Ruizhong
AU - Xu, Zhe
AU - Liu, Zhining
AU - Yan, Yuchen
AU - Wei, Tianxin
AU - Ying, Lei
AU - He, Jingrui
AU - Tong, Hanghang
N1 - This work is supported by NSF (2134079, 2316233, 2324770, and 2117902), DARPA (HR001121C0165), NIFA (2020-67021-32799), DHS (17STQAC00001-07-00), AFOSR (FA9550-24-1-0002), the C3.ai Digital Transformation Institute, and IBM-Illinois Discovery Accelerator Institute.The content of the information in this document does not necessarily reflect the position or the policy of the Government, and no official endorsement should be inferred.The U.S.Government is authorized to reproduce and distribute reprints for Government purposes notwithstanding any copyright notation here on.
PY - 2024
Y1 - 2024
N2 - Mixup, which generates synthetic training samples on the data manifold, has been shown to be highly effective in augmenting Euclidean data.However, finding a proper data manifold for graph data is non-trivial, as graphs are non-Euclidean data in disparate spaces.Though efforts have been made, most of the existing graph mixup methods neglect the intrinsic geodesic guarantee, thereby generating inconsistent sample-label pairs.To address this issue, we propose GEOMIX to mixup graphs on the Gromov-Wasserstein (GW) geodesics.A joint space over input graphs is first defined based on the GW distance, and graphs are then transformed into the GW space through equivalence-preserving transformations.We further show that the linear interpolation of the transformed graph pairs defines a geodesic connecting the original pairs on the GW manifold, hence ensuring the consistency between generated samples and labels.An accelerated mixup algorithm on the approximate low-dimensional GW manifold is further proposed.Extensive experiments show that the proposed GEOMIX promotes the generalization and robustness of GNN models.
AB - Mixup, which generates synthetic training samples on the data manifold, has been shown to be highly effective in augmenting Euclidean data.However, finding a proper data manifold for graph data is non-trivial, as graphs are non-Euclidean data in disparate spaces.Though efforts have been made, most of the existing graph mixup methods neglect the intrinsic geodesic guarantee, thereby generating inconsistent sample-label pairs.To address this issue, we propose GEOMIX to mixup graphs on the Gromov-Wasserstein (GW) geodesics.A joint space over input graphs is first defined based on the GW distance, and graphs are then transformed into the GW space through equivalence-preserving transformations.We further show that the linear interpolation of the transformed graph pairs defines a geodesic connecting the original pairs on the GW manifold, hence ensuring the consistency between generated samples and labels.An accelerated mixup algorithm on the approximate low-dimensional GW manifold is further proposed.Extensive experiments show that the proposed GEOMIX promotes the generalization and robustness of GNN models.
UR - http://www.scopus.com/inward/record.url?scp=85203817377&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85203817377&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85203817377
SN - 2640-3498
VL - 235
SP - 58387
EP - 58406
JO - Proceedings of Machine Learning Research
JF - Proceedings of Machine Learning Research
T2 - 41st International Conference on Machine Learning, ICML 2024
Y2 - 21 July 2024 through 27 July 2024
ER -