TY - GEN
T1 - Graph Sanitation with Application to Node Classification
AU - Xu, Zhe
AU - Du, Boxin
AU - Tong, Hanghang
N1 - This work was supported in part by the National Key R&D Program of China under Grant 2020YFB1406704, the National Natural Science Foundation of China under Grant 61972442, Grant 62102413, Grant U2001202, Grant 62025604, Grant 61876128, in part by the Key Research and Development Project of Hebei Province of China under Grant 20350802D and 20310802D; in part by the Natural Science Foundation of Hebei Province of China under Grant F2020202040, in part by the Natural Science Foundation of Tianjin of China under Grant 20JCYBJC00650, and in part by the China Postdoctoral Science Foundation under Grant 2021M703472.
PY - 2022/4/25
Y1 - 2022/4/25
N2 - The past decades have witnessed the prosperity of graph mining, with a multitude of sophisticated models and algorithms designed for various mining tasks, such as ranking, classification, clustering and anomaly detection. Generally speaking, the vast majority of the existing works aim to answer the following question, that is, given a graph, what is the best way to mine it? In this paper, we introduce the graph sanitation problem, to answer an orthogonal question. That is, given a mining task and an initial graph, what is the best way to improve the initially provided graph? By learning a better graph as part of the input of the mining model, it is expected to benefit graph mining in a variety of settings, ranging from denoising, imputation to defense. We formulate the graph sanitation problem as a bilevel optimization problem, and further instantiate it by semi-supervised node classification, together with an effective solver named GaSoliNe. Extensive experimental results demonstrate that the proposed method is (1) broadly applicable with respect to various graph neural network models and flexible graph modification strategies, (2) effective in improving the node classification accuracy on both the original and contaminated graphs in various perturbation scenarios. In particular, it brings up to 25% performance improvement over the existing robust graph neural network methods.
AB - The past decades have witnessed the prosperity of graph mining, with a multitude of sophisticated models and algorithms designed for various mining tasks, such as ranking, classification, clustering and anomaly detection. Generally speaking, the vast majority of the existing works aim to answer the following question, that is, given a graph, what is the best way to mine it? In this paper, we introduce the graph sanitation problem, to answer an orthogonal question. That is, given a mining task and an initial graph, what is the best way to improve the initially provided graph? By learning a better graph as part of the input of the mining model, it is expected to benefit graph mining in a variety of settings, ranging from denoising, imputation to defense. We formulate the graph sanitation problem as a bilevel optimization problem, and further instantiate it by semi-supervised node classification, together with an effective solver named GaSoliNe. Extensive experimental results demonstrate that the proposed method is (1) broadly applicable with respect to various graph neural network models and flexible graph modification strategies, (2) effective in improving the node classification accuracy on both the original and contaminated graphs in various perturbation scenarios. In particular, it brings up to 25% performance improvement over the existing robust graph neural network methods.
KW - graph mining
KW - graph sanitation
KW - node classification
UR - http://www.scopus.com/inward/record.url?scp=85129888404&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85129888404&partnerID=8YFLogxK
U2 - 10.1145/3485447.3512180
DO - 10.1145/3485447.3512180
M3 - Conference contribution
AN - SCOPUS:85129888404
T3 - WWW 2022 - Proceedings of the ACM Web Conference 2022
SP - 1136
EP - 1147
BT - WWW 2022 - Proceedings of the ACM Web Conference 2022
PB - Association for Computing Machinery
T2 - 31st ACM World Wide Web Conference, WWW 2022
Y2 - 25 April 2022 through 29 April 2022
ER -