TY - JOUR
T1 - Graph Learning Indexer: A Contributor-Friendly and Metadata-Rich Platform for Graph Learning Benchmarks
AU - Ma, Jiaqi
AU - Zhang, Xingjian
AU - Fan, Hezheng
AU - Huang, Jin
AU - Li, Tianyue
AU - Li, Ting Wei
AU - Tu, Yiwen
AU - Zhu, Chenshu
AU - Mei, Qiaozhu
N1 - The authors would like to thank Danai Koutra, Anton Tsitsulin, ChengXiang Zhai, and Jiong Zhu for helpful discussions, all the participants in the GLB 2021 and GLB 2022 workshops for motivating this project, and anonymous reviewers at LOG 2022 for constructive suggestions. This work was in part supported by the National Science Foundation under grant number 1633370.
PY - 2022/9/1
Y1 - 2022/9/1
N2 - Establishing open and general benchmarks has been a critical driving force behind the success of modern machine learning techniques. As machine learning is being applied to broader domains and tasks, there is a need to establish richer and more diverse benchmarks to better reflect the reality of the application scenarios. Graph learning is an emerging field of machine learning that urgently needs more and better benchmarks. To accommodate the need, we introduce Graph Learning Indexer (GLI), a benchmark curation platform for graph learning. In comparison to existing graph learning benchmark libraries, GLI highlights two novel design objectives. First, GLI is designed to incentivize dataset contributors. In particular, we incorporate various measures to minimize the effort of contributing and maintaining a dataset, increase the usability of the contributed dataset, as well as encourage attributions to different contributors of the dataset. Second, GLI is designed to curate a knowledge base, instead of a plain collection, of benchmark datasets. We use multiple sources of meta information to augment the benchmark datasets with rich characteristics, so that they can be easily selected and used in downstream research or development. The source code of GLI is available at https://github.com/Graph-Learning-Benchmarks/gli.
AB - Establishing open and general benchmarks has been a critical driving force behind the success of modern machine learning techniques. As machine learning is being applied to broader domains and tasks, there is a need to establish richer and more diverse benchmarks to better reflect the reality of the application scenarios. Graph learning is an emerging field of machine learning that urgently needs more and better benchmarks. To accommodate the need, we introduce Graph Learning Indexer (GLI), a benchmark curation platform for graph learning. In comparison to existing graph learning benchmark libraries, GLI highlights two novel design objectives. First, GLI is designed to incentivize dataset contributors. In particular, we incorporate various measures to minimize the effort of contributing and maintaining a dataset, increase the usability of the contributed dataset, as well as encourage attributions to different contributors of the dataset. Second, GLI is designed to curate a knowledge base, instead of a plain collection, of benchmark datasets. We use multiple sources of meta information to augment the benchmark datasets with rich characteristics, so that they can be easily selected and used in downstream research or development. The source code of GLI is available at https://github.com/Graph-Learning-Benchmarks/gli.
UR - http://www.scopus.com/inward/record.url?scp=85164536800&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85164536800&partnerID=8YFLogxK
M3 - Conference article
SN - 2640-3498
VL - 198
SP - 7:1-7:23
JO - Proceedings of Machine Learning Research
JF - Proceedings of Machine Learning Research
T2 - 1st Learning on Graphs Conference, LOG 2022
Y2 - 9 December 2022 through 12 December 2022
ER -