Large Subgraph Matching: A Comprehensive and Efficient Approach for Heterogeneous Graphs

Hongtai Cao, Qihao Wang, Xiaodong Li, Matin Najafi, Kevin Chen Chuan Chang, Reynold Cheng

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The subgraph matching problem is crucial in graph analysis, involving identifying all instances of a given pattern P within a graph G. Advances in this field aim to uncover larger patterns across diverse graph types and subgraph matching tasks. However, existing methods often prove inefficient for such tasks. To address this gap, we propose CSCE, which generates efficient plans for various problem settings. CSCE utilizes clustered compressed sparse rows for heterogeneous graphs and sequential candidate equivalence to reduce redundant computations. Moreover, our approach seamlessly supports different subgraph matching variants, such as edge-induced, vertex-induced, and homomorphic scenarios. Experiments show that our work is up to two orders of magnitude faster than the state of the art on graphs of millions scale.

Original languageEnglish (US)
Title of host publicationProceedings - 2024 IEEE 40th International Conference on Data Engineering, ICDE 2024
PublisherIEEE Computer Society
Pages2972-2985
Number of pages14
ISBN (Electronic)9798350317152
DOIs
StatePublished - 2024
Event40th IEEE International Conference on Data Engineering, ICDE 2024 - Utrecht, Netherlands
Duration: May 13 2024May 17 2024

Publication series

NameProceedings - International Conference on Data Engineering
ISSN (Print)1084-4627
ISSN (Electronic)2375-0286

Conference

Conference40th IEEE International Conference on Data Engineering, ICDE 2024
Country/TerritoryNetherlands
CityUtrecht
Period5/13/245/17/24

Keywords

  • clustering
  • edge induce
  • equivalence
  • heterogeneous graphs
  • homomorphism
  • subgraph matching
  • vertex induce

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Information Systems

Fingerprint

Dive into the research topics of 'Large Subgraph Matching: A Comprehensive and Efficient Approach for Heterogeneous Graphs'. Together they form a unique fingerprint.

Cite this