BiTe-GCN: A New GCN Architecture via Bidirectional Convolution of Topology and Features on Text-Rich Networks

Di Jin, Xiangchen Song, Zhizhi Yu, Ziyang Liu, Heling Zhang, Zhaomeng Cheng, Jiawei Han

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Graph convolutional networks (GCNs), aiming to obtain node embeddings by integrating high-order neighborhood information through stacked graph convolution layers, have demonstrated great power in many network analysis tasks such as node classification and link prediction. However, a fundamental weakness of GCNs, that is, topological limitations, including over-smoothing and local homophily of topology, limits their ability to represent networks. Existing studies for solving these topological limitations typically focus only on the convolution of features on network topology, which inevitably relies heavily on network structure. Moreover, most networks are text-rich, so it is important to integrate not only document-level information, but also the local text information which is particularly significant while often ignored by the existing methods. To solve these limitations, we propose BiTe-GCN, a novel GCN architecture modeling via bidirectional convolution of topology and features on text-rich networks. Specifically, we first transform the original text-rich network into an augmented bi-typed heterogeneous network, capturing both the global document-level information and the local text-sequence information from texts. We then introduce discriminative convolution mechanisms, which performs convolution on this augmented bi-typed network, realizing the convolutions of topology and features altogether in the same system, and learning different contributions of these two parts (i.e., network part and text part), automatically for the given learning objectives. Extensive experiments on text-rich networks demonstrate that our new architecture outperforms the state-of-the-arts by a breakout improvement. Moreover, this architecture can also be applied to several e-commerce search scenes such as JD searching, and experiments on JD dataset show the superiority of the proposed architecture over the related methods.

Original languageEnglish (US)
Title of host publicationWSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining
PublisherAssociation for Computing Machinery
Pages157-165
Number of pages9
ISBN (Electronic)9781450382977
DOIs
StatePublished - Aug 3 2021
Event14th ACM International Conference on Web Search and Data Mining, WSDM 2021 - Virtual, Online, Israel
Duration: Mar 8 2021Mar 12 2021

Publication series

NameWSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining

Conference

Conference14th ACM International Conference on Web Search and Data Mining, WSDM 2021
Country/TerritoryIsrael
CityVirtual, Online
Period3/8/213/12/21

Keywords

  • bidirectional convolution
  • graph convolutional networks
  • text-rich networks

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Computer Science Applications
  • Software

Fingerprint

Dive into the research topics of 'BiTe-GCN: A New GCN Architecture via Bidirectional Convolution of Topology and Features on Text-Rich Networks'. Together they form a unique fingerprint.

Cite this