Your Causal Self-Attentive Recommender Hosts a Lonely Neighborhood

Yueqi Wang, Zhankui He, Zhenrui Yue, Julian McAuley, Dong Wang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In the context of sequential recommendation, a pivotal issue pertains to the comparative analysis between bi-directional/auto-encoding (AE) and uni-directional/auto-regressive (AR) attention mechanisms, where the conclusions regarding architectural and performance superiority remain inconclusive. Previous efforts in such comparisons primarily involve summarizing existing works to identify a consensus or conducting ablation studies on peripheral modeling techniques, such as choices of loss functions. However, far fewer efforts have been made in (1) theoretical and (2) extensive empirical analysis of the self-attention module, the very pivotal structure on which performance and designing insights should be anchored. In this work, we first provide a comprehensive theoretical analysis of AE/AR attention matrix in the aspect of (1) sparse local inductive bias, a.k.a neighborhood effects, and (2) low rank approximation. Analytical metrics reveal that the AR attention exhibits sparse neighborhood effects suitable for generally sparse recommendation scenarios. Secondly, to support our theoretical analysis, we conduct extensive empirical experiments on comparing AE/AR attention on five popular benchmarks with AR performing better overall. Empirical results reported are based on our experimental pipeline named Modularized Design Space for Self-Attentive Recommender (ModSAR), supporting adaptive hyperparameter tuning, modularized design space and Huggingface plug-ins. We invite the recommendation community to utilize/contribute to ModSAR to (1) conduct more module/model-level examining beyond AE/AR comparison and (2) accelerate state-of-the-art model design. Lastly, we shed light on future design choices for performant self-attentive recommenders. We make our pipeline implementation and data available at https://github.com/yueqirex/SAR-Check.

Original languageEnglish (US)
Title of host publicationWSDM 2025 - Proceedings of the 18th ACM International Conference on Web Search and Data Mining
PublisherAssociation for Computing Machinery
Pages688-696
Number of pages9
ISBN (Electronic)9798400713293
DOIs
StatePublished - Mar 10 2025
Event18th ACM International Conference on Web Search and Data Mining, WSDM 2025 - Hannover, Germany
Duration: Mar 10 2025Mar 14 2025

Publication series

NameWSDM 2025 - Proceedings of the 18th ACM International Conference on Web Search and Data Mining

Conference

Conference18th ACM International Conference on Web Search and Data Mining, WSDM 2025
Country/TerritoryGermany
CityHannover
Period3/10/253/14/25

Keywords

  • Auto-Encoding
  • Auto-Regression
  • BERT4Rec
  • Matrix Analysis
  • SASRec
  • Self-Attention
  • Sequential Recommendation

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Computer Science Applications
  • Software

Fingerprint

Dive into the research topics of 'Your Causal Self-Attentive Recommender Hosts a Lonely Neighborhood'. Together they form a unique fingerprint.

Cite this