TY - GEN
T1 - Linear Recurrent Units for Sequential Recommendation
AU - Yue, Zhenrui
AU - Wang, Yueqi
AU - He, Zhankui
AU - Zeng, Huimin
AU - McAuley, Julian
AU - Wang, Dong
N1 - Publisher Copyright:
© 2024 Owner/Author.
PY - 2024/3/4
Y1 - 2024/3/4
N2 - State-of-the-art sequential recommendation relies heavily on self-attention-based recommender models. Yet such models are computationally expensive and often too slow for real-time recommendation. Furthermore, the self-attention operation is performed at the sequence level, thereby making low-cost incremental inference challenging. Inspired by recent advances in efficient language modeling, we propose linear recurrent units for sequential recommendation (LRURec). Similar to recurrent neural networks, LRURec offers rapid inference and can perform incremental inference on sequential inputs. By decomposing the linear recurrence operation and designing recursive parallelization in our framework, LRURec provides the additional benefits of reduced model size and parallelizable training. Moreover, we optimize the architecture of LRURec by implementing a series of modifications to address the lack of non-linearity and improve training dynamics. To validate the effectiveness of our proposed LRURec, we conduct extensive experiments on multiple real-world datasets and compare its performance against state-of-the-art sequential recommenders. Experimental results demonstrate the effectiveness of LRURec, which consistently outperforms baselines by a significant margin. Results also highlight the efficiency of LRURec with our parallelized training paradigm and fast inference on long sequences, showing its potential to further enhance user experience in sequential recommendation.
AB - State-of-the-art sequential recommendation relies heavily on self-attention-based recommender models. Yet such models are computationally expensive and often too slow for real-time recommendation. Furthermore, the self-attention operation is performed at the sequence level, thereby making low-cost incremental inference challenging. Inspired by recent advances in efficient language modeling, we propose linear recurrent units for sequential recommendation (LRURec). Similar to recurrent neural networks, LRURec offers rapid inference and can perform incremental inference on sequential inputs. By decomposing the linear recurrence operation and designing recursive parallelization in our framework, LRURec provides the additional benefits of reduced model size and parallelizable training. Moreover, we optimize the architecture of LRURec by implementing a series of modifications to address the lack of non-linearity and improve training dynamics. To validate the effectiveness of our proposed LRURec, we conduct extensive experiments on multiple real-world datasets and compare its performance against state-of-the-art sequential recommenders. Experimental results demonstrate the effectiveness of LRURec, which consistently outperforms baselines by a significant margin. Results also highlight the efficiency of LRURec with our parallelized training paradigm and fast inference on long sequences, showing its potential to further enhance user experience in sequential recommendation.
KW - recommender systems
KW - sequential recommendation
UR - http://www.scopus.com/inward/record.url?scp=85191721224&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85191721224&partnerID=8YFLogxK
U2 - 10.1145/3616855.3635760
DO - 10.1145/3616855.3635760
M3 - Conference contribution
AN - SCOPUS:85191721224
T3 - WSDM 2024 - Proceedings of the 17th ACM International Conference on Web Search and Data Mining
SP - 930
EP - 938
BT - WSDM 2024 - Proceedings of the 17th ACM International Conference on Web Search and Data Mining
PB - Association for Computing Machinery
T2 - 17th ACM International Conference on Web Search and Data Mining, WSDM 2024
Y2 - 4 March 2024 through 8 March 2024
ER -