TY - GEN
T1 - User modeling in search logs via a nonparametric Bayesian approach
AU - Wang, Hongning
AU - Zhai, Cheng Xiang
AU - Liang, Feng
AU - Dong, Anlei
AU - Chang, Yi
PY - 2014
Y1 - 2014
N2 - Searchers' information needs are diverse and cover a broad range of topics; hence, it is important for search engines to accurately understand each individual user's search intents in order to provide optimal search results. Search log data, which records users' search behaviors when interacting with search engines, provides a valuable source of information about users' search intents. Therefore, properly characterizing the heterogeneity among the users' observed search behaviors is the key to accurately understanding their search intents and to further predicting their behaviors. In this work, we study the problem of user modeling in the search log data and propose a generative model, dpRank, within a non-parametric Bayesian framework. By postulating generative assumptions about a user's search behaviors, dpRank identifies each individual user's latent search interests and his/her distinct result preferences in a joint manner. Experimental results on a large-scale news search log data set validate the effectiveness of the proposed approach, which not only provides in-depth understanding of a user's search intents but also benefits a variety of personalized applications.
AB - Searchers' information needs are diverse and cover a broad range of topics; hence, it is important for search engines to accurately understand each individual user's search intents in order to provide optimal search results. Search log data, which records users' search behaviors when interacting with search engines, provides a valuable source of information about users' search intents. Therefore, properly characterizing the heterogeneity among the users' observed search behaviors is the key to accurately understanding their search intents and to further predicting their behaviors. In this work, we study the problem of user modeling in the search log data and propose a generative model, dpRank, within a non-parametric Bayesian framework. By postulating generative assumptions about a user's search behaviors, dpRank identifies each individual user's latent search interests and his/her distinct result preferences in a joint manner. Experimental results on a large-scale news search log data set validate the effectiveness of the proposed approach, which not only provides in-depth understanding of a user's search intents but also benefits a variety of personalized applications.
KW - non-parametric bayesian
KW - search log mining
KW - user modeling
UR - http://www.scopus.com/inward/record.url?scp=84906847450&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84906847450&partnerID=8YFLogxK
U2 - 10.1145/2556195.2556262
DO - 10.1145/2556195.2556262
M3 - Conference contribution
AN - SCOPUS:84906847450
SN - 9781450323512
T3 - WSDM 2014 - Proceedings of the 7th ACM International Conference on Web Search and Data Mining
SP - 203
EP - 212
BT - WSDM 2014 - Proceedings of the 7th ACM International Conference on Web Search and Data Mining
PB - Association for Computing Machinery
T2 - 7th ACM International Conference on Web Search and Data Mining, WSDM 2014
Y2 - 24 February 2014 through 28 February 2014
ER -