TY - GEN
T1 - PLUMS
T2 - SIAM International Conference on Data Mining 2015, SDM 2015
AU - Subbian, Karthik
AU - Banerjee, Arindam
AU - Basu, Sugato
N1 - Funding Information:
Acknowledgements: The research was supported in part by NSF grants IIS-1447566, IIS-1422557, CCF-1451986, CNS-1314560, IIS-0953274, IIS-1029711, and by NASA grant NNX12AQ39A, DARPA grant W911NF-12-C-0028, and an IBM Ph.D. fellowship award. Arindam Banerjee also acknowledges the generous support from IBM and Yahoo. The authors thank the anonymous reviewers for their valuable comments.
Publisher Copyright:
Copyright © SIAM.
PY - 2015
Y1 - 2015
AB - Link prediction is an important problem in online social and collaboration networks, where it is used to recommend friends and future collaborators. Most existing approaches build unsupervised or supervised classification models, depending on whether accepts and rejects of past recommendations are available. Several of these methods are feature-based and construct a large number of network-level features to make prediction more effective. A more flexible approach is to let the model learn the required features from the network for a specific task, rather than relying on explicit feature engineering. In addition, most social and collaboration relationships do not form instantly but build slowly over time through several low-cost interactions, such as email and chat. Existing approaches often ignore such auxiliary networks, which could make link prediction more robust and effective. The main focus of this work is to build a robust and effective classifier for link prediction using multiple auxiliary networks. We develop a supervised random walk model that does not require any explicit feature construction and can be personalized to each user based on past accept and reject behavior. Our approach consistently outperforms several popular baselines in terms of precision and recall on multiple real-life data sets. Our approach is also robust to noise and sparsity in the auxiliary networks, whereas several popular baselines, specifically feature-based ones, perform inconsistently under such conditions.
UR - http://www.scopus.com/inward/record.url?scp=84961907068&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84961907068&partnerID=8YFLogxK
U2 - 10.1137/1.9781611974010.42
DO - 10.1137/1.9781611974010.42
M3 - Conference contribution
AN - SCOPUS:84961907068
T3 - SIAM International Conference on Data Mining 2015, SDM 2015
SP - 370
EP - 378
BT - SIAM International Conference on Data Mining 2015, SDM 2015
A2 - Venkatasubramanian, Suresh
A2 - Ye, Jieping
PB - Society for Industrial and Applied Mathematics Publications
Y2 - 30 April 2015 through 2 May 2015
ER -