TY - JOUR
T1 - Achieving Exact Cluster Recovery Threshold via Semidefinite Programming
T2 - Extensions
AU - Hajek, Bruce
AU - Wu, Yihong
AU - Xu, Jiaming
N1 - Funding Information:
This work was supported in part by the National Science Foundation under Grant CCF 14-09106, Grant IIS-1447879, Grant NSF IOS 13-39388, and Grant CCF 14-23088, in part by the Strategic Research Initiative on Big-Data Analytics through the College of Engineering, University of Illinois, DOD ONR Grant N000141410823, and Grant 328025 from the Simons Foundation.
Publisher Copyright:
© 2016 IEEE.
PY - 2016/10
Y1 - 2016/10
N2 - Resolving a conjecture of Abbe, Bandeira, and Hall, the authors have recently shown that the semidefinite programming (SDP) relaxation of the maximum likelihood estimator achieves the sharp threshold for exactly recovering the community structure under the binary stochastic block model (SBM) of two equal-sized clusters. The same was shown for the case of a single cluster and outliers. Extending the proof techniques, in this paper, it is shown that SDP relaxations also achieve the sharp recovery threshold in the following cases: 1) binary SBM with two clusters of sizes proportional to network size but not necessarily equal; 2) SBM with a fixed number of equal-sized clusters; and 3) binary censored block model with the background graph being Erd's-Rényi. Furthermore, a sufficient condition is given for an SDP procedure to achieve exact recovery for the general case of a fixed number of clusters plus outliers. These results demonstrate the versatility of SDP relaxation as a simple, general purpose, computationally feasible methodology for community detection.
AB - Resolving a conjecture of Abbe, Bandeira, and Hall, the authors have recently shown that the semidefinite programming (SDP) relaxation of the maximum likelihood estimator achieves the sharp threshold for exactly recovering the community structure under the binary stochastic block model (SBM) of two equal-sized clusters. The same was shown for the case of a single cluster and outliers. Extending the proof techniques, in this paper, it is shown that SDP relaxations also achieve the sharp recovery threshold in the following cases: 1) binary SBM with two clusters of sizes proportional to network size but not necessarily equal; 2) SBM with a fixed number of equal-sized clusters; and 3) binary censored block model with the background graph being Erd's-Rényi. Furthermore, a sufficient condition is given for an SDP procedure to achieve exact recovery for the general case of a fixed number of clusters plus outliers. These results demonstrate the versatility of SDP relaxation as a simple, general purpose, computationally feasible methodology for community detection.
KW - Community detection
KW - Erd's-Rényi random graph
KW - Semidefinite programming
KW - Stochastic block model
UR - http://www.scopus.com/inward/record.url?scp=84988615482&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84988615482&partnerID=8YFLogxK
U2 - 10.1109/TIT.2016.2594812
DO - 10.1109/TIT.2016.2594812
M3 - Article
AN - SCOPUS:84988615482
SN - 0018-9448
VL - 62
SP - 5918
EP - 5937
JO - IEEE Transactions on Information Theory
JF - IEEE Transactions on Information Theory
IS - 10
M1 - 7523889
ER -