TY - GEN
T1 - Easing embedding learning by comprehensive transcription of heterogeneous information networks
AU - Shi, Yu
AU - Zhu, Qi
AU - Guo, Fang
AU - Zhang, Chao
AU - Han, Jiawei
N1 - Publisher Copyright:
© 2018 Copyright held by the owner/author(s). Publication rights licensed to ACM.
PY - 2018/7/19
Y1 - 2018/7/19
N2 - Heterogeneous information networks (HINs) are ubiquitous in real-world applications. In the meantime, network embedding has emerged as a convenient tool to mine and learn from networked data. As a result, it is of interest to develop HIN embedding methods. However, the heterogeneity in HINs introduces not only rich information but also potentially incompatible semantics, which poses special challenges to embedding learning in HINs. With the intention to preserve the rich yet potentially incompatible information in HIN embedding, we propose to study the problem of comprehensive transcription of heterogeneous information networks. e comprehensive transcription of HINs also provides an easy-to-use approach to unleash the power of HINs, since it requires no additional supervision, expertise, or feature engineering. To cope with the challenges in the comprehensive transcription of HINs, we propose the HEER algorithm, which embeds HINs via edge representations that are further coupled with properly-learned heterogeneous metrics. To corroborate the ecacy of HEER, we conducted experiments on two large-scale real-words datasets with an edge reconstruction task and multiple case studies. Experiment results demonstrate the eectiveness of the proposed HEER model and the utility of edge representations and heterogeneous metrics. e code and data are available at hps://github.com/GentleZhu/HEER.
AB - Heterogeneous information networks (HINs) are ubiquitous in real-world applications. In the meantime, network embedding has emerged as a convenient tool to mine and learn from networked data. As a result, it is of interest to develop HIN embedding methods. However, the heterogeneity in HINs introduces not only rich information but also potentially incompatible semantics, which poses special challenges to embedding learning in HINs. With the intention to preserve the rich yet potentially incompatible information in HIN embedding, we propose to study the problem of comprehensive transcription of heterogeneous information networks. e comprehensive transcription of HINs also provides an easy-to-use approach to unleash the power of HINs, since it requires no additional supervision, expertise, or feature engineering. To cope with the challenges in the comprehensive transcription of HINs, we propose the HEER algorithm, which embeds HINs via edge representations that are further coupled with properly-learned heterogeneous metrics. To corroborate the ecacy of HEER, we conducted experiments on two large-scale real-words datasets with an edge reconstruction task and multiple case studies. Experiment results demonstrate the eectiveness of the proposed HEER model and the utility of edge representations and heterogeneous metrics. e code and data are available at hps://github.com/GentleZhu/HEER.
KW - Graph mining
KW - Heterogeneous information networks
KW - Network embedding
KW - Representation learning
UR - http://www.scopus.com/inward/record.url?scp=85051466590&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85051466590&partnerID=8YFLogxK
U2 - 10.1145/3219819.3220006
DO - 10.1145/3219819.3220006
M3 - Conference contribution
AN - SCOPUS:85051466590
SN - 9781450355520
T3 - Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
SP - 2190
EP - 2199
BT - KDD 2018 - Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
PB - Association for Computing Machinery
T2 - 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2018
Y2 - 19 August 2018 through 23 August 2018
ER -