TY - GEN
T1 - Research-insight
T2 - 2013 ACM SIGMOD Conference on Management of Data, SIGMOD 2013
AU - Tao, Fangbo
AU - Yu, Xiao
AU - Lei, Kin Hou
AU - Brova, George
AU - Cheng, Xiao
AU - Han, Jiawei
AU - Kanade, Rucha
AU - Sun, Yizhou
AU - Wang, Chi
AU - Wang, Lidan
AU - Weninger, Tim
PY - 2013
Y1 - 2013
N2 - A database contains rich, inter-related, multi-typed data and information, forming one or a set of gigantic, interconnected, heterogeneous information networks. Much knowledge can be derived from such information networks if we systematically develop an effective and scalable database-oriented information network analysis technology. In this system demo, we take a computer science research publication network as an example, which is an information network derived from an integration of DBLP, other web-based information about researchers, and partially available citation data, and construct a Research-Insight system in order to demonstrate the power of database-oriented information network analysis. We show that nontrivial research insight can be obtained from such analysis, including (1) ranking, clustering, classification and similarity search of researchers, terms and venues for research subfields and themes, (2) re Commending good researchers and good research papers to read or cite when conducting research on certain topics, (3) predicting potential collaborators for certain theme-oriented research, and (4) predicting advisor-advisee relationships and affiliation history based on historical research publications. Although some of these functions have been studied in recent research, effective and scalable realization of such functions in large networks still poses challenging research problems. Moreover, some function are our ongoing research tasks. By integrating these functionalities, Research-Insight may not only provide with us insightful re Commendations in CS research but also help us gain insight on how to perform effective data mining in large databases.
AB - A database contains rich, inter-related, multi-typed data and information, forming one or a set of gigantic, interconnected, heterogeneous information networks. Much knowledge can be derived from such information networks if we systematically develop an effective and scalable database-oriented information network analysis technology. In this system demo, we take a computer science research publication network as an example, which is an information network derived from an integration of DBLP, other web-based information about researchers, and partially available citation data, and construct a Research-Insight system in order to demonstrate the power of database-oriented information network analysis. We show that nontrivial research insight can be obtained from such analysis, including (1) ranking, clustering, classification and similarity search of researchers, terms and venues for research subfields and themes, (2) re Commending good researchers and good research papers to read or cite when conducting research on certain topics, (3) predicting potential collaborators for certain theme-oriented research, and (4) predicting advisor-advisee relationships and affiliation history based on historical research publications. Although some of these functions have been studied in recent research, effective and scalable realization of such functions in large networks still poses challenging research problems. Moreover, some function are our ongoing research tasks. By integrating these functionalities, Research-Insight may not only provide with us insightful re Commendations in CS research but also help us gain insight on how to perform effective data mining in large databases.
KW - Heterogeneous information network
KW - Re Commendation system
UR - http://www.scopus.com/inward/record.url?scp=84880556025&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84880556025&partnerID=8YFLogxK
U2 - 10.1145/2463676.2463689
DO - 10.1145/2463676.2463689
M3 - Conference contribution
AN - SCOPUS:84880556025
SN - 9781450320375
T3 - Proceedings of the ACM SIGMOD International Conference on Management of Data
SP - 1093
EP - 1096
BT - SIGMOD 2013 - International Conference on Management of Data
Y2 - 22 June 2013 through 27 June 2013
ER -