TY - GEN
T1 - Construction and analysis of web-based computer science information networks
AU - Han, Jiawei
N1 - Funding Information: Acknowledgements. The work was supported in part by the U.S. National Science Foundation grants IIS-09-05215, the Network Science Collaborative Technology Alliance Program (NS-CTA) of U.S. Army Research Lab (ARL) under contract number W911NF-09-2-0053, and the Air Force Office of Scientific Research MURI award FA9550-08-1-0265. The author would like to express his sincere thanks to all the WINACS project group and the Ph.D. students in the Data Mining Group of CS, UIUC for their dedication and contribution.
PY - 2011
Y1 - 2011
AB - With the rapid development of the Web, huge amounts of information are available on the Web in the form of Web documents, structures, and links. It has been a dream of the database and Web communities to harvest information exhibited on the Web and reconcile the unstructured nature of the Web with the semi-structured schemas of the database paradigm. This is a challenging task. Even though databases are currently used to generate Web content in some sites, the schemas of these databases are rarely consistent across a domain. However, with the recent research in Web structure mining and information network analysis, major progress has been made at discovering Web hidden structures, constructing heterogeneous information networks by integration of information from structured databases and Web contents, and performing in-depth analysis for systematic harvesting of such rich information on the Web.
UR - http://www.scopus.com/inward/record.url?scp=79960291608&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79960291608&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-21881-1_1
DO - 10.1007/978-3-642-21881-1_1
M3 - Conference contribution
AN - SCOPUS:79960291608
SN - 9783642218804
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 1
EP - 2
BT - Rough Sets, Fuzzy Sets, Data Mining and Granular Computing - 13th International Conference, RSFDGrC 2011, Proceedings
T2 - 13th International Conference on Rough Sets, Fuzzy Sets and Granular Computing, RSFDGrC 2011
Y2 - 25 June 2011 through 27 June 2011
ER -