TY - JOUR
T1 - DICON
T2 - Interactive visual analysis of multidimensional clusters
AU - Cao, Nan
AU - Gotz, David
AU - Sun, Jimeng
AU - Qu, Huamin
N1 - Funding Information:
We thank all the user study participants and doctors for their contributions to the system evaluation, Dr. Tim Dwyer for his help on the node overlap removing and the anonymous reviewers for their valuable comments. This work was supported in part by grant HK RGC GRF 619309 and an IBM Faculty Award.
PY - 2011/11/16
Y1 - 2011/11/16
N2 - Clustering as a fundamental data analysis technique has been widely used in many analytic applications. However, it is often difficult for users to understand and evaluate multidimensional clustering results, especially the quality of clusters and their semantics. For large and complex data, high-level statistical information about the clusters is often needed for users to evaluate cluster quality while a detailed display of multidimensional attributes of the data is necessary to understand the meaning of clusters. In this paper, we introduce DICON, an icon-based cluster visualization that embeds statistical information into a multi-attribute display to facilitate cluster interpretation, evaluation, and comparison. We design a treemap-like icon to represent a multidimensional cluster, and the quality of the cluster can be conveniently evaluated with the embedded statistical information. We further develop a novel layout algorithm which can generate similar icons for similar clusters, making comparisons of clusters easier. User interaction and clutter reduction are integrated into the system to help users more effectively analyze and refine clustering results for large datasets. We demonstrate the power of DICON through a user study and a case study in the healthcare domain. Our evaluation shows the benefits of the technique, especially in support of complex multidimensional cluster analysis.
AB - Clustering as a fundamental data analysis technique has been widely used in many analytic applications. However, it is often difficult for users to understand and evaluate multidimensional clustering results, especially the quality of clusters and their semantics. For large and complex data, high-level statistical information about the clusters is often needed for users to evaluate cluster quality while a detailed display of multidimensional attributes of the data is necessary to understand the meaning of clusters. In this paper, we introduce DICON, an icon-based cluster visualization that embeds statistical information into a multi-attribute display to facilitate cluster interpretation, evaluation, and comparison. We design a treemap-like icon to represent a multidimensional cluster, and the quality of the cluster can be conveniently evaluated with the embedded statistical information. We further develop a novel layout algorithm which can generate similar icons for similar clusters, making comparisons of clusters easier. User interaction and clutter reduction are integrated into the system to help users more effectively analyze and refine clustering results for large datasets. We demonstrate the power of DICON through a user study and a case study in the healthcare domain. Our evaluation shows the benefits of the technique, especially in support of complex multidimensional cluster analysis.
KW - Clustering
KW - Information Visualization
KW - Visual Analysis
UR - http://www.scopus.com/inward/record.url?scp=80955157762&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=80955157762&partnerID=8YFLogxK
U2 - 10.1109/TVCG.2011.188
DO - 10.1109/TVCG.2011.188
M3 - Article
C2 - 22034380
AN - SCOPUS:80955157762
SN - 1077-2626
VL - 17
SP - 2581
EP - 2590
JO - IEEE Transactions on Visualization and Computer Graphics
JF - IEEE Transactions on Visualization and Computer Graphics
IS - 12
M1 - 6065026
ER -