TY - JOUR
T1 - Multidimensional analysis of atypical events in cyber-physical data
AU - Tang, Lu An
AU - Yu, Xiao
AU - Kim, Sangkyum
AU - Han, Jiawei
AU - Peng, Wen Chih
AU - Sun, Yizhou
AU - Gonzalez, Hector
AU - Seith, Sebastian
N1 - Copyright:
Copyright 2012 Elsevier B.V., All rights reserved.
PY - 2012
Y1 - 2012
N2 - A Cyber-Physical System (CPS) integrates physical devices (e.g., sensors, cameras) with cyber (or informational) components to form a situation-integrated analytical system that may respond intelligently to dynamic changes of the real-world situations. CPS claims many promising applications, such as traffic observation, battlefield surveillance and sensor-network based monitoring. One important research topic in CPS is about the atypical event analysis, i.e., retrieving the events from large amount of data and analyzing them with spatial, temporal and other multi-dimensional information. Many traditional approaches are not feasible for such analysis since they use numeric measures and cannot describe the complex atypical events. In this study, we propose a new model of atypical cluster to effectively represent those events and efficiently retrieve them from massive data. The micro-cluster is designed to summarize individual events, and the macro-cluster is used to integrate the information from multiple event. To facilitate scalable, flexible and online analysis, the concept of significant cluster is defined and a guided clustering algorithm is proposed to retrieve significant clusters in an efficient manner. We conduct experiments on real datasets with the size of more than 50 GB, the results show that the proposed method can provide more accurate information with only 15% to 20% time cost of the baselines.
AB - A Cyber-Physical System (CPS) integrates physical devices (e.g., sensors, cameras) with cyber (or informational) components to form a situation-integrated analytical system that may respond intelligently to dynamic changes of the real-world situations. CPS claims many promising applications, such as traffic observation, battlefield surveillance and sensor-network based monitoring. One important research topic in CPS is about the atypical event analysis, i.e., retrieving the events from large amount of data and analyzing them with spatial, temporal and other multi-dimensional information. Many traditional approaches are not feasible for such analysis since they use numeric measures and cannot describe the complex atypical events. In this study, we propose a new model of atypical cluster to effectively represent those events and efficiently retrieve them from massive data. The micro-cluster is designed to summarize individual events, and the macro-cluster is used to integrate the information from multiple event. To facilitate scalable, flexible and online analysis, the concept of significant cluster is defined and a guided clustering algorithm is proposed to retrieve significant clusters in an efficient manner. We conduct experiments on real datasets with the size of more than 50 GB, the results show that the proposed method can provide more accurate information with only 15% to 20% time cost of the baselines.
UR - http://www.scopus.com/inward/record.url?scp=84864248652&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84864248652&partnerID=8YFLogxK
U2 - 10.1109/ICDE.2012.32
DO - 10.1109/ICDE.2012.32
M3 - Conference article
AN - SCOPUS:84864248652
SN - 1084-4627
SP - 1025
EP - 1036
JO - Proceedings - International Conference on Data Engineering
JF - Proceedings - International Conference on Data Engineering
M1 - 6228153
T2 - IEEE 28th International Conference on Data Engineering, ICDE 2012
Y2 - 1 April 2012 through 5 April 2012
ER -