TY - GEN
T1 - A spectral framework for detecting inconsistency across multi-source object relationships
AU - Gao, Jing
AU - Fan, Wei
AU - Turaga, Deepak
AU - Parthasarathy, Srinivasan
AU - Han, Jiawei
PY - 2011
Y1 - 2011
N2 - In this paper, we propose to conduct anomaly detection across multiple sources to identify objects that have inconsistent behavior across these sources.We assume that a set of objects can be described from various perspectives (multiple information sources). The underlying clustering structure of normal objects is usually shared by multiple sources. However, anomalous objects belong to different clusters when considering different aspects. For example, there exist movies that are expected to be liked by kids by genre, but are liked by grown-ups based on user viewing history. To identify such objects, we propose to compute the distance between different eigen decomposition results of the same object with respect to different sources as its anomalous score. We also give interpretations from the perspectives of constrained spectral clustering and random walks over graph. Experimental results on several UCI as well as DBLP and MovieLens datasets demonstrate the effectiveness of the proposed approach.
AB - In this paper, we propose to conduct anomaly detection across multiple sources to identify objects that have inconsistent behavior across these sources.We assume that a set of objects can be described from various perspectives (multiple information sources). The underlying clustering structure of normal objects is usually shared by multiple sources. However, anomalous objects belong to different clusters when considering different aspects. For example, there exist movies that are expected to be liked by kids by genre, but are liked by grown-ups based on user viewing history. To identify such objects, we propose to compute the distance between different eigen decomposition results of the same object with respect to different sources as its anomalous score. We also give interpretations from the perspectives of constrained spectral clustering and random walks over graph. Experimental results on several UCI as well as DBLP and MovieLens datasets demonstrate the effectiveness of the proposed approach.
KW - Anomaly detection
KW - Multiple information sources
KW - Spectral methods
UR - http://www.scopus.com/inward/record.url?scp=84857165495&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84857165495&partnerID=8YFLogxK
U2 - 10.1109/ICDM.2011.16
DO - 10.1109/ICDM.2011.16
M3 - Conference contribution
AN - SCOPUS:84857165495
SN - 9780769544083
T3 - Proceedings - IEEE International Conference on Data Mining, ICDM
SP - 1050
EP - 1055
BT - Proceedings - 11th IEEE International Conference on Data Mining, ICDM 2011
T2 - 11th IEEE International Conference on Data Mining, ICDM 2011
Y2 - 11 December 2011 through 14 December 2011
ER -