TY - GEN
T1 - Towards heterogeneous temporal clinical event pattern discovery
T2 - 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2012
AU - Wang, Fei
AU - Lee, Noah
AU - Hu, Jianying
AU - Sun, Jimeng
AU - Ebadollahi, Shahram
PY - 2012/9/14
Y1 - 2012/9/14
N2 - Large collections of electronic clinical records today provide us with a vast source of information on medical practice. However, the utilization of those data for exploratory analysis to support clinical decisions is still limited. Extracting useful patterns from such data is particularly challenging because it is longitudinal, sparse and heterogeneous. In this paper, we propose a Nonnegative Matrix Factorization (NMF) based framework using a convolutional approach for open-ended temporal pattern discovery over large collections of clinical records. We call the method One-Sided Convolutional NMF (OSC-NMF). Our framework can mine common as well as individual shift-invariant temporal patterns from heterogeneous events over different patient groups, and handle sparsity as well as scalability problems well. Furthermore, we use an event matrix based representation that can encode quantitatively all key temporal concepts including order, concurrency and synchronicity. We derive efficient multiplicative update rules for OSC-NMF, and also prove theoretically its convergence. Finally, the experimental results on both synthetic and real world electronic patient data are presented to demonstrate the effectiveness of the proposed method.
AB - Large collections of electronic clinical records today provide us with a vast source of information on medical practice. However, the utilization of those data for exploratory analysis to support clinical decisions is still limited. Extracting useful patterns from such data is particularly challenging because it is longitudinal, sparse and heterogeneous. In this paper, we propose a Nonnegative Matrix Factorization (NMF) based framework using a convolutional approach for open-ended temporal pattern discovery over large collections of clinical records. We call the method One-Sided Convolutional NMF (OSC-NMF). Our framework can mine common as well as individual shift-invariant temporal patterns from heterogeneous events over different patient groups, and handle sparsity as well as scalability problems well. Furthermore, we use an event matrix based representation that can encode quantitatively all key temporal concepts including order, concurrency and synchronicity. We derive efficient multiplicative update rules for OSC-NMF, and also prove theoretically its convergence. Finally, the experimental results on both synthetic and real world electronic patient data are presented to demonstrate the effectiveness of the proposed method.
KW - convolution
KW - nmf
KW - pattern discovery
UR - http://www.scopus.com/inward/record.url?scp=84866045813&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84866045813&partnerID=8YFLogxK
U2 - 10.1145/2339530.2339605
DO - 10.1145/2339530.2339605
M3 - Conference contribution
AN - SCOPUS:84866045813
SN - 9781450314626
T3 - Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
SP - 453
EP - 461
BT - KDD'12 - 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
Y2 - 12 August 2012 through 16 August 2012
ER -