TY - JOUR
T1 - Identifying predictive multi-dimensional time series motifs
T2 - An application to severe weather prediction
AU - McGovern, Amy
AU - Rosendahl, Derek H.
AU - Brown, Rodger A.
AU - Droegemeier, Kelvin K.
N1 - Funding Information:
Acknowledgements This material is based upon work supported by the National Science Foundation under Grant No. REU/0453545, IIS/REU/0755462, IIS/CAREER/0746816 and corresponding REU Supplements IIS/0840956 and IIS/0938138, the NSF ERC Center for Collaborative Adaptive Sensing of the Atmosphere (CASA, NSF ERC 0313747), and the University of Oklahoma’s College of Engineering. We would also like to thank Nathan Hiers, Adrianna Kruger, and Meredith Beaton for their preliminary work on this data. The motion capture data used in this project was obtained from mocap.cs.cmu.edu and their database was created with funding from NSF EIA-0196217.
PY - 2011/1
Y1 - 2011/1
N2 - We introduce an efficient approach to mining multi-dimensional temporal streams of real-world data for ordered temporal motifs that can be used for prediction. Since many of the dimensions of the data are known or suspected to be irrelevant, our approach first identifies the salient dimensions of the data, then the key temporal motifs within each dimension, and finally the temporal ordering of the motifs necessary for prediction. For the prediction element, the data are assumed to be labeled. We tested the approach on two real-world data sets. To verify the generality of the approach, we validated the application on several subjects from the CMU Motion Capture database. Our main application uses several hundred numerically simulated supercell thunderstorms where the goal is to identify the most important features and feature interrelationships which herald the development of strong rotation in the lowest altitudes of a storm. We identified sets of precursors, in the form of meteorological quantities reaching extreme values in a particular temporal sequence, unique to storms producing strong low-altitude rotation. The eventual goal is to use this knowledge for future severe weather detection and prediction algorithms.
AB - We introduce an efficient approach to mining multi-dimensional temporal streams of real-world data for ordered temporal motifs that can be used for prediction. Since many of the dimensions of the data are known or suspected to be irrelevant, our approach first identifies the salient dimensions of the data, then the key temporal motifs within each dimension, and finally the temporal ordering of the motifs necessary for prediction. For the prediction element, the data are assumed to be labeled. We tested the approach on two real-world data sets. To verify the generality of the approach, we validated the application on several subjects from the CMU Motion Capture database. Our main application uses several hundred numerically simulated supercell thunderstorms where the goal is to identify the most important features and feature interrelationships which herald the development of strong rotation in the lowest altitudes of a storm. We identified sets of precursors, in the form of meteorological quantities reaching extreme values in a particular temporal sequence, unique to storms producing strong low-altitude rotation. The eventual goal is to use this knowledge for future severe weather detection and prediction algorithms.
KW - Multi-dimensional
KW - Severe weather
KW - Temporal data mining
UR - http://www.scopus.com/inward/record.url?scp=78651352893&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=78651352893&partnerID=8YFLogxK
U2 - 10.1007/s10618-010-0193-7
DO - 10.1007/s10618-010-0193-7
M3 - Article
AN - SCOPUS:78651352893
SN - 1384-5810
VL - 22
SP - 232
EP - 258
JO - Data Mining and Knowledge Discovery
JF - Data Mining and Knowledge Discovery
IS - 1-2
ER -