Abstract
We present a clustering algorithm that classifies the points of a dataset by a combination of scalar variables' values as well as spatial locations. How heavily the spatial locations impact the algorithm is a tunable parameter. With no impact the algorithm bins the data by calculating a histogram and classifies each point by a bin ID. With full impact, points are bunched together by spatial neighborhood regardless of value. This approach is unsurprisingly very sensitive to this weighting; a sampling of possible values yields a wide variety of classifications. However, we have found that when tuned just right it is indeed possible to extract meaningful features from the resulting clustering. Furthermore, the principles behind our development of this technique are also applicable in both tuning the algorithm as well as in selecting data regions. In this paper we will provide the details of design and implementation and demonstrate using the auto-tuned approach to extract interesting regions of real scientific data. Our target application is the automatic detection of land cover data anomalies in NASA's Moderate Resolution Imaging Spectroradiometer (MODIS) sensors.
Original language | English (US) |
---|---|
Pages (from-to) | 2207-2216 |
Number of pages | 10 |
Journal | Procedia Computer Science |
Volume | 51 |
Issue number | 1 |
DOIs | |
State | Published - 2015 |
Event | International Conference on Computational Science, ICCS 2002 - Amsterdam, Netherlands Duration: Apr 21 2002 → Apr 24 2002 |
Keywords
- Anomaly detection
- MODIS
- Parallel computing
- Visualization
ASJC Scopus subject areas
- Computer Science(all)