TY - JOUR
T1 - Sequential Anomaly Detection Under Sampling Constraints
AU - Tsopelakos, Aristomenis
AU - Fellouris, Georgios
N1 - The work of Aristomenis Tsopelakos was supported in part by NSF through the University of Illinois at Urbana Champaign under Grant CIF 1514245. The work of Georgios Fellouris was supported in part by NSF through the University of Illinois at Urbana Champaign under Grant CIF 1514245 and Grant DMS 1737962.
PY - 2023/12/1
Y1 - 2023/12/1
N2 - The problem of sequential anomaly detection is considered, where multiple data sources are monitored in real time and the goal is to identify the 'anomalous' ones among them, when it is not possible to sample all sources at all times. A detection scheme in this context requires specifying not only when to stop sampling and which sources to identify as anomalous upon stopping, but also which sources to sample at each time instance until stopping. A novel formulation for this problem is proposed, in which the number of anomalous sources is not necessarily known in advance and the number of sampled sources per time instance is not necessarily fixed. Instead, an arbitrary lower bound and an arbitrary upper bound are assumed on the number of anomalous sources, and the fraction of the expected number of samples over the expected time until stopping is required to not exceed an arbitrary, user-specified level. In addition to this sampling constraint, the probabilities of at least one false alarm and at least one missed detection are controlled below user-specified tolerance levels. A general criterion is established for a policy to achieve the minimum expected time until stopping to a first-order asymptotic approximation as the two familywise error rates go to zero. Moreover, the asymptotic optimality is established of a family of policies that sample each source at each time instance with a probability that depends on past observations only through the current estimate of the subset of anomalous sources. This family includes, in particular, a novel policy that requires minimal computation under any setup of the problem.
AB - The problem of sequential anomaly detection is considered, where multiple data sources are monitored in real time and the goal is to identify the 'anomalous' ones among them, when it is not possible to sample all sources at all times. A detection scheme in this context requires specifying not only when to stop sampling and which sources to identify as anomalous upon stopping, but also which sources to sample at each time instance until stopping. A novel formulation for this problem is proposed, in which the number of anomalous sources is not necessarily known in advance and the number of sampled sources per time instance is not necessarily fixed. Instead, an arbitrary lower bound and an arbitrary upper bound are assumed on the number of anomalous sources, and the fraction of the expected number of samples over the expected time until stopping is required to not exceed an arbitrary, user-specified level. In addition to this sampling constraint, the probabilities of at least one false alarm and at least one missed detection are controlled below user-specified tolerance levels. A general criterion is established for a policy to achieve the minimum expected time until stopping to a first-order asymptotic approximation as the two familywise error rates go to zero. Moreover, the asymptotic optimality is established of a family of policies that sample each source at each time instance with a probability that depends on past observations only through the current estimate of the subset of anomalous sources. This family includes, in particular, a novel policy that requires minimal computation under any setup of the problem.
KW - Active sensing
KW - anomaly detection
KW - asymptotic optimality
KW - controlled sensing
KW - sequential design of experiments
KW - sequential detection
KW - sequential sampling
KW - sequential testing
UR - http://www.scopus.com/inward/record.url?scp=85130773984&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85130773984&partnerID=8YFLogxK
U2 - 10.1109/TIT.2022.3177142
DO - 10.1109/TIT.2022.3177142
M3 - Article
AN - SCOPUS:85130773984
SN - 0018-9448
VL - 69
SP - 8126
EP - 8146
JO - IEEE Transactions on Information Theory
JF - IEEE Transactions on Information Theory
IS - 12
ER -