TY - JOUR
T1 - Controlled Sensing for Sequential Multihypothesis Testing with Controlled Markovian Observations and Non-Uniform Control Cost
AU - Nitinawarat, Sirin
AU - Veeravalli, Venupogal V.
N1 - Publisher Copyright:
© 2015, Copyright Taylor & Francis Group, LLC.
PY - 2015/1/2
Y1 - 2015/1/2
N2 - Abstract: A new model for controlled sensing for multihypothesis testing is proposed and studied in the sequential setting. This new model, termed a controlled Markovian observation model, exhibits a more complicated memory structure in the controlled observations than existing models. In addition, instead of penalizing just the delay until the final decision time as in standard sequential hypothesis testing problems, a much more general cost structure is considered that entails accumulating the total control cost with respect to an arbitrary control cost function. An asymptotically optimal test is proposed for this new model and is shown to satisfy an optimality condition formulated in terms of decision-making risk. It is shown that the optimal causal control policy for the controlled sensing problem is self-tuning, in the sense of maximizing an inherent “inferential” reward simultaneously under every hypothesis, with the maximal value being the best possible corresponding to the case where the true hypothesis is known at the outset. Another test is also proposed to meet distinctly predefined constraints on the various decision risks nonasymptotically, while retaining asymptotic optimality.
AB - Abstract: A new model for controlled sensing for multihypothesis testing is proposed and studied in the sequential setting. This new model, termed a controlled Markovian observation model, exhibits a more complicated memory structure in the controlled observations than existing models. In addition, instead of penalizing just the delay until the final decision time as in standard sequential hypothesis testing problems, a much more general cost structure is considered that entails accumulating the total control cost with respect to an arbitrary control cost function. An asymptotically optimal test is proposed for this new model and is shown to satisfy an optimality condition formulated in terms of decision-making risk. It is shown that the optimal causal control policy for the controlled sensing problem is self-tuning, in the sense of maximizing an inherent “inferential” reward simultaneously under every hypothesis, with the maximal value being the best possible corresponding to the case where the true hypothesis is known at the outset. Another test is also proposed to meet distinctly predefined constraints on the various decision risks nonasymptotically, while retaining asymptotic optimality.
KW - Adaptive stochastic control
KW - Controlled sensing for inference
KW - Markov decision process
KW - Self-tuning control policy
KW - Sequential hypothesis testing
UR - http://www.scopus.com/inward/record.url?scp=84922362757&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84922362757&partnerID=8YFLogxK
U2 - 10.1080/07474946.2014.961864
DO - 10.1080/07474946.2014.961864
M3 - Article
AN - SCOPUS:84922362757
SN - 0747-4946
VL - 34
SP - 1
EP - 24
JO - Sequential Analysis
JF - Sequential Analysis
IS - 1
ER -