TY - GEN
T1 - Uninformed-to-informed exploration in unstructured real-world environments
AU - Axelrod, Allan
AU - Chowdhary, Girish
N1 - Publisher Copyright:
© Copyright 2015, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
PY - 2015
Y1 - 2015
N2 - Conventionally, the process of learning the model (exploration) is initialized as either an uninformed or informed policy, where the latter leverages observations to guide future exploration. Informed exploration is ideal as it may allow a model to be learned in fewer samples. However, informed exploration cannot be implemented from the onset when a-priori knowledge on the sensing domain statistics are not available; such policies would only sample the first set of locations, repeatedly. Hence, we present a theoretically-derived bound for transitioning from uninformed exploration to informed exploration for unstructured real-world environments which may be partially-observable and time-varying. This bound is used in tandem with a sparsified Bayesian nonparametric Poisson Exposure Process, which is used to learn to predict the value of information in partially- observable and time-varying domains. The result is an uninformed-to-informed exploration policy which outperforms baseline algorithms in real-world data-sets.
AB - Conventionally, the process of learning the model (exploration) is initialized as either an uninformed or informed policy, where the latter leverages observations to guide future exploration. Informed exploration is ideal as it may allow a model to be learned in fewer samples. However, informed exploration cannot be implemented from the onset when a-priori knowledge on the sensing domain statistics are not available; such policies would only sample the first set of locations, repeatedly. Hence, we present a theoretically-derived bound for transitioning from uninformed exploration to informed exploration for unstructured real-world environments which may be partially-observable and time-varying. This bound is used in tandem with a sparsified Bayesian nonparametric Poisson Exposure Process, which is used to learn to predict the value of information in partially- observable and time-varying domains. The result is an uninformed-to-informed exploration policy which outperforms baseline algorithms in real-world data-sets.
UR - http://www.scopus.com/inward/record.url?scp=84964680546&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84964680546&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84964680546
T3 - AAAI Fall Symposium - Technical Report
SP - 2
EP - 3
BT - Self-Confidence in Autonomous Systems - Papers from the AAAI 2015 Fall Symposium, Technical Report
PB - AI Access Foundation
T2 - AAAI 2015 Fall Symposium
Y2 - 12 November 2015 through 14 November 2015
ER -