TY - GEN
T1 - Recursive truth estimation of time-varying sensing data from online open sources
AU - Cui, Hang
AU - Abdelzaher, Tarek
AU - Kaplan, Lance
N1 - ACKNOWLEDGMENT Research reported in this paper was sponsored in part by the U.S. Army Research Laboratory and was accomplished under Cooperative Agreements W911NF-17-2-0196 and W911NF-09-2-0053, DARPA contract W911NF-17-C-0099, and NSF grants CNS 13-29886 and CNS 16-18627. The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of the Army Research Laboratory or the U.S. Government. The U.S. Government is authorized to reproduce and distribute reprints for Government purposes notwithstanding any copyright notation here on.
PY - 2018/10/25
Y1 - 2018/10/25
N2 - This paper is motivated by prospective Internet of Things (IoT) applications that exploit inputs from online open sources whose reliability may be uncertain. Unlike physical signal fusion (that can leverage solid analytic foundations derived from physical properties of fused signals), data reliability assessment from arbitrary online open sources is a harder problem. At least two difficulties arise. First, source reliability is harder to estimate from first principles due to lack of visibility into the sensing and subsequent processing stages for published data. Second, by virtue of being open, some sources can be copied by others, leading to correlated errors at a large scale. This paper presents a recursive truth estimator for online public data streams that addresses the above two problems. We focus on categorical data. Many truth-finding systems were developed to cope with unreliable categorical data. Most of them are designed for batch analysis of bulk datasets. This work extends previous efforts by developing an online recursive estimator. Unlike previous recursive fact-finders, ours is the first that can jointly handle (i) changes in the population of sources over time, (ii) changes in the ground-truth state of the physical phenomenon being observed (that result in the appearance of conflicting claims), and (iii) correlated errors due to potential copying among sources. Results show that our algorithm not only outperforms other recursive fact-finders in the case of changing ground-truth state, but also improves estimation accuracy of static state.
AB - This paper is motivated by prospective Internet of Things (IoT) applications that exploit inputs from online open sources whose reliability may be uncertain. Unlike physical signal fusion (that can leverage solid analytic foundations derived from physical properties of fused signals), data reliability assessment from arbitrary online open sources is a harder problem. At least two difficulties arise. First, source reliability is harder to estimate from first principles due to lack of visibility into the sensing and subsequent processing stages for published data. Second, by virtue of being open, some sources can be copied by others, leading to correlated errors at a large scale. This paper presents a recursive truth estimator for online public data streams that addresses the above two problems. We focus on categorical data. Many truth-finding systems were developed to cope with unreliable categorical data. Most of them are designed for batch analysis of bulk datasets. This work extends previous efforts by developing an online recursive estimator. Unlike previous recursive fact-finders, ours is the first that can jointly handle (i) changes in the population of sources over time, (ii) changes in the ground-truth state of the physical phenomenon being observed (that result in the appearance of conflicting claims), and (iii) correlated errors due to potential copying among sources. Results show that our algorithm not only outperforms other recursive fact-finders in the case of changing ground-truth state, but also improves estimation accuracy of static state.
KW - Computer Science
KW - Noise Measurement
KW - Observers
KW - Reliability
KW - Sensors
UR - http://www.scopus.com/inward/record.url?scp=85057117040&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85057117040&partnerID=8YFLogxK
U2 - 10.1109/DCOSS.2018.00012
DO - 10.1109/DCOSS.2018.00012
M3 - Conference contribution
AN - SCOPUS:85057117040
T3 - Proceedings - 14th Annual International Conference on Distributed Computing in Sensor Systems, DCOSS 2018
SP - 25
EP - 34
BT - Proceedings - 14th Annual International Conference on Distributed Computing in Sensor Systems, DCOSS 2018
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 14th Annual International Conference on Distributed Computing in Sensor Systems, DCOSS 2018
Y2 - 18 June 2018 through 19 June 2018
ER -