TY - GEN
T1 - On diversifying source selection in social sensing
AU - Uddin, Md Yusuf S.
AU - Al Amin, Md Tanvir
AU - Le, Hieu
AU - Abdelzaher, Tarek
AU - Szymanski, Boleslaw
AU - Nguyen, Tommy
PY - 2012
Y1 - 2012
N2 - This paper develops algorithms for improved source selection in social sensing applications that exploit social networks (such as Twitter, Flickr, or other mass dissemination networks) for reporting. The collection point in these applications would simply be authorized to view relevant information from participating clients (either by explicit client-side action or by default such as on Twitter). Social networks, therefore, create unprecedented opportunities for the development of sensing applications, where humans act as sensors or sensor operators, simply by posting their observations or measurements on the shared medium. Resulting social sensing applications, for example, can report traffic speed based on GPS data shared by drivers, or determine damage in the aftermath of a natural disaster based on eye-witness reports. A key problem, when dealing with human sources on social media, is the difficulty in ensuring independence of measurements, making it harder to distinguish fact from rumor. This is because observations posted by one source are available to its neighbors in the social network, who may, in-turn, propagate those observations without verifying their correctness, thus creating correlations and bias. A corner-stone of successful social sensing is therefore to ensure an unbiased sampling of sources that minimizes dependence between them. This paper explores the merits of such diversification. It shows that a diversified sampling is advantageous not only in terms of reducing the number of samples but also in improving our ability to correctly estimate the accuracy of data in social sensing.
AB - This paper develops algorithms for improved source selection in social sensing applications that exploit social networks (such as Twitter, Flickr, or other mass dissemination networks) for reporting. The collection point in these applications would simply be authorized to view relevant information from participating clients (either by explicit client-side action or by default such as on Twitter). Social networks, therefore, create unprecedented opportunities for the development of sensing applications, where humans act as sensors or sensor operators, simply by posting their observations or measurements on the shared medium. Resulting social sensing applications, for example, can report traffic speed based on GPS data shared by drivers, or determine damage in the aftermath of a natural disaster based on eye-witness reports. A key problem, when dealing with human sources on social media, is the difficulty in ensuring independence of measurements, making it harder to distinguish fact from rumor. This is because observations posted by one source are available to its neighbors in the social network, who may, in-turn, propagate those observations without verifying their correctness, thus creating correlations and bias. A corner-stone of successful social sensing is therefore to ensure an unbiased sampling of sources that minimizes dependence between them. This paper explores the merits of such diversification. It shows that a diversified sampling is advantageous not only in terms of reducing the number of samples but also in improving our ability to correctly estimate the accuracy of data in social sensing.
UR - http://www.scopus.com/inward/record.url?scp=84865040725&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84865040725&partnerID=8YFLogxK
U2 - 10.1109/inss.2012.6240519
DO - 10.1109/inss.2012.6240519
M3 - Conference contribution
AN - SCOPUS:84865040725
SN - 9781467317856
T3 - 9th International Conference on Networked Sensing Systems, INSS 2012 - Conference Proceedings
BT - 9th International Conference on Networked Sensing Systems, INSS 2012 - Conference Proceedings
PB - IEEE Computer Society
T2 - 9th International Conference on Networked Sensing Systems, INSS 2012
Y2 - 11 June 2012 through 14 June 2012
ER -