GeoBurst: Real-time local event detection in geo-tagged tweet streams

Chao Zhang, Guangyu Zhou, Quan Yuan, Honglei Zhuang, Yu Zheng, Lance Kaplan, Shaowen Wang, Jiawei Han

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The real-time discovery of local events (e.g., protests, crimes, disasters) is of great importance to various applications, such as crime monitoring, disaster alarming, and activity recommendation. While this task was nearly impossible years ago due to the lack of timely and reliable data sources, the recent explosive growth in geo-tagged tweet data brings new opportunities to it. That said, how to extract quality local events from geo-tagged tweet streams in real time remains largely unsolved so far. We propose GEOBURST, a method that enables effective and real-time local event detection from geo-tagged tweet streams. With a novel authority measure that captures the geo-topic correlations among tweets, GEOBURST first identifies several pivots in the query window. Such pivots serve as representative tweets for potential local events and naturally attract similar tweets to form candidate events. To select truly interesting local events from the candidate list, GEOBURST further summarizes continuous tweet streams and compares the candidates against historical activities to obtain spatiotemporally bursty ones. Finally, GEOBURST also features an updating module that finds new pivots with little time cost when the query window shifts. As such, GEOBURST is capable of monitoring continuous streams in real time. We used crowdsourcing to evaluate GEOBURST on two real-life data sets that contain millions of geo-tagged tweets. The results demonstrate that GEOBURST significantly outperforms state-of-the-art methods in precision, and is orders of magnitude faster.

Original languageEnglish (US)
Title of host publicationSIGIR 2016 - Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval
PublisherAssociation for Computing Machinery, Inc
Pages513-522
Number of pages10
ISBN (Electronic)9781450342902
DOIs
StatePublished - Jul 7 2016
Event39th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2016 - Pisa, Italy
Duration: Jul 17 2016Jul 21 2016

Publication series

NameSIGIR 2016 - Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval

Other

Other39th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2016
CountryItaly
CityPisa
Period7/17/167/21/16

Fingerprint

Crime
Disasters
Monitoring
Costs

Keywords

  • Event detection
  • Local event
  • Social media
  • Tweet
  • Twitter

ASJC Scopus subject areas

  • Information Systems
  • Software

Cite this

Zhang, C., Zhou, G., Yuan, Q., Zhuang, H., Zheng, Y., Kaplan, L., ... Han, J. (2016). GeoBurst: Real-time local event detection in geo-tagged tweet streams. In SIGIR 2016 - Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 513-522). (SIGIR 2016 - Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval). Association for Computing Machinery, Inc. https://doi.org/10.1145/2911451.2911519

GeoBurst : Real-time local event detection in geo-tagged tweet streams. / Zhang, Chao; Zhou, Guangyu; Yuan, Quan; Zhuang, Honglei; Zheng, Yu; Kaplan, Lance; Wang, Shaowen; Han, Jiawei.

SIGIR 2016 - Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval. Association for Computing Machinery, Inc, 2016. p. 513-522 (SIGIR 2016 - Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Zhang, C, Zhou, G, Yuan, Q, Zhuang, H, Zheng, Y, Kaplan, L, Wang, S & Han, J 2016, GeoBurst: Real-time local event detection in geo-tagged tweet streams. in SIGIR 2016 - Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval. SIGIR 2016 - Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval, Association for Computing Machinery, Inc, pp. 513-522, 39th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2016, Pisa, Italy, 7/17/16. https://doi.org/10.1145/2911451.2911519
Zhang C, Zhou G, Yuan Q, Zhuang H, Zheng Y, Kaplan L et al. GeoBurst: Real-time local event detection in geo-tagged tweet streams. In SIGIR 2016 - Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval. Association for Computing Machinery, Inc. 2016. p. 513-522. (SIGIR 2016 - Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval). https://doi.org/10.1145/2911451.2911519
Zhang, Chao ; Zhou, Guangyu ; Yuan, Quan ; Zhuang, Honglei ; Zheng, Yu ; Kaplan, Lance ; Wang, Shaowen ; Han, Jiawei. / GeoBurst : Real-time local event detection in geo-tagged tweet streams. SIGIR 2016 - Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval. Association for Computing Machinery, Inc, 2016. pp. 513-522 (SIGIR 2016 - Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval).
@inproceedings{a698f14154174f1db473555111662d7e,
title = "GeoBurst: Real-time local event detection in geo-tagged tweet streams",
abstract = "The real-time discovery of local events (e.g., protests, crimes, disasters) is of great importance to various applications, such as crime monitoring, disaster alarming, and activity recommendation. While this task was nearly impossible years ago due to the lack of timely and reliable data sources, the recent explosive growth in geo-tagged tweet data brings new opportunities to it. That said, how to extract quality local events from geo-tagged tweet streams in real time remains largely unsolved so far. We propose GEOBURST, a method that enables effective and real-time local event detection from geo-tagged tweet streams. With a novel authority measure that captures the geo-topic correlations among tweets, GEOBURST first identifies several pivots in the query window. Such pivots serve as representative tweets for potential local events and naturally attract similar tweets to form candidate events. To select truly interesting local events from the candidate list, GEOBURST further summarizes continuous tweet streams and compares the candidates against historical activities to obtain spatiotemporally bursty ones. Finally, GEOBURST also features an updating module that finds new pivots with little time cost when the query window shifts. As such, GEOBURST is capable of monitoring continuous streams in real time. We used crowdsourcing to evaluate GEOBURST on two real-life data sets that contain millions of geo-tagged tweets. The results demonstrate that GEOBURST significantly outperforms state-of-the-art methods in precision, and is orders of magnitude faster.",
keywords = "Event detection, Local event, Social media, Tweet, Twitter",
author = "Chao Zhang and Guangyu Zhou and Quan Yuan and Honglei Zhuang and Yu Zheng and Lance Kaplan and Shaowen Wang and Jiawei Han",
year = "2016",
month = "7",
day = "7",
doi = "10.1145/2911451.2911519",
language = "English (US)",
series = "SIGIR 2016 - Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval",
publisher = "Association for Computing Machinery, Inc",
pages = "513--522",
booktitle = "SIGIR 2016 - Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval",

}

TY - GEN

T1 - GeoBurst

T2 - Real-time local event detection in geo-tagged tweet streams

AU - Zhang, Chao

AU - Zhou, Guangyu

AU - Yuan, Quan

AU - Zhuang, Honglei

AU - Zheng, Yu

AU - Kaplan, Lance

AU - Wang, Shaowen

AU - Han, Jiawei

PY - 2016/7/7

Y1 - 2016/7/7

N2 - The real-time discovery of local events (e.g., protests, crimes, disasters) is of great importance to various applications, such as crime monitoring, disaster alarming, and activity recommendation. While this task was nearly impossible years ago due to the lack of timely and reliable data sources, the recent explosive growth in geo-tagged tweet data brings new opportunities to it. That said, how to extract quality local events from geo-tagged tweet streams in real time remains largely unsolved so far. We propose GEOBURST, a method that enables effective and real-time local event detection from geo-tagged tweet streams. With a novel authority measure that captures the geo-topic correlations among tweets, GEOBURST first identifies several pivots in the query window. Such pivots serve as representative tweets for potential local events and naturally attract similar tweets to form candidate events. To select truly interesting local events from the candidate list, GEOBURST further summarizes continuous tweet streams and compares the candidates against historical activities to obtain spatiotemporally bursty ones. Finally, GEOBURST also features an updating module that finds new pivots with little time cost when the query window shifts. As such, GEOBURST is capable of monitoring continuous streams in real time. We used crowdsourcing to evaluate GEOBURST on two real-life data sets that contain millions of geo-tagged tweets. The results demonstrate that GEOBURST significantly outperforms state-of-the-art methods in precision, and is orders of magnitude faster.

AB - The real-time discovery of local events (e.g., protests, crimes, disasters) is of great importance to various applications, such as crime monitoring, disaster alarming, and activity recommendation. While this task was nearly impossible years ago due to the lack of timely and reliable data sources, the recent explosive growth in geo-tagged tweet data brings new opportunities to it. That said, how to extract quality local events from geo-tagged tweet streams in real time remains largely unsolved so far. We propose GEOBURST, a method that enables effective and real-time local event detection from geo-tagged tweet streams. With a novel authority measure that captures the geo-topic correlations among tweets, GEOBURST first identifies several pivots in the query window. Such pivots serve as representative tweets for potential local events and naturally attract similar tweets to form candidate events. To select truly interesting local events from the candidate list, GEOBURST further summarizes continuous tweet streams and compares the candidates against historical activities to obtain spatiotemporally bursty ones. Finally, GEOBURST also features an updating module that finds new pivots with little time cost when the query window shifts. As such, GEOBURST is capable of monitoring continuous streams in real time. We used crowdsourcing to evaluate GEOBURST on two real-life data sets that contain millions of geo-tagged tweets. The results demonstrate that GEOBURST significantly outperforms state-of-the-art methods in precision, and is orders of magnitude faster.

KW - Event detection

KW - Local event

KW - Social media

KW - Tweet

KW - Twitter

UR - http://www.scopus.com/inward/record.url?scp=84980343855&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84980343855&partnerID=8YFLogxK

U2 - 10.1145/2911451.2911519

DO - 10.1145/2911451.2911519

M3 - Conference contribution

AN - SCOPUS:84980343855

T3 - SIGIR 2016 - Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval

SP - 513

EP - 522

BT - SIGIR 2016 - Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval

PB - Association for Computing Machinery, Inc

ER -