Social media intelligence and learning environment: An open source framework for social media data Collection, Analysis and Curation

Chen Wang, Luigi Marini, Chieh Li Chin, Nickolas Vance, Curtis Donelson, Pascal Meunier, Joseph T. Yun

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Social Media Intelligence and Learning Environment (SMILE) is an open source framework bringing cutting-edge computational models on social media data to social science researchers and students with any level of programming and computation expertise. Many existing social media analysis tools require programming knowledge, a fee, or are closed source, making it challenging for social science researchers to apply existing and new methods to social media data. SMILE provides a user-friendly web interface, through which researchers can perform a wide spectrum of research tasks, ranging from social media data collection, natural language processing, text classification, social network analysis, and generating human readable outputs and visualizations. SMILE has adopted several technologies to support its needs. The data service of SMILE leverages the GraphQL language to provide an efficient and succinct API for client to communicate with a heterogeneous collection of social media APIs, including Twitter and Reddit. SMILE implements a microservices design and utilizes Amazon AWS services, such as Lambda and Batch for computation, S3 for data storage, and Elasticsearch for a Twitter streaming database, which makes it more portable, economic, and resilient. Analysis outputs can be shared with the larger community using Clowder, an open source data management system to support data curation of long tail data and metadata. SMILE is one of the main applications deployed as a standalone tool within the Social Media Macroscope (SMM), a science gateway based on the HUBzero platform. Over 200 users have used SMILE since its first release in 2018.

Original languageEnglish (US)
Title of host publicationProceedings - IEEE 15th International Conference on eScience, eScience 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages252-261
Number of pages10
ISBN (Electronic)9781728124513
DOIs
StatePublished - Sep 2019
Event15th IEEE International Conference on eScience, eScience 2019 - San Diego, United States
Duration: Sep 24 2019Sep 27 2019

Publication series

NameProceedings - IEEE 15th International Conference on eScience, eScience 2019

Conference

Conference15th IEEE International Conference on eScience, eScience 2019
Country/TerritoryUnited States
CitySan Diego
Period9/24/199/27/19

Keywords

  • Cloud computing
  • Data management
  • Research computing infrastructure
  • Scientific gateways
  • Text mining

ASJC Scopus subject areas

  • Computer Science Applications
  • Software
  • Ecological Modeling
  • Modeling and Simulation

Fingerprint

Dive into the research topics of 'Social media intelligence and learning environment: An open source framework for social media data Collection, Analysis and Curation'. Together they form a unique fingerprint.

Cite this