TY - GEN
T1 - An architecture for automatic deployment of Brown Dog services at scale into diverse computing infrastructures
AU - Padhy, Smruti
AU - Alameda, Jay
AU - Kooper, Rob
AU - Liu, Rui
AU - Satheesan, Sandeep Puthanveetil
AU - Zharnitsky, Inna
AU - Jansen, Gregory
AU - Dietze, Michael C.
AU - Kumar, Praveen
AU - Lee, Jong
AU - Marciano, Richard
AU - Marini, Luigi
AU - Minsker, Barbara S.
AU - Navarro, Chris
AU - Slavenas, Marcus
AU - Sullivan, William
AU - McHenry, Kenton
N1 - Publisher Copyright: © 2016 ACM.
PY - 2016/7/17
Y1 - 2016/7/17
N2 - Brown Dog is an extensible data cyberinfrastructure that provides a set of extensible and distributed data conversion and metadata extraction services to enable access and search within unstructured, un-curated and inaccessible research data across different domains of sciences and social science, which ultimately aids in supporting reproducibility of results. We envision that Brown Dog, as a data cyberinfrastructure, is an essential service in a comprehensive cyberinfrastructure which includes data services, high performance computing services and more that would enable scholarly research in a variety of disciplines that today is not yet possible. Brown Dog focuses on four initial use cases, specifically, addressing the conversion and extraction needs in the research areas of ecology, civil and environmental engineering, library and information science, and use by the general public. In this paper, we describe an architecture that supports contribution of data transformation tools from users, and automatic deployment of the tools as Brown Dog services in diverse infrastructures such as cloud or high performance computing (HPC) based on user demands and load on the system. We also present results validating the performance of the initial implementation of Brown Dog.
AB - Brown Dog is an extensible data cyberinfrastructure that provides a set of extensible and distributed data conversion and metadata extraction services to enable access and search within unstructured, un-curated and inaccessible research data across different domains of sciences and social science, which ultimately aids in supporting reproducibility of results. We envision that Brown Dog, as a data cyberinfrastructure, is an essential service in a comprehensive cyberinfrastructure which includes data services, high performance computing services and more that would enable scholarly research in a variety of disciplines that today is not yet possible. Brown Dog focuses on four initial use cases, specifically, addressing the conversion and extraction needs in the research areas of ecology, civil and environmental engineering, library and information science, and use by the general public. In this paper, we describe an architecture that supports contribution of data transformation tools from users, and automatic deployment of the tools as Brown Dog services in diverse infrastructures such as cloud or high performance computing (HPC) based on user demands and load on the system. We also present results validating the performance of the initial implementation of Brown Dog.
KW - Autocuration
KW - Civil and environmental engineering
KW - Cloud
KW - Data conversion
KW - Data cyberinfrastructure
KW - Digital preservation
KW - Ecology
KW - Elasticity
KW - HPC
KW - Library and information science
KW - Metadata extraction
UR - http://www.scopus.com/inward/record.url?scp=84989177759&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84989177759&partnerID=8YFLogxK
U2 - 10.1145/2949550.2949647
DO - 10.1145/2949550.2949647
M3 - Conference contribution
AN - SCOPUS:84989177759
T3 - ACM International Conference Proceeding Series
BT - Proceedings of XSEDE 2016
PB - Association for Computing Machinery
T2 - Conference on Diversity, Big Data, and Science at Scale, XSEDE 2016
Y2 - 17 July 2016 through 21 July 2016
ER -