Improving the start-up time of python applications on large scale HPC systems

Colin A. Maclean, Hon Wai Leong, Jeremy Enos

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Interpreted programming languages (e.g. Perl [15], Python [13], R [12]) are gaining popularity in modern scientific computation. The syntaxes are easy to read and learn, are flexible, and are portable for execution on different HPC systems without the need to recompile. This is achieved by executing platform independent byte code instructions in a virtual machine, in contrast to platform dependent machine language instructions run directly on the hardware. Debugging in such a language is easier, as the program runs through one statement at a time and stops whenever there is an error, prompting the location of the error almost immediately. On the other hand, a programming language that uses a compiler takes extra time for compilation and often requires additional linking to library dependencies. It is more difficult to debug, as an error is only generated after compilation. As such, writing and debugging in interpreter language is convenient as programmers can change the code quickly and test it without the need to recompile. Due to the nature of interpreted languages, the source files and dependencies typically exist as many different files, as is the case with Python, rather than being linked together from source then object files into a smaller number of executables and libraries. This property of interpreted languages demands a large number of input/ output operations per second (IOPs) for fast start-up times. When many nodes of a HPC system launch an interpreted program, this behavior places significant stress on the metadata server of parallel file systems, leading to poor start-up performance and impacts the file system performance for all users.

Original languageEnglish (US)
Title of host publicationProceedings of HPCSYSPROS 2017
Subtitle of host publicationHPC Systems Professionals Workshop, Held in conjunction with SC 2017: The International Conference for High Performance Computing, Networking, Storage and Analysis
PublisherAssociation for Computing Machinery
ISBN (Electronic)9781450351287
DOIs
StatePublished - Nov 12 2017
EventHPC Systems Professionals Workshop, HPCSYSPROS 2017 - Denver, United States
Duration: Nov 12 2017Nov 17 2017

Publication series

NameProceedings of HPCSYSPROS 2017: HPC Systems Professionals Workshop, Held in conjunction with SC 2017: The International Conference for High Performance Computing, Networking, Storage and Analysis

Other

OtherHPC Systems Professionals Workshop, HPCSYSPROS 2017
Country/TerritoryUnited States
CityDenver
Period11/12/1711/17/17

ASJC Scopus subject areas

  • Software

Fingerprint

Dive into the research topics of 'Improving the start-up time of python applications on large scale HPC systems'. Together they form a unique fingerprint.

Cite this