Planning large data transfers in institutional grids

Fatiha Bouabache, Thomas Herault, Sylvain Peyronnet, Franck Cappello

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In grid computing, many scientific and engineering applications require access to large amounts of distributed data. The size and number of these data collections has been growing rapidly in recent years. The costs of data transmission take a significant part of the global execution time. When communication streams flow concurrently on shared links, transport control protocols have issues allocating fair bandwidth to all the streams, and the network becomes sub-optimally used. One way to deal with this situation is to schedule the communications in a way that will induce an optimal use of the network. We focus on the case of large data transfers that can be completely described at the initialization time. In this case, a plan of data migration can be computed at initialization time, and then executed. However, this computation phase must take a small time when compared to the actual execution of the plan. We propose a best effort solution, to compute approximately, based on the uniform random sampling of possible schedules, a communication plan. We show the effectiveness of this approach both theoretically and by simulations.

Original languageEnglish (US)
Title of host publicationCCGrid 2010 - 10th IEEE/ACM International Conference on Cluster, Cloud, and Grid Computing
Pages547-552
Number of pages6
DOIs
StatePublished - 2010
Externally publishedYes
Event10th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, CCGrid 2010 - Melbourne, VIC, Australia
Duration: May 17 2010May 20 2010

Publication series

NameCCGrid 2010 - 10th IEEE/ACM International Conference on Cluster, Cloud, and Grid Computing

Other

Other10th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, CCGrid 2010
Country/TerritoryAustralia
CityMelbourne, VIC
Period5/17/105/20/10

Keywords

  • Data transfer
  • Scheduling

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Computer Networks and Communications
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Planning large data transfers in institutional grids'. Together they form a unique fingerprint.

Cite this