Scheduling independent tasks sharing large data distributed with BitTorrent

Baohua Wei, Gilles Fedak, Franck Cappello

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Data-centric applications are still a challenging issue for large scale distributed computing systems. The emergence of new protocols and software for collaborative content distribution over Internet offers a new opportunity for efficient and fast delivery of high volume of data. In a previous paper, we have investigated BitTorrent as a protocol for data diffusion in the context of Computational Desktop Grid. We showed that BitTorrent is efficient for large file transfers, scalable when the number of nodes increases but suffers from a high overhead when transmitting small files. This paper investigates two approach to overcome these limitations. First, we propose a performance model to select the best of FTP and BitTorrent protocols according to the size of the file to distribute and the number of receiver nodes. Next we propose enhancement of the BitTorrent protocol which provides more predictable communication patterns. We design a model for communication performance and evaluate BitTorrent-aware versions BT-MinMin, BT-MaxMin and BTSufferage scheduling heuristics against a synthetic parameter-sweep application.

Original languageEnglish (US)
Title of host publicationProceedings of the 6th IEEE/ACM International Workshop on Grid Computing
PublisherIEEE Computer Society
Pages219-226
Number of pages8
ISBN (Print)0780394933, 9780780394933
DOIs
StatePublished - 2005
Externally publishedYes
Event6th IEEE/ACM International Workshop on Grid Computing - Seattle, WA, United States
Duration: Nov 13 2005Nov 14 2005

Publication series

NameProceedings - IEEE/ACM International Workshop on Grid Computing
Volume2005
ISSN (Print)1550-5510

Other

Other6th IEEE/ACM International Workshop on Grid Computing
Country/TerritoryUnited States
CitySeattle, WA
Period11/13/0511/14/05

ASJC Scopus subject areas

  • General Engineering

Fingerprint

Dive into the research topics of 'Scheduling independent tasks sharing large data distributed with BitTorrent'. Together they form a unique fingerprint.

Cite this