TY - GEN
T1 - Scheduling independent tasks sharing large data distributed with BitTorrent
AU - Wei, Baohua
AU - Fedak, Gilles
AU - Cappello, Franck
PY - 2005
Y1 - 2005
N2 - Data-centric applications are still a challenging issue for large scale distributed computing systems. The emergence of new protocols and software for collaborative content distribution over Internet offers a new opportunity for efficient and fast delivery of high volume of data. In a previous paper, we have investigated BitTorrent as a protocol for data diffusion in the context of Computational Desktop Grid. We showed that BitTorrent is efficient for large file transfers, scalable when the number of nodes increases but suffers from a high overhead when transmitting small files. This paper investigates two approach to overcome these limitations. First, we propose a performance model to select the best of FTP and BitTorrent protocols according to the size of the file to distribute and the number of receiver nodes. Next we propose enhancement of the BitTorrent protocol which provides more predictable communication patterns. We design a model for communication performance and evaluate BitTorrent-aware versions BT-MinMin, BT-MaxMin and BTSufferage scheduling heuristics against a synthetic parameter-sweep application.
AB - Data-centric applications are still a challenging issue for large scale distributed computing systems. The emergence of new protocols and software for collaborative content distribution over Internet offers a new opportunity for efficient and fast delivery of high volume of data. In a previous paper, we have investigated BitTorrent as a protocol for data diffusion in the context of Computational Desktop Grid. We showed that BitTorrent is efficient for large file transfers, scalable when the number of nodes increases but suffers from a high overhead when transmitting small files. This paper investigates two approach to overcome these limitations. First, we propose a performance model to select the best of FTP and BitTorrent protocols according to the size of the file to distribute and the number of receiver nodes. Next we propose enhancement of the BitTorrent protocol which provides more predictable communication patterns. We design a model for communication performance and evaluate BitTorrent-aware versions BT-MinMin, BT-MaxMin and BTSufferage scheduling heuristics against a synthetic parameter-sweep application.
UR - http://www.scopus.com/inward/record.url?scp=33749656482&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33749656482&partnerID=8YFLogxK
U2 - 10.1109/GRID.2005.1542745
DO - 10.1109/GRID.2005.1542745
M3 - Conference contribution
AN - SCOPUS:33749656482
SN - 0780394933
SN - 9780780394933
T3 - Proceedings - IEEE/ACM International Workshop on Grid Computing
SP - 219
EP - 226
BT - Proceedings of the 6th IEEE/ACM International Workshop on Grid Computing
PB - IEEE Computer Society
T2 - 6th IEEE/ACM International Workshop on Grid Computing
Y2 - 13 November 2005 through 14 November 2005
ER -