TY - JOUR
T1 - Dynamic Remapping of Parallel Computations with Varying Resource Demands
AU - Nicol, David M.
AU - Saltz, Joel H.
N1 - Funding Information:
Manuscript received July 17, 1986; revised January 22, 1987. This work was supported by the National Aeronautics and Space Administration under NASA Contracts NASI-I7070 and NAS1-18107, while the authors were in residence at ICASE, Langley Research Center, Hampton, VA. D. M. Nicol is with the Department of Computer Science, College of William and Mary, Williamsburg, VA 23185. J. H. Saltz is with the Department of Computer Science, Yale University, New Haven, CT. IEEE Log Number 8718426.
PY - 1988/9
Y1 - 1988/9
N2 - A large class of computational problems are characterized by frequent synchronization, and computational requirements which change as a function of time. When such a problem is solved on a message passing multiprocessor machine, the combination of these characteristics leads to system performance which deteriorates in time. Performance can be improved with periodic redistribution of computational load; however, redistribution can exact a sometimes large delay cost. We study the issue of deciding when to invoke a global load remapping mechanism. Such a decision policy must effectively weigh the costs of remapping against the performance benefits, and should be general enough to apply automatically to a wide range of computations. We propose here a general remapping decision heuristic, then study its effectiveness and its anticipated behavior on two very different models of load evolution. Assuming only that the remapping cost is known, this policy dynamically attempts to minimize system degradation (including the cost of remapping) per computation step. This policy is quite simple, choosing to remap when the first local minimum in the degradation function is detected. Simulations show that our policy provides significantly better performance than that achieved by never remapping. We also observe that the average interremapping frequency is quite close to the optimal fixed remapping frequency. This phenomenon is explained, in part, analytically. We show that when performance deteriorates monotonically (in expectation), then the heuristic is well motivated. Finally, using a tractable model of load evolution, we compare our policy’s performance to provably optimal performance.
AB - A large class of computational problems are characterized by frequent synchronization, and computational requirements which change as a function of time. When such a problem is solved on a message passing multiprocessor machine, the combination of these characteristics leads to system performance which deteriorates in time. Performance can be improved with periodic redistribution of computational load; however, redistribution can exact a sometimes large delay cost. We study the issue of deciding when to invoke a global load remapping mechanism. Such a decision policy must effectively weigh the costs of remapping against the performance benefits, and should be general enough to apply automatically to a wide range of computations. We propose here a general remapping decision heuristic, then study its effectiveness and its anticipated behavior on two very different models of load evolution. Assuming only that the remapping cost is known, this policy dynamically attempts to minimize system degradation (including the cost of remapping) per computation step. This policy is quite simple, choosing to remap when the first local minimum in the degradation function is detected. Simulations show that our policy provides significantly better performance than that achieved by never remapping. We also observe that the average interremapping frequency is quite close to the optimal fixed remapping frequency. This phenomenon is explained, in part, analytically. We show that when performance deteriorates monotonically (in expectation), then the heuristic is well motivated. Finally, using a tractable model of load evolution, we compare our policy’s performance to provably optimal performance.
KW - Adaptive computation
KW - dynamic load balancing
KW - multiprocessors
KW - optimal partitioning
KW - parallel computation
KW - remapping
UR - http://www.scopus.com/inward/record.url?scp=0024072710&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0024072710&partnerID=8YFLogxK
U2 - 10.1109/12.2258
DO - 10.1109/12.2258
M3 - Article
AN - SCOPUS:0024072710
SN - 0018-9340
VL - 37
SP - 1073
EP - 1087
JO - IEEE Transactions on Computers
JF - IEEE Transactions on Computers
IS - 9
ER -