Abstract
This paper extends an earlier out-of-core Fast Fourier Transform (FFT) method for a uniprocessor with the Parallel Disk Model (PDM) to use multiple processors. Four out-of-core multiprocessor methods are examined. Operationally, these methods differ in the size of 'mini-butterfly' computed in memory and how the data are organized on the disks and in the distributed memory of the multiprocessor. The methods also perform differing amounts of I/O and communication. Two of them have the remarkable property that even though they are computing the FFT on a multiprocessor, all interprocessor communication occurs outside the mini-butterfly computations; communication that ordinarily occurs in a butterfly is folded into other data-movement operations. An analysis program shows that the two methods that use no butterfly communication usually use less communication overall than the other methods. The analysis program is fast enough that it can be invoked at run time to determine which of the four methods uses the least communication. One set of performance results on a small workstation cluster indicates that the methods without butterfly communication are approximately 9.5% faster. Moreover, they are much easier to implement.
Original language | English (US) |
---|---|
Pages | 68-77 |
Number of pages | 10 |
State | Published - 1997 |
Externally published | Yes |
Event | Proceedings of the 1997 5th Workshop on I/O in Parallel and Distributed Systems - San Jose, CA, USA Duration: Nov 17 1997 → Nov 17 1997 |
Other
Other | Proceedings of the 1997 5th Workshop on I/O in Parallel and Distributed Systems |
---|---|
City | San Jose, CA, USA |
Period | 11/17/97 → 11/17/97 |
ASJC Scopus subject areas
- Computer Science(all)