Synthesizing efficient out-of-core programs for block recursive algorithms using block-cyclic data distributions
In this paper, we present a framework for synthesizing I/O efficient out-of-core programs for block recursive algorithms, such as the fast Fourier transform (FFT) and block matrix transposition algorithms. Our framework uses an algebraic representation which is based on tensor products and other matrix operations. The programs are optimized for the striped Vitter and Shriver's two-level memory model in which data can be distributed using various cyclic(B) distributions in contrast to the normally used physical track distribution cyclic(B
Duke Scholars
Published In
DOI
ISSN
Publication Date
Volume
Issue
Start / End Page
Related Subject Headings
- Distributed Computing
- 4606 Distributed computing and systems software
- 1005 Communications Technologies
- 0805 Distributed Computing
- 0803 Computer Software
Citation
Published In
DOI
ISSN
Publication Date
Volume
Issue
Start / End Page
Related Subject Headings
- Distributed Computing
- 4606 Distributed computing and systems software
- 1005 Communications Technologies
- 0805 Distributed Computing
- 0803 Computer Software