We present a computationally efficient method for deriving the most appropriate transformation and mapping of a nested loop for a given hierarchical parallel machine. This method is in the context of...
Generalized Unimodular Loop Transformations for Distributed Memory Multiprocessors (1995)
In this paper, we present a generalized unimodular loop transformation as a simple, systematic and elegant method for partitioning the iteration spaces of nested loops for execution on distributed...
Loop and Data Transformations: A Tutorial (1995)
Dattatraya Kulkarni, Michael Stumm, Ms A, D Kulkarni
In this tutorial, we address the problem of restructuring a (possibly sequential) program to improve execution efficiency on parallel machines. This restructuring involves the transformation and...
We present a computationally efficient method for deriving the most appropriate transformation and mapping of a nested loop for a given hierarchical parallel machine. This method is in the context of...