We address the problem of deriving systolic arrays in which the processor utilization is 100%. We first address this problem in the context of synthesis horn Uniform Recurrence Equations (UREs), and then generalize our result to deal with arbitrary systolic arrays (outside the context of synthesis). We show that in a systolic array, it is always possible to merge a parallelepiped of neighboring processors which are active at different clock cycles. The new array is fully efficient and its processors have almost the same cost as the original one. Such merging corresponds exactly to the transformation by a quasi-linear function. When the original array is derived by integral linear projections of systems of UREs, we give a method to mechanically determine the quasi-linear allocation function which yields the efficient array. The technique can also be extended to any (piece-wise) systolic array to derive a fully efficient array by “post-processing” it with a (piece-wise) quasi-linear function.
CITATION STYLE
Zhong, X., & Rajopadhye, S. (1991). Deriving fully efficient systolic arrays by quasi-linear allocation functions. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 505 LNCS, pp. 219–235). Springer Verlag. https://doi.org/10.1007/978-3-662-25209-3_16
Mendeley helps you to discover research relevant for your work.