Paul J. Steinhardt, P. Chaudhari
Journal of Computational Physics
Techniques and algorithms for efficient in-place conversion to and from standard and blocked matrix storage formats are described. Such functionality is required by numerical libraries that use different data layouts internally. Parallel algorithms and a software package for in-place matrix storage format conversion based on in-place matrix transposition are presented and evaluated. One of the key ingredients is a new algorithm for in-place transposition that efficiently determines the structure of the transposition permutation a priori, which enables effective load balancing in a parallel environment. © 2012 ACM.
Da-Ke He, Ashish Jagmohan, et al.
ISIT 2007
Ziv Bar-Yossef, T.S. Jayram, et al.
Journal of Computer and System Sciences
Jaione Tirapu Azpiroz, Alan E. Rosenbluth, et al.
SPIE Photomask Technology + EUV Lithography 2009