Optimizing Matrix Multiply using PHiPAC: a Portable, High-Performance, ANSI C Coding Methodology (1997)

by J Bilmes, K Asanovic, C-W Chin, J Demmel
Venue: ICS ’97: Proceedings of the 11th International Conference on Supercomputing.