MAGMA 2.5.4
Matrix Algebra for GPU and Multicore Architectures
lauum: Multiply a triangular matrix by its conjugate transpose (U*U^H or L^H*L); used in potri
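
To make the operation concrete, here is a minimal reference sketch of what a lauum-style routine computes in the real, upper-triangular case: the upper triangle of a square matrix is overwritten in place with U*U^T. This is not MAGMA's implementation or API; the function name `lauum_upper` and the list-of-lists storage are assumptions chosen for illustration only. In LAPACK, potri applies this product to the inverted Cholesky factor: with A = U^H*U, the inverse is inv(U)*inv(U)^H.

```python
def lauum_upper(a):
    """Overwrite the upper triangle of the square matrix `a` with U * U^T.

    `a` is a list of lists (hypothetical storage for this sketch); U is the
    upper triangle of `a`. The strictly lower part is neither read nor written.
    """
    n = len(a)
    for i in range(n):
        for j in range(i, n):
            # (U U^T)[i][j] = sum over k >= j of U[i][k] * U[j][k], since
            # U[j][k] is zero for k < j. Entries a[i][k] and a[j][k] with
            # k >= j have not been overwritten yet, so the in-place update
            # is safe when i and j both run in ascending order.
            a[i][j] = sum(a[i][k] * a[j][k] for k in range(j, n))
    return a


u = [[1.0, 2.0],
     [0.0, 3.0]]
product = lauum_upper(u)
# Upper triangle now holds U * U^T: 5 and 6 in the first row, 9 in the second.
```

The lower-triangular variant computes L^T*L instead and follows the same pattern with the loop bounds mirrored.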