Cited By
View all- Abdelfattah ACosta TDongarra JGates MHaidar AHammarling SHigham NKurzak JLuszczek PTomov SZounon M(2021)A Set of Batched Basic Linear Algebra Subprograms and LAPACK RoutinesACM Transactions on Mathematical Software10.1145/343192147:3(1-23)Online publication date: 26-Jun-2021
- Charara AKeyes DLtaief H(2019)Batched Triangular Dense Linear Algebra Kernels for Very Small Matrix Sizes on GPUsACM Transactions on Mathematical Software10.1145/326710145:2(1-28)Online publication date: 3-May-2019
- Haidar AAbdelfattah AZounon MTomov SDongarra J(2018)A Guide for Achieving High Performance with Very Small Matrices on GPU: A Case Study of Batched LU and Cholesky FactorizationsIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2017.278392929:5(973-984)Online publication date: 1-May-2018
- Show More Cited By