Cited By
View all- Thomadakis PChrisochoides N(2024)Runtime support for CPU-GPU high-performance computing on distributed memory platformsFrontiers in High Performance Computing10.3389/fhpcp.2024.14170402Online publication date: 19-Jul-2024
- Khalilov MDi Girolamo SChrapek MNudelman RBloch GHoefler T(2024)Network-Offloaded Bandwidth-Optimal Broadcast and Allgather for Distributed AIProceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis10.1109/SC41406.2024.00109(1-17)Online publication date: 17-Nov-2024
- Feldmann AGolden CYang YEmer JSanchez D(2024)Azul: An Accelerator for Sparse Iterative Solvers Leveraging Distributed On-Chip Memory2024 57th IEEE/ACM International Symposium on Microarchitecture (MICRO)10.1109/MICRO61859.2024.00054(643-656)Online publication date: 2-Nov-2024
- Show More Cited By