- research-article, November 2024
Globus service enhancements for exascale applications and facilities
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 6, Pages 658–670, https://doi.org/10.1177/10943420241281744
Many extreme-scale applications require the movement of large quantities of data to, from, and among leadership computing facilities, as well as other scientific facilities and the home institutions of facility users. These applications, particularly ...
- research-article, November 2024
Refining HPCToolkit for application performance analysis at exascale
- Laksono Adhianto,
- Jonathon Anderson,
- Robert Matthew Barnett,
- Dragana Grbic,
- Vladimir Indic,
- Mark Krentel,
- Yumeng Liu,
- Srđan Milaković,
- Wileam Phan,
- John Mellor-Crummey,
- Michael Heroux
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 6, Pages 612–632, https://doi.org/10.1177/10943420241277839
As part of the US Department of Energy’s Exascale Computing Project (ECP), Rice University has been refining its HPCToolkit performance tools to better support measurement and analysis of applications executing on exascale supercomputers. To efficiently ...
- research-article, November 2024
AMReX and pyAMReX: Looking beyond the exascale computing project
- Andrew Myers,
- Weiqun Zhang,
- Ann Almgren,
- Thierry Antoun,
- John Bell,
- Axel Huebl,
- Alexander Sinn,
- Michael Heroux
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 6, Pages 599–611, https://doi.org/10.1177/10943420241271017
AMReX is a software framework for the development of block-structured mesh applications with adaptive mesh refinement (AMR). AMReX was initially developed and supported by the AMReX Co-Design Center as part of the U.S. DOE Exascale Computing Project (ECP)...
- research-article, November 2024
Ginkgo - A math library designed to accelerate Exascale Computing Project science applications
- Terry Cojean,
- Pratik Nayak,
- Tobias Ribizel,
- Natalie Beams,
- Yu-Hsiang Mike Tsai,
- Marcel Koch,
- Fritz Göbel,
- Thomas Grützmacher,
- Hartwig Anzt,
- Michael Heroux
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 6, Pages 568–584, https://doi.org/10.1177/10943420241268323
Large-scale simulations require efficient computation across the entire computing hierarchy. A challenge of the Exascale Computing Project (ECP) was to reconcile highly heterogeneous hardware with the myriad of applications that were required to run on ...
- research-article, November 2024
Bricks: A high-performance portability layer for computations on block-structured grids
- Mahesh Lakshminarasimhan,
- Oscar Antepara,
- Tuowen Zhao,
- Benjamin Sepanski,
- Protonu Basu,
- Hans Johansen,
- Mary Hall,
- Samuel Williams,
- Michael Heroux
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 6, Pages 549–567, https://doi.org/10.1177/10943420241268288
From partial differential equations to the convolutional neural networks in deep learning, to matrix operations in dense linear algebra, computations on structured grids dominate high-performance computing and machine learning. The performance of such ...
- research-article, October 2024
ECP libraries and tools: An overview
- Michael A Heroux,
- Lois Curfman McInnes,
- James Ahrens,
- Todd Gamblin,
- Timothy C Germann,
- Xiaoye Sherry Li,
- Kathryn Mohror,
- Todd Munson,
- Sameer Shende,
- Rajeev Thakur,
- Jeffrey Vetter,
- James Willenbring,
- Michael Heroux
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 5, Pages 381–408, https://doi.org/10.1177/10943420241271005
The Exascale Computing Project (ECP) Software Technology and Co-Design teams addressed the growing complexities in high-performance computing (HPC) by developing scalable software libraries and tools that leverage exascale system capabilities. As we ...
- research-article, October 2024
Visualization at exascale: Making it all work with VTK-m
- Kenneth Moreland,
- Tushar M Athawale,
- Vicente Bolea,
- Mark Bolstad,
- Eric Brugger,
- Hank Childs,
- Axel Huebl,
- Li-Ta Lo,
- Berk Geveci,
- Nicole Marsaglia,
- Sujin Philip,
- David Pugmire,
- Silvio Rizzi,
- Zhe Wang,
- Abhishek Yenpure,
- Michael Heroux
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 5, Pages 508–526, https://doi.org/10.1177/10943420241270969
The VTK-m software library enables scientific visualization on exascale-class supercomputers. Exascale machines are particularly challenging for software development in part because they use GPU accelerators to provide the vast majority of their ...
- research-article, October 2024
Taking the MPI standard and the open MPI library to exascale
- David E Bernholdt,
- George Bosilca,
- Aurelien Bouteiller,
- Ron Brightwell,
- Jan Ciesko,
- Matthew GF Dosanjh,
- Giorgis Georgakoudis,
- Ignacio Laguna,
- Scott Levy,
- Thomas Naughton,
- Stephen L Olivier,
- Howard P Pritchard,
- Whit Schonbein,
- Joseph Schuchart,
- Amir Shehata,
- Michael Heroux
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 5, Pages 491–507, https://doi.org/10.1177/10943420241265936
The Open MPI for Exascale (OMPI-X) project was one of two in the Exascale Computing Project (ECP) focused on advancing the MPI ecosystem. The OMPI-X team worked with other MPI Forum members to champion several important features for inclusion in the MPI ...
- research-article, October 2024
Designing and prototyping extensions to the Message Passing Interface in MPICH
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 5, Pages 527–545, https://doi.org/10.1177/10943420241263544
As HPC system architectures and the applications running on them continue to evolve, the MPI standard itself must evolve. The trend in current and future HPC systems toward powerful nodes with multiple CPU cores and multiple GPU accelerators makes ...
- research-article, October 2024
Enhancing Kokkos with OpenACC
- Pedro Valero-Lara,
- Seyong Lee,
- Marc Gonzalez-Tallada,
- Joel Denny,
- Keita Teranishi,
- Jeffrey S. Vetter,
- Michael Heroux
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 5, Pages 409–426, https://doi.org/10.1177/10943420241261987
C++ template metaprogramming has emerged as a prominent approach for achieving performance portability in heterogeneous computing. Kokkos represents a notable paradigm in this domain, offering programmers a suite of high-level abstractions for generic ...
- research-article, October 2024
High-performance finite elements with MFEM
- Julian Andrej,
- Nabil Atallah,
- Jan-Phillip Bäcker,
- Jean-Sylvain Camier,
- Dylan Copeland,
- Veselin Dobrev,
- Yohann Dudouit,
- Tobias Duswald,
- Brendan Keith,
- Dohyun Kim,
- Tzanio Kolev,
- Boyan Lazarov,
- Ketan Mittal,
- Will Pazner,
- Socratis Petrides,
- Syun’ichi Shiraiwa,
- Mark Stowell,
- Vladimir Tomov,
- Michael Heroux
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 5, Pages 447–467, https://doi.org/10.1177/10943420241261981
The MFEM (Modular Finite Element Methods) library is a high-performance C++ library for finite element discretizations. MFEM supports numerous types of finite element methods and is the discretization engine powering many computational physics and ...
- research-article, October 2024
Clacc: OpenACC for C/C++ in Clang
- Joel E Denny,
- Seyong Lee,
- Pedro Valero-Lara,
- Marc Gonzalez-Tallada,
- Keita Teranishi,
- Jeffrey S Vetter,
- Michael Heroux
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 5, Pages 427–446, https://doi.org/10.1177/10943420241261976
The Clacc project has developed OpenACC compiler, runtime, and profiling interface support for C/C++ by extending Clang and LLVM. A key Clacc design feature is that it translates OpenACC to OpenMP to leverage the OpenMP offloading support that is ...
- research-article, October 2024
MAGMA: Enabling exascale performance with accelerated BLAS and LAPACK for diverse GPU architectures
- Ahmad Abdelfattah,
- Natalie Beams,
- Robert Carson,
- Pieter Ghysels,
- Tzanio Kolev,
- Thomas Stitt,
- Arturo Vargas,
- Stanimire Tomov,
- Jack Dongarra,
- Michael Heroux
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 5, Pages 468–490, https://doi.org/10.1177/10943420241261960
MAGMA (Matrix Algebra for GPU and Multicore Architectures) is a pivotal open-source library in the landscape of GPU-enabled dense and sparse linear algebra computations. With a repertoire of approximately 750 numerical routines across four precisions, ...
- research-article, January 2024
General framework for re-assuring numerical reliability in parallel Krylov solvers: A case of bi-conjugate gradient stabilized methods
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 1, Pages 17–33, https://doi.org/10.1177/10943420231207642
Parallel implementations of Krylov subspace methods often help to accelerate the procedure of finding an approximate solution of a linear system. However, such parallelization coupled with asynchronous and out-of-order execution often makes more visible ...
- research-article, January 2024
Parallel multithreaded deduplication of data sequences in nuclear structure calculations
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 1, Pages 5–16, https://doi.org/10.1177/10943420231183697
High performance computing (HPC) applications that work with redundant sequences of data can benefit from their deduplication. We study this problem on the symmetry-adapted no-core shell model (SA-NCSM), where redundant sequences of different kinds ...
- research-article, July 2022
An elastic framework for ensemble-based large-scale data assimilation
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 36, Issue 4, Pages 543–563, https://doi.org/10.1177/10943420221110507
Prediction of chaotic systems relies on a floating fusion of sensor data (observations) with a numerical model to decide on a good system trajectory and to compensate non-linear feedback effects. Ensemble-based data assimilation (DA) is a major method ...
- research-article, July 2022
Matrix-free approaches for GPU acceleration of a high-order finite element hydrodynamics application using MFEM, Umpire, and RAJA
- Arturo Vargas,
- Thomas M Stitt,
- Kenneth Weiss,
- Vladimir Z Tomov,
- Jean-Sylvain Camier,
- Tzanio Kolev,
- Robert N Rieben
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 36, Issue 4, Pages 492–509, https://doi.org/10.1177/10943420221100262
With the introduction of advanced heterogeneous computing architectures based on GPU accelerators, large-scale production codes have had to rethink their numerical algorithms and incorporate new programming models and memory management strategies in ...
- research-article, May 2022
AI4IO: A suite of AI-based tools for IO-aware scheduling
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 36, Issue 3, Pages 370–387, https://doi.org/10.1177/10943420221079765
Traditional workload managers do not have the capacity to consider how IO contention can increase job runtime and even cause entire resource allocations to be wasted. Whether from bursts of IO demand or parallel file systems (PFS) performance degradation,...
- research-article, May 2022
Efficient high-precision integer multiplication on the GPU
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 36, Issue 3, Pages 356–369, https://doi.org/10.1177/10943420221077964
The multiplication of large integers, which has many applications in computer science, is an operation that can be expressed as a polynomial multiplication followed by a carry normalization. This work develops two approaches for efficient polynomial ...