Software and its engineering

Applied Filters

People

Publications

Publication Date

Searched The ACM Guide to Computing Literature (3,802,449 records)|Limit your search to The ACM Full-Text Collection (772,004 records)

Showing 1 - 20of487 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

research-article
November 2024
Globus service enhancements for exascale applications and facilities
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 6Pages 658–670https://doi.org/10.1177/10943420241281744

Many extreme-scale applications require the movement of large quantities of data to, from, and among leadership computing facilities, as well as other scientific facilities and the home institutions of facility users. These applications, particularly ...
0
Metrics
Total Citations0
research-article
Open Access
November 2024
Refining HPCToolkit for application performance analysis at exascale
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 6Pages 612–632https://doi.org/10.1177/10943420241277839

As part of the US Department of Energy’s Exascale Computing Project (ECP), Rice University has been refining its HPCToolkit performance tools to better support measurement and analysis of applications executing on exascale supercomputers. To efficiently ...
0
Metrics
Total Citations0
research-article
November 2024
AMReX and pyAMReX: Looking beyond the exascale computing project
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 6Pages 599–611https://doi.org/10.1177/10943420241271017

AMReX is a software framework for the development of block-structured mesh applications with adaptive mesh refinement (AMR). AMReX was initially developed and supported by the AMReX Co-Design Center as part of the U.S. DOE Exascale Computing Project (ECP)...
0
Metrics
Total Citations0
research-article
Open Access
November 2024
Ginkgo - A math library designed to accelerate Exascale Computing Project science applications
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 6Pages 568–584https://doi.org/10.1177/10943420241268323

Large-scale simulations require efficient computation across the entire computing hierarchy. A challenge of the Exascale Computing Project (ECP) was to reconcile highly heterogeneous hardware with the myriad of applications that were required to run on ...
0
Metrics
Total Citations0
research-article
November 2024
Bricks: A high-performance portability layer for computations on block-structured grids
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 6Pages 549–567https://doi.org/10.1177/10943420241268288

From partial differential equations to the convolutional neural networks in deep learning, to matrix operations in dense linear algebra, computations on structured grids dominate high-performance computing and machine learning. The performance of such ...
0
Metrics
Total Citations0
research-article
October 2024
ECP libraries and tools: An overview
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 5Pages 381–408https://doi.org/10.1177/10943420241271005

The Exascale Computing Project (ECP) Software Technology and Co-Design teams addressed the growing complexities in high-performance computing (HPC) by developing scalable software libraries and tools that leverage exascale system capabilities. As we ...
0
Metrics
Total Citations0
research-article
October 2024
Visualization at exascale: Making it all work with VTK-m
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 5Pages 508–526https://doi.org/10.1177/10943420241270969

The VTK-m software library enables scientific visualization on exascale-class supercomputers. Exascale machines are particularly challenging for software development in part because they use GPU accelerators to provide the vast majority of their ...
0
Metrics
Total Citations0
research-article
October 2024
Taking the MPI standard and the open MPI library to exascale
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 5Pages 491–507https://doi.org/10.1177/10943420241265936

The Open MPI for Exascale (OMPI-X) project was one of two in the Exascale Computing Project (ECP) focused on advancing the MPI ecosystem. The OMPI-X team worked with other MPI Forum members to champion several important features for inclusion in the MPI ...
0
Metrics
Total Citations0
research-article
October 2024
Designing and prototyping extensions to the Message Passing Interface in MPICH
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 5Pages 527–545https://doi.org/10.1177/10943420241263544

As HPC system architectures and the applications running on them continue to evolve, the MPI standard itself must evolve. The trend in current and future HPC systems toward powerful nodes with multiple CPU cores and multiple GPU accelerators makes ...
0
Metrics
Total Citations0
research-article
October 2024
Enhancing Kokkos with OpenACC
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 5Pages 409–426https://doi.org/10.1177/10943420241261987

C++ template metaprogramming has emerged as a prominent approach for achieving performance portability in heterogeneous computing. Kokkos represents a notable paradigm in this domain, offering programmers a suite of high-level abstractions for generic ...
0
Metrics
Total Citations0
research-article
October 2024
High-performance finite elements with MFEM
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 5Pages 447–467https://doi.org/10.1177/10943420241261981

The MFEM (Modular Finite Element Methods) library is a high-performance C++ library for finite element discretizations. MFEM supports numerous types of finite element methods and is the discretization engine powering many computational physics and ...
0
Metrics
Total Citations0
research-article
October 2024
Clacc: OpenACC for C/C++ in Clang
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 5Pages 427–446https://doi.org/10.1177/10943420241261976

The Clacc project has developed OpenACC compiler, runtime, and profiling interface support for C/C++ by extending Clang and LLVM. A key Clacc design feature is that it translates OpenACC to OpenMP to leverage the OpenMP offloading support that is ...
0
Metrics
Total Citations0
research-article
October 2024
MAGMA: Enabling exascale performance with accelerated BLAS and LAPACK for diverse GPU architectures
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 5Pages 468–490https://doi.org/10.1177/10943420241261960

MAGMA (Matrix Algebra for GPU and Multicore Architectures) is a pivotal open-source library in the landscape of GPU-enabled dense and sparse linear algebra computations. With a repertoire of approximately 750 numerical routines across four precisions, ...
0
Metrics
Total Citations0
editorial
January 2024
Guest Editor’s note: Special issue on challenges and solutions for porting applications to next-generation high performance computing systems
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 1Pages 3–4https://doi.org/10.1177/10943420231224509
0
Metrics
Total Citations0
research-article
Open Access
January 2024
General framework for re-assuring numerical reliability in parallel Krylov solvers: A case of bi-conjugate gradient stabilized methods
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 1Pages 17–33https://doi.org/10.1177/10943420231207642

Parallel implementations of Krylov subspace methods often help to accelerate the procedure of finding an approximate solution of a linear system. However, such parallelization coupled with asynchronous and out-of-order execution often makes more visible ...
1
Metrics
Total Citations1
research-article
January 2024
Parallel multithreaded deduplication of data sequences in nuclear structure calculations
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 38, Issue 1Pages 5–16https://doi.org/10.1177/10943420231183697

High performance computing (HPC) applications that work with redundant sequences of data can benefit from their deduplication. We study this problem on the symmetry-adapted no-core shell model (SA-NCSM), where redundant sequences of different kinds ...
0
Metrics
Total Citations0
research-article
July 2022
An elastic framework for ensemble-based large-scale data assimilation
- Sebastian Friedemann,
- Bruno Raffin
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 36, Issue 4Pages 543–563https://doi.org/10.1177/10943420221110507

Prediction of chaotic systems relies on a floating fusion of sensor data (observations) with a numerical model to decide on a good system trajectory and to compensate non-linear feedback effects. Ensemble-based data assimilation (DA) is a major method ...
0
Metrics
Total Citations0
research-article
July 2022
Matrix-free approaches for GPU acceleration of a high-order finite element hydrodynamics application using MFEM, Umpire, and RAJA
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 36, Issue 4Pages 492–509https://doi.org/10.1177/10943420221100262

With the introduction of advanced heterogeneous computing architectures based on GPU accelerators, large-scale production codes have had to rethink their numerical algorithms and incorporate new programming models and memory management strategies in ...
1
Metrics
Total Citations1
research-article
May 2022
AI4IO: A suite of AI-based tools for IO-aware scheduling
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 36, Issue 3Pages 370–387https://doi.org/10.1177/10943420221079765

Traditional workload managers do not have the capacity to consider how IO contention can increase job runtime and even cause entire resource allocations to be wasted. Whether from bursts of IO demand or parallel file systems (PFS) performance degradation,...
1
Metrics
Total Citations1
research-article
May 2022
Efficient high-precision integer multiplication on the GPU
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 36, Issue 3Pages 356–369https://doi.org/10.1177/10943420221077964

The multiplication of large integers, which has many applications in computer science, is an operation that can be expressed as a polynomial multiplication followed by a carry normalization. This work develops two approaches for efficient polynomial ...
0
Metrics
Total Citations0

Applied Filters

People

Names

Institutions

Authors

Reviewers

Publications

All Publications

Content Type

Publisher

Publication Date

Globus service enhancements for exascale applications and facilities

Refining HPCToolkit for application performance analysis at exascale

AMReX and pyAMReX: Looking beyond the exascale computing project

Ginkgo - A math library designed to accelerate Exascale Computing Project science applications

Bricks: A high-performance portability layer for computations on block-structured grids

ECP libraries and tools: An overview

Visualization at exascale: Making it all work with VTK-m

Taking the MPI standard and the open MPI library to exascale

Designing and prototyping extensions to the Message Passing Interface in MPICH

Enhancing Kokkos with OpenACC

High-performance finite elements with MFEM

Clacc: OpenACC for C/C++ in Clang

MAGMA: Enabling exascale performance with accelerated BLAS and LAPACK for diverse GPU architectures

Guest Editor’s note: Special issue on challenges and solutions for porting applications to next-generation high performance computing systems

General framework for re-assuring numerical reliability in parallel Krylov solvers: A case of bi-conjugate gradient stabilized methods

Parallel multithreaded deduplication of data sequences in nuclear structure calculations

An elastic framework for ensemble-based large-scale data assimilation

Matrix-free approaches for GPU acceleration of a high-order finite element hydrodynamics application using MFEM, Umpire, and RAJA

AI4IO: A suite of AI-based tools for IO-aware scheduling

Efficient high-precision integer multiplication on the GPU