skip to main content
10.1145/3432261.3439863acmotherconferencesArticle/Chapter ViewAbstractPublication PageshpcasiaConference Proceedingsconference-collections
poster

GPU Optimizations for Atmospheric Chemical Kinetics

Published: 20 January 2021 Publication History

Abstract

We present a series of optimizations to alleviate stack memory overflow issues and improve overall performance of GPU computational kernels in atmospheric chemical kinetics model simulations. We use heap memory in numerical solvers for stiff ODEs, move chemical reaction constants and tracer concentration arrays from stack to global memory, use direct pointer indexing for array memory access, and use CUDA streams to overlap computation with memory transfer to the device. Overall, an order of magnitude reduction in GPU memory requirements is achieved, allowing for simultaneous offloading from multiple MPI processes per node and/or increasing the chemical mechanism complexity.

References

[1]
Michail Alvanos and Theodoros Christoudias. 2017. GPU-accelerated atmospheric chemical kinetics in the ECHAM/MESSy (EMAC) Earth system model (version 2.52). Geoscientific Model Development 10, 10 (2017), 3679.
[2]
Michail Alvanos and Theodoros Christoudias. 2019. Accelerating Atmospheric Chemical Kinetics for Climate Simulations. IEEE Transactions on Parallel and Distributed Systems 30, 11 (2019), 2396–2407.
[3]
Theodoros Christoudias and Michail Alvanos. 2016. Accelerated chemical kinetics in the EMAC chemistry-climate model. In 2016 International Conference on High Performance Computing & Simulation (HPCS). IEEE, 886–889.
[4]
Patrick Jöckel, Astrid Kerkweg, Andrea Pozzer, Rolf Sander, Holger Tost, Hella Riede, Andreas Baumgaertner, Sergey Gromov, and Bastian Kern. 2010. Development cycle 2 of the modular earth submodel system (MESSy2). Geoscientific Model Development 3 (2010), 717–752.
[5]
Jülich Supercomputing Centre. 2018. JURECA: Modular supercomputer at Jülich Supercomputing Centre. Journal of large-scale research facilities 4, A132 (2018). https://doi.org/10.17815/jlsrf-4-121-1
[6]
E Roeckner, R Brokopf, M Esch, MA Giorgetta, S Hagemann, L Kornblueh, E Manzini, U Schlese, and U Schulzweida. 2006. Sensitivity of simulated climate to horizontal and vertical resolution in the ECHAM5 atmosphere model. Journal of Climate 19, 16 (2006), 3771–3791.
[7]
R. Sander, A. Baumgaertner, D. Cabrera-Perez, F. Frank, S. Gromov, J.-U. Grooß, H. Harder, V. Huijnen, P. Jöckel, V. A. Karydis, K. E. Niemeyer, A. Pozzer, H. Riede, M. G. Schultz, D. Taraborrelli, and S. Tauer. 2019. The community atmospheric chemistry box model CAABA/MECCA-4.0. Geoscientific Model Development 12, 4 (2019), 1365–1385. https://doi.org/10.5194/gmd-12-1365-2019
[8]
Ben van Werkhoven, Jason Maassen, Frank J Seinstra, and Henri E Bal. 2014. Performance models for CPU-GPU data transfers. In 2014 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing. IEEE, 11–20.

Cited By

View all

Index Terms

  1. GPU Optimizations for Atmospheric Chemical Kinetics
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Other conferences
        HPCAsia '21: The International Conference on High Performance Computing in Asia-Pacific Region
        January 2021
        143 pages
        ISBN:9781450388429
        DOI:10.1145/3432261
        Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 20 January 2021

        Check for updates

        Author Tags

        1. CUDA
        2. Distributed computing
        3. GPU
        4. Memory test
        5. Parallel systems

        Qualifiers

        • Poster
        • Research
        • Refereed limited

        Conference

        HPC Asia 2021

        Acceptance Rates

        Overall Acceptance Rate 69 of 143 submissions, 48%

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)4
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 17 Jan 2025

        Other Metrics

        Citations

        Cited By

        View all

        View Options

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        HTML Format

        View this article in HTML Format.

        HTML Format

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media