skip to main content
10.1145/3615318.3615323acmotherconferencesArticle/Chapter ViewAbstractPublication PageseurompiConference Proceedingsconference-collections
research-article
Open access

Library Development with MPI: Attributes, Request Objects, Group Communicator Creation, Local Reductions, and Datatypes

Published: 21 September 2023 Publication History

Abstract

A major design objective of MPI is to enable support for the construction of safe parallel libraries that can be used and mixed freely in complex applications. In this respect, MPI has been extremely successful; but may nevertheless lack elementary supporting functionality for some situations, and may have made design choices that are difficult to accommodate in certain libraries. We discuss several cases of library construction requiring different kinds of supporting MPI functionality, and propose concrete improvements for library implementations and future MPI versions to alleviate the problems that were encountered. Specifically, we pinpoint (performance) issues with MPI object attributes, caching and lookup, request objects, partly collective and non-blocking communicator creation, process local reductions, type correct process local copying, and user-defined datatypes.

References

[1]
Enes Bajrović and Jesper Larsson Träff. 2011. Using MPI derived datatypes in numerical libraries. In Recent Advances in Message Passing Interface. 18th European MPI Users’ Group Meeting(Lecture Notes in Computer Science, Vol. 6960). Springer, 29–38.
[2]
Pavan Balaji, Darius Buntinas, Satish Balay, Barry F. Smith, Rajeev Thakur, and William Gropp. 2007. Nonuniformly Communicating Noncontiguous Data: A Case Study with PETSc and MPI. In 21th International Parallel and Distributed Processing Symposium (IPDPS). 1–10.
[3]
Pavan Balaji, Anthony Chan, William Gropp, Rajeev Thakur, and Ewing L. Lusk. 2010. The Importance of Non-Data-Communication Overheads in MPI. International Journal on High Performance Computing Applications 24, 1 (2010), 5–15.
[4]
Alexandra Carpen-Amarie, Sascha Hunold, and Jesper Larsson Träff. 2017. On Expected and Observed Communication Performance with MPI Derived Datatypes. Parallel Computing 69 (2017), 98–117.
[5]
Jaeyoung Choi, Jack Dongarra, Susan Ostrouchov, Antoine Petitet, David W. Walker, and R. Clinton Whaley. 1996. Design and Implementation of the ScaLAPACK LU, QR, and Cholesky Factorization Routines. Scientific Programming 5, 3 (1996), 173–184.
[6]
James Dinan, David Goodell, William Gropp, Rajeev Thakur, and Pavan Balaji. [n. d.]. Efficient Multithreaded Context ID Allocation in MPI. In Recent Advances in the Message Passing Interface. 19th European MPI Users’ Group Meeting (EuroMPI)(Lecture Notes in Computer Science, Vol. 7490). 57–66.
[7]
Markus Geimer, Marc-André Hermanns, Christian Siebert, Felix Wolf, and Brian J. N. Wylie. 2011. Scaling Performance Tool MPI Communicator Management. In Recent Advances in the Message Passing Interface. 18th European MPI Users’ Group Meeting (EuroMPI)(Lecture Notes in Computer Science, Vol. 6960). 178–187.
[8]
William D. Gropp, Torsten Hoefler, Rajeev Thakur, and Jesper Larsson Träff. 2011. Performance expectations and guidelines for MPI derived datatypes: a first analysis. In Recent Advances in Message Passing Interface. 18th European MPI Users’ Group Meeting(Lecture Notes in Computer Science, Vol. 6960). Springer, 150–159.
[9]
Torsten Hoefler, Prabhanjan Kambadur, Richard L. Graham, Galen M. Shipman, and Andrew Lumsdaine. 2007. A Case for Standard Non-blocking Collective Operations. In Recent Advances in Parallel Virtual Machine and Message Passing Interface. 14th European PVM/MPI User’s Group Meeting(Lecture Notes in Computer Science, Vol. 4757). Springer, 125–134.
[10]
Torsten Hoefler, Christian Siebert, and Andrew Lumsdaine. 2009. Group Operation Assembly Language - A Flexible Way to Express Collective Communication. In International Conference on Parallel Processing (ICPP). 574–581.
[11]
Torsten Hoefler and Marc Snir. 2011. Writing Parallel Libraries with MPI – Common Practice, Issues, and Extensions. In Recent Advances in the Message Passing Interface. 18th European MPI Users’ Group Meeting (EuroMPI). 345–355.
[12]
Sascha Hunold, Alexandra Carpen-Amarie, Felix Donatus Lübbe, and Jesper Larsson Träff. 2016. Automatic Verification of Self-Consistent MPI Performance Guidelines. In Euro-Par Parallel Processing(Lecture Notes in Computer Science, Vol. 9833). 433–446.
[13]
Faisal Ghias Mir and Jesper Larsson Träff. 2008. Constructing MPI input-output Datatypes for efficient Transpacking. In Recent Advances in Parallel Virtual Machine and Message Passing Interface. 15th European PVM/MPI Users’ Group Meeting(Lecture Notes in Computer Science, Vol. 5205). Springer, 141–150.
[14]
Faisal Ghias Mir and Jesper Larsson Träff. 2009. Exploiting efficient Transpacking for One-sided Communication and MPI-IO. In Recent Advances in Parallel Virtual Machine and Message Passing Interface. 16th European PVM/MPI Users’ Group Meeting(Lecture Notes in Computer Science, Vol. 5759). Springer, 154–163.
[15]
MPI Forum. 2021. MPI: A Message-Passing Interface Standard. Version 4.0. www.mpi-forum.org.
[16]
Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, Alban Desmaison, Andreas Kopf, Edward Yang, Zachary DeVito, Martin Raison, Alykhan Tejani, Sasank Chilamkurthy, Benoit Steiner, Lu Fang, Junjie Bai, and Soumith Chintala. 2019. PyTorch: An Imperative Style, High-Performance Deep Learning Library. In Advances in Neural Information Processing Systems 32. 8024–8035.
[17]
Jack Poulson, Bryan Marker, Robert van de Geijn, Jeff R. Hammond, and Nichols A. Romero. 2013. Elemental: A New Framework for Distributed Memory Dense Matrix Computations. ACM Trans. Math. Software 39, 2 (2013).
[18]
Rajeev Thakur, William Gropp, and Ewing Lusk. 1999. On Implementing MPI-IO Portably and with High Performance. In 6th Workshop on I/O in Parallel and Distributed Systems (IOPADS). 23–32.
[19]
Jesper Larsson Träff. 2016. A Library for Advanced Datatype Programming. In 23rd European MPI Users’ Group Meeting (EuroMPI). ACM, 98–107.
[20]
Jesper Larsson Träff. 2020. Signature datatypes for type correct collective operations, revisited. In 27th European MPI Users’ Group Meeting (EuroMPI/USA). ACM, 1:1–1:8.
[21]
Jesper Larsson Träff, William D. Gropp, and Rajeev Thakur. 2010. Self-consistent MPI Performance Guidelines. IEEE Transactions on Parallel and Distributed Systems 21, 5 (2010), 698–709.
[22]
Jesper Larsson Träff and Sascha Hunold. 2020. Decomposing MPI Collectives for Exploiting Multi-lane Communication. In IEEE International Conference on Cluster Computing (CLUSTER). IEEE Computer Society, 270–280.
[23]
Jesper Larsson Träff, Sascha Hunold, Nikolaus Manes Funk, and Ioannis Vardas. 2023. Uniform Algorithms for Reduce-scatter and (most) other Collectives for MPI. In IEEE International Conference on Cluster Computing (CLUSTER). IEEE Computer Society.
[24]
Jesper Larsson Träff, Sascha Hunold, Guillaume Mercier, and Daniel J. Holmes. 2021. MPI collective communication through a single set of interfaces: A case for orthogonality. Parallel Computing 107 (2021).
[25]
Ioannis Vardas, Sascha Hunold, Jordy I. Ajanohoun, and Jesper Larsson Träff. 2022. mpisee: MPI Profiling for Communication and Communicator Structure. In 27th International Workshop on High-level Parallel Programming Models and Supportive Environments (HIPS), 36th International Parallel and Distributed Processing Symposium (IPDPS). IEEE Computer Society, 520–529.

Cited By

View all

Index Terms

  1. Library Development with MPI: Attributes, Request Objects, Group Communicator Creation, Local Reductions, and Datatypes

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      EuroMPI '23: Proceedings of the 30th European MPI Users' Group Meeting
      September 2023
      123 pages
      ISBN:9798400709135
      DOI:10.1145/3615318
      This work is licensed under a Creative Commons Attribution International 4.0 License.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 21 September 2023

      Check for updates

      Author Tags

      1. MPI
      2. attributes
      3. collective operations
      4. derived datatypes
      5. library building
      6. request objects

      Qualifiers

      • Research-article
      • Research
      • Refereed limited

      Funding Sources

      Conference

      EUROMPI '23
      EUROMPI '23: 30th European MPI Users' Group Meeting
      September 11 - 13, 2023
      Bristol, United Kingdom

      Acceptance Rates

      Overall Acceptance Rate 66 of 139 submissions, 47%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)147
      • Downloads (Last 6 weeks)12
      Reflects downloads up to 04 Jan 2025

      Other Metrics

      Citations

      Cited By

      View all

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format.

      HTML Format

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media