skip to main content
10.1145/2601381.2601386acmconferencesArticle/Chapter ViewAbstractPublication PagespadsConference Proceedingsconference-collections
research-article

Synchronisation for dynamic load balancing of decentralised conservative distributed simulation

Published: 18 May 2014 Publication History

Abstract

Synchronisation mechanisms are essential in distributed simulation. Some systems rely on central units to control the simulation but central units are known to be bottlenecks. If we want to avoid using a central unit to optimise the simulation speed, we lose the capacity to act on the simulation at a global scale. Being able to act on the entire simulation is an important feature which allows to dynamically load-balance a distributed simulation. While some local partitioning algorithms exist, their lack of global view reduces their efficiency. Running a global partitioning algorithm without central unit requires a synchronisation of all logical processes (LPs) at the same step. The first algorithm requires the knowledge of some topological properties of the network while the second algorithm works without any requirement. The algorithms are detailed and compared against each other. An evaluation shows the benefits of using a global dynamic load-balancing for distributed simulations.

References

[1]
K. Arvind. Probabilistic clock synchronization in distributed systems. Parallel and Distributed Systems, IEEE Transactions on, 5(5): 474--487, May 1994.
[2]
J. W. Barrus, R. C. Waters, and D. B. Anderson. Locales and Beacons: Efficient and Precise Support For Large Multi-User Virtual Environments. In VRAIS, pages 204--213, 1996.
[3]
D. W. Bauer Jr., C. D. Carothers, and A. Holder. Scalable time warp on blue gene supercomputers. In PADS, pages 35--44, 2009.
[4]
B. Bollobás. Modern Graph Theory. Springer, Heidelberg, corrected edition, 1998.
[5]
Q. Bragard, A. Ventresque, and L. Murphy. dsumo: Towards a distributed sumo. In First SUMO conference, Berlin, Germany, 2013.
[6]
S. Brin and L. Page. The anatomy of a large-scale hypertextual web search engine. Comput. Netw. ISDN Syst., 30(1-7):107--117, Apr. 1998.
[7]
R. Chen, X. Weng, B. He, and M. Yang. Large graph processing in the cloud. In SIGMOD, pages 1123--1126, 2010.
[8]
I. D. Couzin, J. Krause, N. R. Franks, and S. A. Levin. Effective leadership and decision-making in animal groups on the move. Nature, 433(7025):513--516, 2005.
[9]
K. D. Devine, E. G. Boman, R. T. Heaphy, B. A. Hendrickson, J. D. Teresco, J. Faik, J. E. Flaherty, and L. G. Gervasio. New challenges in dynamic load balancing. Applied Numerical Mathematics, 52(2{3):133--152, 2005.
[10]
A. Ferscha and S. K. Tripathi. Parallel and distributed simulation of discrete event systems. 1998.
[11]
R. M. Fujimoto. Parallel and distributed simulation. In WSC, pages 122--131, 1999.
[12]
B. Hendrickson and K. Devine. Dynamic load balancing in computational mechanics. Computer Methods in Applied Mechanics and Engineering, 184(2{4): 485--500, 2000.
[13]
S. Iqbal and G. F. Carey. Performance analysis of dynamic load balancing algorithms with variable number of processors. Journal of Parallel and Distributed Computing, 65(8):934--948, 2005.
[14]
D. B. Johnson. Efficient algorithms for shortest paths in sparse networks. J. ACM, 24(1):1--13, Jan. 1977.
[15]
N. T. Karonis, B. Toonen, and I. Foster. Mpich-g2: A grid-enabled implementation of the message passing interface. Journal of Parallel and Distributed Computing, 63(5):551--563, 2003.
[16]
G. Karypis and V. Kumar. Metis - unstructured graph partitioning and sparse matrix ordering system, version 2.0. Technical report, 1995.
[17]
G. Karypis and V. Kumar. Multilevel k-way partitioning scheme for irregular graphs. Journal of Parallel and Distributed Computing, 48(1):96--129, 1998.
[18]
B. Kirk, J. Peterson, R. Stogner, and G. Carey. libmesh: a c++ library for parallel adaptive mesh refinement/coarsening simulations. Engineering with Computers, 22(3-4):237--254, 2006.
[19]
S. Lin, X. Cheng, and J. Lv. Micro-synchronization in conservative parallel network simulation. In PADS, pages 195--202, 2008.
[20]
J. Liu and R. Rong. Hierarchical composite synchronization. In PADS, pages 3--12, July 2012.
[21]
Y. Low, J. Gonzalez, A. Kyrola, D. Bickson, C. Guestrin, and J. M. Hellerstein. Graphlab: A new parallel framework for machine learning. In Conference on Uncertainty in Artificial Intelligence (UAI), 2010.
[22]
E. Lusk and K. Yelick. Languages for high-productivity computing: the darpa hpcs language project. Parallel Processing Letters, 17(01), 2007.
[23]
M. R. Macedonia, M. J. Zyda, D. R. Pratt, D. P. Brutzman, and P. T. Barham. Exploiting Reality with Multicast Groups: A Network Architecture for Large-scale Virtual Environments. In IEEE Virtual Reality Annual International Symposium, 1995.
[24]
G. Malewicz, M. H. Austern, A. J. Bik, J. C. Dehnert, I. Horn, N. Leiser, and G. Czajkowski. Pregel: A system for large-scale graph processing. In SIGMOD, pages 135--146, 2010.
[25]
D. L. Mills. Internet time synchronization: the network time protocol. Communications, IEEE Transactions on, 39(10): 1482--1493, 1991.
[26]
A. Montresor and M. Jelasity. Peersim: A scalable p2p simulator. In Peer-to-Peer Computing, 2009. P2P '09. IEEE Ninth International Conference on, pages 99--100, 2009.
[27]
L. Nyland, J. Prins, R. Yun, J. Hermans, H.-C. Kum, and L. Wang. Modeling dynamic load balancing in molecular dynamics to achieve scalable parallel execution. In Solving Irregularly Structured Problems in Parallel, volume 1457, pages 356--365. 1998.
[28]
K. Perumalla. Parallel and distributed simulation: Traditional techniques and recent advances. In WSC, pages 84--95, 2006.
[29]
S. Plimpton, SteveAttaway, S. Attaway, B. Hendrickson, J. Swegle, C. Vaughan, and D. Gardner. Parallel transient dynamics simulations: Algorithms for contact detection and smoothed particle hydrodynamics. J. Par. Distrib. Computing, 50:50--1, 1998.
[30]
P. Ramanathan, K. Shin, and R. Butler. Fault-tolerant clock synchronization in distributed systems. Computer, 23(10):33--42, Oct 1990.
[31]
M. Snir, S. W. Otto, D. W. Walker, J. Dongarra, and S. Huss-Lederman. MPI: The Complete Reference. MIT Press, Cambridge, MA, USA, 1995.
[32]
A. Steed and R. Abou-Haidar. Partitioning Crowded Virtual Environments. In VRST, pages 7--14, 2003.
[33]
D. J. Van Hook, S. J. Rak, and J. O. Calvin. Approaches to Relevance Filtering. In Workshop on Standards for the Interoperability of Distributed Simulations, pages 26--30, 1994.
[34]
A. Ventresque, Q. Bragard, E. S. Liu, D. Nowak, L. Murphy, G. Theodoropoulos, and J. Q. Liu. Spartsim: A space partitioning guided by road network for distributed traffic simulations. In IEEE/ACM DS-RT, 2012.
[35]
X. Wang, S. Turner, M. Low, and B.-P. Gan. Optimistic synchronization in hla based distributed simulation. In PADS, pages 123--130, May 2004.
[36]
Y. Xu and G. Tan. An offline road network partitioning solution in distributed transportation simulation. In IEEE/ACM DR-ST, pages 210--217, Oct 2012.
[37]
T. Zou, G. Wang, M. V. Salles, D. Bindel, A. Demers, J. Gehrke, and W. White. Making time-stepped applications tick in the cloud. In 2Nd ACM Symposium on Cloud Computing, pages 20:1--20:14, 2011.

Cited By

View all

Index Terms

  1. Synchronisation for dynamic load balancing of decentralised conservative distributed simulation

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      SIGSIM PADS '14: Proceedings of the 2nd ACM SIGSIM Conference on Principles of Advanced Discrete Simulation
      May 2014
      222 pages
      ISBN:9781450327947
      DOI:10.1145/2601381
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 18 May 2014

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. distributed simulation
      2. dynamic load-balancing
      3. synchronisation

      Qualifiers

      • Research-article

      Funding Sources

      Conference

      SIGSIM-PADS '14
      Sponsor:

      Acceptance Rates

      SIGSIM PADS '14 Paper Acceptance Rate 19 of 33 submissions, 58%;
      Overall Acceptance Rate 398 of 779 submissions, 51%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)3
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 06 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media