ORCHESTRA: An Asynchronous Wait-Free Distributed GVT Algorithm

Tommaso Tocci, Alessandro Pellegrini, Francesco Quaglia, Josep Casanovas-García, and Toyotaro Suzumura


Published in: Proceedings of the 21st IEEE/ACM International Symposium on Distributed Simulation and Real Time Applications

Abstract:
Taking advantage of the computing capabilities offered by modern parallel and distributed architectures is fundamental to running large-scale simulation models based on the Parallel Discrete Event Simulation (PDES) paradigm. By relying on this computing organization, it is possible to effectively overcome both the power wall and the memory wall, which are core limiting factors in delivering high-performance simulations. This is even more the case when relying on the speculative Time Warp synchronization protocol, which can be particularly memory greedy. At the same time, some form of coordination, such as the computation of the Global Virtual Time (GVT), is required by Time Warp systems. These coordination points can easily become the bottleneck of large-scale simulations, hindering efficient exploitation of the computing power offered by large supercomputing facilities. This work presents ORCHESTRA, a coordination algorithm that is both wait-free and asynchronous. The nature of this algorithm allows any computing node to carry on simulation activities while the global agreement is reached, thus offering an effective building block to achieve scalable PDES. The general organization of ORCHESTRA could be adopted by different high-performance computing applications, thus paving the way to a more effective usage of modern computing infrastructures.

BibTeX Entry:

@inproceedings{Toc17,
author = {Tocci, Tommaso and Pellegrini, Alessandro and Quaglia, Francesco and Casanovas-García, Josep and Suzumura, Toyotaro},
booktitle = {Proceedings of the 21st IEEE/ACM International Symposium on Distributed Simulation and Real Time Applications},
title = {ORCHESTRA: An Asynchronous Wait-Free Distributed GVT Algorithm},
year = {2017},
month = oct,
pages = {51--58},
publisher = {IEEE Computer Society},
series = {DS-RT},
doi = {10.1109/DISTRA.2017.8167666},
location = {Rome, Italy}
}