Adaptive distributed replica-exchange simulations
- Owing to the loose coupling between replicas, the replica-exchange (RE) class of algorithms should be able to benefit greatly from using as many resources as available. However, the ability to effectively use multiple distributed resources to reduce the time to completion remains a challenge at many levels. Additionally, an implementation of a pleasingly distributed algorithm such as replica-exchange, which is independent of infrastructural details, does not exist. This paper proposes an extensible and scalable framework based on Simple API for Grid Applications that provides a general-purpose, opportunistic mechanism to effectively use multiple resources in an infrastructure-independent way. By analysing the requirements of the RE algorithm and the challenges of implementing it on real production systems, we propose a new abstraction (BIGJOB), which forms the basis of the adaptive redistribution and effective scheduling of replicas.
Verfasserangaben: | Andre Luckow, Shantenu Jha, Joohyun Kim, Andre Merzky, Bettina SchnorORCiDGND |
---|---|
URL: | http://rsta.royalsocietypublishing.org/ |
DOI: | https://doi.org/10.1098/rsta.2009.0051 |
ISSN: | 1364-503X |
Publikationstyp: | Wissenschaftlicher Artikel |
Sprache: | Englisch |
Jahr der Erstveröffentlichung: | 2009 |
Erscheinungsjahr: | 2009 |
Datum der Freischaltung: | 25.03.2017 |
Quelle: | Philosophical transactions of the Royal Society : A. - ISSN 1364-503X. - 367 (2009), 1897, S. 2595 - 2606 |
Organisationseinheiten: | Mathematisch-Naturwissenschaftliche Fakultät / Institut für Informatik und Computational Science |
Peer Review: | Referiert |
Name der Einrichtung zum Zeitpunkt der Publikation: | Mathematisch-Naturwissenschaftliche Fakultät / Institut für Informatik |