A Fail-Safe NVRAM Based Mechanism for Efficient Creation and Recovery of Data Copies in Parallel MPI Applications
Abstrakt
The paper presents a fail-safe NVRAM based mechanism for creation and recovery of data copies during parallel MPI application runtime. Specifically, we target a cluster environment in which each node has an NVRAM installed in it. Our previously developed extension to the MPI I/O API can take advantage of NVRAM regions in order to provide an NVRAM based cache like mechanism to significantly speed up I/O operations and allow to preload large files and operate efficiently on them. In this work, we show how to provide fail safe data write to such files using NVRAM and how to recover from failures. This provides an efficient alternative to costly checkpointing provided an application can store its consistent state in a file.
Cytowania
-
2
CrossRef
-
0
Web of Science
-
7
Scopus
Autorzy (4)
Cytuj jako
Pełna treść
pełna treść publikacji nie jest dostępna w portalu
Słowa kluczowe
Informacje szczegółowe
- Kategoria:
- Aktywność konferencyjna
- Typ:
- materiały konferencyjne indeksowane w Web of Science
- Tytuł wydania:
- Information Systems Architecture and Technology: Proceedings of 37th International Conference on Information Systems Architecture and Technology – ISAT 2016 – Part II strony 137 - 147
- Język:
- angielski
- Rok wydania:
- 2016
- Opis bibliograficzny:
- Malinowski A., Czarnul P., Maciejewski, M., Skowron P..: A Fail-Safe NVRAM Based Mechanism for Efficient Creation and Recovery of Data Copies in Parallel MPI Applications, W: Information Systems Architecture and Technology: Proceedings of 37th International Conference on Information Systems Architecture and Technology – ISAT 2016 – Part II, 2016, Springer International Publishing,.
- DOI:
- Cyfrowy identyfikator dokumentu elektronicznego (otwiera się w nowej karcie) 10.1007/978-3-319-46586-9_11
- Weryfikacja:
- Politechnika Gdańska
wyświetlono 110 razy