Performance evaluation of unified memory and dynamic parallelism for selected parallel CUDA applications - Publication - MOST Wiedzy

Abstract

This paper evaluates the performance of two new CUDA mechanisms, unified memory and dynamic parallelism, in real parallel applications, comparing them with versions that use the standard CUDA API. To gain insight into the performance of these mechanisms, we implemented three applications whose control and data flow are typical of SPMD, geometric SPMD and divide-and-conquer schemes, and used them for tests and experiments. Specifically, the tested applications are verification of Goldbach’s conjecture, 2D heat transfer simulation and adaptive numerical integration. We experimented with various ways of deploying dynamic parallelism in an existing implementation and optimizing it further. We then compared the best dynamic parallelism and unified memory versions with their respective standard API counterparts. Dynamic parallelism improved performance for the heat simulation; for numerical integration it performed better than the static version but worse than the iterative one; and for Goldbach’s conjecture verification it gave worse results. In most cases, unified memory decreased performance. On the other hand, both mechanisms can contribute to simpler and more readable code. For dynamic parallelism, this holds for algorithms to which it can be naturally applied. Unified memory generally makes it easier for a programmer to enter the CUDA programming paradigm, as it resembles the traditional memory allocation and usage pattern.
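
To make the distinction between the divide-and-conquer and iterative versions of adaptive numerical integration more concrete, the following is a minimal CPU-only sketch in Python. It is not the authors' CUDA code: the function names are hypothetical, and the use of Simpson's rule with a halved tolerance per subinterval is an assumption, since the abstract does not say which quadrature rule or refinement criterion was used. The recursive form mirrors the control flow in which a GPU kernel could launch child kernels through dynamic parallelism, while the iterative form keeps an explicit list of pending subintervals.

```python
import math
from typing import Callable, List, Tuple


def simpson(f: Callable[[float], float], a: float, b: float) -> float:
    """Simpson's rule estimate of the integral of f over [a, b]."""
    return (b - a) / 6.0 * (f(a) + 4.0 * f((a + b) / 2.0) + f(b))


def integrate_recursive(f: Callable[[float], float], a: float, b: float,
                        tol: float = 1e-8) -> float:
    """Divide-and-conquer form: a subinterval that is not accurate
    enough splits itself in two and recurses on both halves.

    On a GPU, the two recursive calls mark the place where a kernel
    could launch child kernels (assumed mapping, for illustration).
    """
    mid = (a + b) / 2.0
    whole = simpson(f, a, b)
    halves = simpson(f, a, mid) + simpson(f, mid, b)
    # Standard error estimate for Simpson's rule after one bisection.
    if abs(halves - whole) <= 15.0 * tol:
        return halves
    return (integrate_recursive(f, a, mid, tol / 2.0)
            + integrate_recursive(f, mid, b, tol / 2.0))


def integrate_iterative(f: Callable[[float], float], a: float, b: float,
                        tol: float = 1e-8) -> float:
    """Iterative form: the same refinement rule, driven by an explicit
    list of pending subintervals instead of recursion."""
    total = 0.0
    pending: List[Tuple[float, float, float]] = [(a, b, tol)]
    while pending:
        lo, hi, t = pending.pop()
        mid = (lo + hi) / 2.0
        whole = simpson(f, lo, hi)
        halves = simpson(f, lo, mid) + simpson(f, mid, hi)
        if abs(halves - whole) <= 15.0 * t:
            total += halves
        else:
            pending.append((lo, mid, t / 2.0))
            pending.append((mid, hi, t / 2.0))
    return total


if __name__ == "__main__":
    print(integrate_recursive(math.sin, 0.0, math.pi))
    print(integrate_iterative(math.sin, 0.0, math.pi))
```

Both functions visit the same subintervals and differ only in how the pending work is tracked, which is the aspect the comparison in the abstract turns on.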

Citations

  • 15 CrossRef

  • 0 Web of Science

  • 18 Scopus

Full text

Publication version
Accepted or Published Version
License
Creative Commons: CC-BY

Details

Category:
Journal publication
Type:
article in a journal listed in JCR
Published in:
JOURNAL OF SUPERCOMPUTING no. 72, pages 5378 - 5401,
ISSN:
0920-8542
Language:
English
Publication year:
2017
Bibliographic description:
Jarząbek Ł., Czarnul P.: Performance evaluation of unified memory and dynamic parallelism for selected parallel CUDA applications// JOURNAL OF SUPERCOMPUTING. -Vol. 72, no. 12 (2017), pp. 5378-5401
DOI:
10.1007/s11227-017-2091-x
Verified by:
Politechnika Gdańska
