Dynamic GPU power capping with online performance tracing for energy efficient GPU computing using DEPO tool

Abstract

GPU accelerators have become essential to recent advances in the computational power of high-performance computing (HPC) systems. Current HPC systems reach a power demand of approximately 20–30 megawatts, which increases CO2 emissions and energy costs and necessitates increasingly complex cooling systems. To address this challenge, new software power-control mechanisms could be employed. This paper introduces a new dynamic method of software power limiting on one of the latest NVIDIA GPUs, implemented in a software tool called the Dynamic Energy-Performance Optimiser (DEPO). DEPO minimizes the energy consumption of CUDA-based GPU workloads with respect to one of three given metrics: minimum energy (E), energy-delay product (EDP) or energy-delay sum (EDS). The tool gathers power measurements from the NVIDIA Management Library (NVML) and measures application progress at runtime by counting kernels through the CUDA Profiling Tools Interface (CUPTI). We evaluated DEPO on the NVIDIA RTX A4500 and A100 GPUs with machine learning workloads. Depending on the application (training of neural networks: Resnet152, Densenet161, VGG-19, or a GEMM benchmark), the E target metric yielded energy savings exceeding 22% on both the NVIDIA A100 and RTX A4500 GPUs, while the performance drop never exceeded 20%. Using one of the bi-objective metrics, EDP or EDS, allowed finding configurations that saved 15% or 18% of energy with only 8% performance loss. In most experiments the percentage performance penalty was lower than the energy savings. These results demonstrate the tool's potential for reducing energy consumption in HPC systems with GPU accelerators.
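The choice among the three target metrics can be illustrated with a small sketch. This is not the DEPO implementation: the `Sample` type, the `best_cap` helper and the numbers in `SAMPLES` are hypothetical and invented purely for illustration. The abstract does not define EDS, so the sketch assumes a weighted sum of energy and time, each normalised to the run at the highest power cap, with an assumed weight `k`.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Sample:
    """One hypothetical measurement taken at a fixed GPU power cap."""
    cap_watts: float  # power limit applied during the measurement
    energy_j: float   # energy used for a fixed amount of work, in joules
    time_s: float     # time taken for the same amount of work, in seconds


def energy(s: Sample, ref: Sample) -> float:
    """E metric: energy alone."""
    return s.energy_j


def edp(s: Sample, ref: Sample) -> float:
    """EDP metric: energy multiplied by execution time."""
    return s.energy_j * s.time_s


def eds(s: Sample, ref: Sample, k: float = 0.5) -> float:
    """EDS metric (assumed form): weighted sum of normalised energy and time."""
    return k * s.energy_j / ref.energy_j + (1 - k) * s.time_s / ref.time_s


def best_cap(samples, metric) -> Sample:
    """Return the sample that minimises the metric.

    The run at the highest power cap serves as the normalisation reference.
    """
    ref = max(samples, key=lambda s: s.cap_watts)
    return min(samples, key=lambda s: metric(s, ref))


# Made-up illustrative values, not results from the paper.
SAMPLES = [
    Sample(cap_watts=300, energy_j=1000, time_s=10.0),
    Sample(cap_watts=250, energy_j=900, time_s=10.4),
    Sample(cap_watts=200, energy_j=800, time_s=11.5),
    Sample(cap_watts=150, energy_j=780, time_s=14.0),
]

if __name__ == "__main__":
    for name, metric in (("E", energy), ("EDP", edp), ("EDS", eds)):
        print(name, best_cap(SAMPLES, metric).cap_watts)
```

With these invented numbers, the E metric picks the lowest cap, while EDP and EDS pick higher caps because they also penalise the longer run time. This mirrors the trade-off described in the abstract, where the bi-objective metrics give up some energy savings in exchange for a smaller performance loss.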

Citations

  • CrossRef: 9

  • Web of Science: 0

  • Scopus: 6

Full text

The full text of the publication is not available in the portal.

Details

Category:
Journal publication
Type:
Journal articles
Published in:
Future Generation Computer Systems-The International Journal of Grid Computing-Theory Methods and Applications, vol. 145, pages 396-414,
ISSN: 0167-739X
Language:
English
Publication year:
2023
Bibliographic description:
Krzywaniak A., Czarnul P., Proficz J.: Dynamic GPU power capping with online performance tracing for energy efficient GPU computing using DEPO tool // Future Generation Computer Systems-The International Journal of Grid Computing-Theory Methods and Applications, Vol. 145 (2023), pp. 396-414
DOI:
10.1016/j.future.2023.03.041
Funding sources:
  • Statutory activity/subsidy
Verified by:
Politechnika Gdańska
