Benchmarking overlapping communication and computations with multiple streams for modern GPUs - Publication - Bridge of Knowledge

Search

Benchmarking overlapping communication and computations with multiple streams for modern GPUs

Abstract

The paper presents benchmarking a multi-stream application processing a set of input data arrays. Tests have been performed and execution times measured for various numbers of streams and various compute intensities measured as the ratio of kernel compute time and data transfer time. As such, the application and benchmarking is representative of frequently used operations such as vector weighted sum, matrix multiplication etc. The paper shows benefits of using multiple data streams for various compute intensities compared to one stream, benchmarked for 4 GPUs: professional NVIDIA Tesla V100, Tesla K20m, desktop GTX 1060 and mobile GeForce 940MX. Additionally, relative performances are shown for various numbers of kernel computations for these GPUs.

Citations

  • 4

    CrossRef

  • 0

    Web of Science

  • 0

    Scopus

Cite as

Full text

download paper
downloaded 42 times
Publication version
Accepted or Published Version
DOI:
Digital Object Identifier (open in new tab) 10.15439/2018F17
License
Creative Commons: CC-BY open in new tab

Keywords

Details

Category:
Articles
Type:
artykuły w czasopismach
Published in:
Annals of Computer Science and Information Systems no. 17, pages 105 - 110,
ISSN: 2300-5963
ISSN:
2300-5963
Language:
English
Publication year:
2018
Bibliographic description:
Czarnul P.: Benchmarking overlapping communication and computations with multiple streams for modern GPUs// Annals of Computer Science and Information Systems -Vol. 17, (2018), s.105-110
DOI:
Digital Object Identifier (open in new tab) 10.15439/2018f17
Verified by:
Gdańsk University of Technology

seen 165 times

Recommended for you

Meta Tags