Abstract
The paper presents benchmarking a multi-stream application processing a set of input data arrays. Tests have been performed and execution times measured for various numbers of streams and various compute intensities measured as the ratio of kernel compute time and data transfer time. As such, the application and benchmarking is representative of frequently used operations such as vector weighted sum, matrix multiplication etc. The paper shows benefits of using multiple data streams for various compute intensities compared to one stream, benchmarked for 4 GPUs: professional NVIDIA Tesla V100, Tesla K20m, desktop GTX 1060 and mobile GeForce 940MX. Additionally, relative performances are shown for various numbers of kernel computations for these GPUs.
Citations
-
4
CrossRef
-
0
Web of Science
-
0
Scopus
Author (1)
Cite as
Full text
- Publication version
- Accepted or Published Version
- DOI:
- Digital Object Identifier (open in new tab) 10.15439/2018F17
- License
- open in new tab
Keywords
Details
- Category:
- Articles
- Type:
- artykuły w czasopismach
- Published in:
-
Annals of Computer Science and Information Systems
no. 17,
pages 105 - 110,
ISSN: 2300-5963 - ISSN:
- 2300-5963
- Language:
- English
- Publication year:
- 2018
- Bibliographic description:
- Czarnul P.: Benchmarking overlapping communication and computations with multiple streams for modern GPUs// Annals of Computer Science and Information Systems -Vol. 17, (2018), s.105-110
- DOI:
- Digital Object Identifier (open in new tab) 10.15439/2018f17
- Verified by:
- Gdańsk University of Technology
seen 167 times