Efficient parallel implementation of crowd simulation using a hybrid CPU+GPU high performance computing system - Publication - Bridge of Knowledge

Search

Efficient parallel implementation of crowd simulation using a hybrid CPU+GPU high performance computing system

Abstract

In the paper we present a modern efficient parallel OpenMP+CUDA implementation of crowd simulation for hybrid CPU+GPU systems and demonstrate its higher performance over CPU-only and GPU-only implementations for several problem sizes including 10 000, 50 000, 100 000, 500 000 and 1 000 000 agents. We show how performance varies for various tile sizes and what CPU–GPU load balancing settings shall be preferred for various domain sizes among CPUs and GPUs of a high performance system with 2 Intel Xeon Silver multicore CPUs and 8 NVIDIA Quadro RTX 5000 GPUs. We then present how execution time depends on the number of agents as well as the number of CUDA streams used for parallel execution of several CUDA kernels. We discuss the design and implementation of an algorithm with CPU computational threads, GPU management threads, assignment of particular tasks to threads as well as usage of pinned memory and CUDA shared memory for maximizing performance.

Citations

  • 3

    CrossRef

  • 0

    Web of Science

  • 5

    Scopus

Keywords

Details

Category:
Articles
Type:
artykuły w czasopismach
Published in:
SIMULATION MODELLING PRACTICE AND THEORY no. 123,
ISSN: 1569-190X
Language:
English
Publication year:
2023
Bibliographic description:
Skrzypczak J., Czarnul P.: Efficient parallel implementation of crowd simulation using a hybrid CPU+GPU high performance computing system// SIMULATION MODELLING PRACTICE AND THEORY -Vol. 123,iss. 102691 (2023), s.1-18
DOI:
Digital Object Identifier (open in new tab) 10.1016/j.simpat.2022.102691
Sources of funding:
  • Statutory activity/subsidy
Verified by:
Gdańsk University of Technology

seen 154 times

Recommended for you

Meta Tags