Characterizing the Scalability of Graph Convolutional Networks on Intel® PIUMA

Matthew Joseph Adiletta; Jesmin Jahan Tithi; Emmanouil-Ioannis Farsarakis; Gerasimos Gerogiannis; Robert Adolf; Robert Benke; Sidharth Kashyap; Samuel Hsia; Kartik Lakhotia; Fabrizio Petrini; Gu-Yeon Wei; David Brooks

doi:10.1109/ispass57527.2023.00025

Characterizing the Scalability of Graph Convolutional Networks on Intel® PIUMA

Abstrakt

Large-scale Graph Convolutional Network (GCN) inference on traditional CPU/GPU systems is challenging due to a large memory footprint, sparse computational patterns, and irregular memory accesses with poor locality. Intel’s Programmable Integrated Unffied Memory Architecture (PIUMA) is designed to address these challenges for graph analytics. In this paper, a detailed characterization of GCNs is presented using the Open-Graph Benchmark (OGB) datasets to determine the viability of PIUMA as a potential solution to GCN scalability. First, the extent of sparse matrix dense matrix multiplication (SpMM) as a performance driver for GCN on CPU and GPU is explored, offering a methodology for predicting GCN behavior as a function of dataset characteristics. Second, an SpMM kernel optimized for PIUMA is described and investigated for sensitivity to system parameters including memory bandwidth, latency, and thread count. SpMM scalability on PIUMA is demonstrated, while the scalability limitations of a Xeon-optimized SpMM implementation are discussed. Finally, GCN performance is compared on PIUMA versus a Xeon CPU system and Ampere GPU system, showing impressive results on PIUMA for largescale datasets.

Cytowania

2

CrossRef
0

Web of Science
5

Scopus

Autorzy (12)

Matthew Joseph Adiletta
Jesmin Jahan Tithi
Emmanouil-Ioannis Farsarakis
Gerasimos Gerogiannis
Robert Adolf
Robert Benke mgr
Sidharth Kashyap
Samuel Hsia
Kartik Lakhotia
Fabrizio Petrini
Gu-Yeon Wei
David Brooks

Cytuj jako

Pełna treść

pełna treść publikacji nie jest dostępna w portalu

pełna treść artykułu zobacz w serwisie zewnętrznym otwiera się w nowej karcie

Słowa kluczowe

SENSITIVITY , SCALABILITY , MEMORY ARCHITECTURE , GRAPHICS PROCESSING UNITS , BANDWIDTH , SOFTWARE , PERFORMANCE ANALYSIS

Informacje szczegółowe

Kategoria:: Aktywność konferencyjna
Typ:: publikacja w wydawnictwie zbiorowym recenzowanym (także w materiałach konferencyjnych)
Język:: angielski
Rok wydania:: 2023
Opis bibliograficzny:: Adiletta M. J., Tithi J. J., Farsarakis E., Gerogiannis G., Adolf R., Benke R., Kashyap S., Hsia S., Lakhotia K., Petrini F., Wei G., Brooks D.: Characterizing the Scalability of Graph Convolutional Networks on Intel® PIUMA// / : , 2023,
DOI:: Cyfrowy identyfikator dokumentu elektronicznego (otwiera się w nowej karcie) 10.1109/ispass57527.2023.00025
Weryfikacja:: Politechnika Gdańska

wyświetlono 87 razy

Publikacje, które mogą cię zainteresować

A GPU Solver for Sparse Generalized Eigenvalue Problems with Symmetric Complex-Valued Matrices Obtained Using Higher-Order FEM

2018

Characterizing the Scalability of Graph Convolutional Networks on Intel® PIUMA

Abstrakt

Cytowania

Autorzy (12)

Matthew Joseph Adiletta

Jesmin Jahan Tithi

Emmanouil-Ioannis Farsarakis

Gerasimos Gerogiannis

Robert Adolf

Robert Benke mgr

Sidharth Kashyap

Samuel Hsia

Kartik Lakhotia

Fabrizio Petrini

Gu-Yeon Wei

David Brooks

Cytuj jako

Pełna treść

Słowa kluczowe

Informacje szczegółowe

Publikacje, które mogą cię zainteresować

A GPU Solver for Sparse Generalized Eigenvalue Problems with Symmetric Complex-Valued Matrices Obtained Using Higher-Order FEM

A memory efficient and fast sparse matrix vector product on a Gpu

A Task-Scheduling Approach for Efficient Sparse Symmetric Matrix-Vector Multiplication on a GPU

Generation of large finite-element matrices on multiple graphics processors

Wyszukiwarka

Characterizing the Scalability of Graph Convolutional Networks on Intel® PIUMA

Abstrakt

Cytowania

Autorzy (12)

Matthew Joseph Adiletta

Jesmin Jahan Tithi

Emmanouil-Ioannis Farsarakis

Gerasimos Gerogiannis

Robert Adolf

Robert Benke mgr

Sidharth Kashyap

Samuel Hsia

Kartik Lakhotia

Fabrizio Petrini

Gu-Yeon Wei

David Brooks

Cytuj jako

Pełna treść

Słowa kluczowe

Informacje szczegółowe

Publikacje, które mogą cię zainteresować

A GPU Solver for Sparse Generalized Eigenvalue Problems with Symmetric Complex-Valued Matrices Obtained Using Higher-Order FEM

A memory efficient and fast sparse matrix vector product on a Gpu

A Task-Scheduling Approach for Efficient Sparse Symmetric Matrix-Vector Multiplication on a GPU

Generation of large finite-element matrices on multiple graphics processors