Wyniki wyszukiwania dla: DEEP NEURAL NETWORK TRAINING BENCHMARKING PARALLEL COMPUTATIONS CAFFE MKL

Wyniki wyszukiwania dla: DEEP NEURAL NETWORK TRAINING BENCHMARKING PARALLEL COMPUTATIONS CAFFE MKL

Nie znaleźliśmy wyników w zadanych kryteriach!

Ale mamy wyniki w innych katalogach.

Przykład wyników znalezionych w innych katalogach

zobacz wszystkie wyniki

Filtry

wszystkich: 628

wyczyść wszystkie filtry niedostępne

Benchmarking Deep Neural Network Training Using Multi- and Many-Core Processors
Publikacja
- P. Czarnul
- K. Jabłońska
- International Journal of Computer Information Systems and Industrial Management Applications - Rok 2020
In the paper we provide thorough benchmarking of deep neural network (DNN) training on modern multi- and many-core Intel processors in order to assess performance differences for various deep learning as well as parallel computing parameters. We present performance of DNN training for Alexnet, Googlenet, Googlenet_v2 as well as Resnet_50 for various engines used by the deep learning framework, for various batch sizes. Furthermore,...

Pełny tekst do pobrania w serwisie zewnętrznym
Modeling and Simulation for Exploring Power/Time Trade-off of Parallel Deep Neural Network Training
Publikacja
- P. Rościszewski
- Procedia Computer Science - Rok 2017
In the paper we tackle bi-objective execution time and power consumption optimization problem concerning execution of parallel applications. We propose using a discrete-event simulation environment for exploring this power/time trade-off in the form of a Pareto front. The solution is verified by a case study based on a real deep neural network training application for automatic speech recognition. A simulation lasting over 2 hours...

Pełny tekst do pobrania w portalu
The impact of the AC922 Architecture on Performance of Deep Neural Network Training
Publikacja
- P. Rościszewski
- M. Iwański
- P. Czarnul
- Rok 2020
Practical deep learning applications require more and more computing power. New computing architectures emerge, specifically designed for the artificial intelligence applications, including the IBM Power System AC922. In this paper we confront an AC922 (8335-GTG) server equipped with 4 NVIDIA Volta V100 GPUs with selected deep neural network training applications, including four convolutional and one recurrent model. We report...

Pełny tekst do pobrania w serwisie zewnętrznym
Performance and Energy Aware Training of a Deep Neural Network in a Multi-GPU Environment with Power Capping
Publikacja
- Rok 2024
In this paper we demonstrate that it is possible to obtain considerable improvement of performance and energy aware metrics for training of deep neural networks using a modern parallel multi-GPU system, by enforcing selected, non-default power caps on the GPUs. We measure the power and energy consumption of the whole node using a professional, certified hardware power meter. For a high performance workstation with 8 GPUs, we were...

Pełny tekst do pobrania w serwisie zewnętrznym
Paweł Rościszewski dr inż.

Osoby

Paweł Rościszewski received his PhD in Computer Science at Gdańsk University of Technology in 2018 based on PhD thesis entitled: "Optimization of hybrid parallel application execution in heterogeneous high performance computing systems considering execution time and power consumption". Currently, he is an Assistant Professor at the Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology, Poland....
Poprawa jakości klasyfikacji głębokich sieci neuronowych poprzez optymalizację ich struktury i dwuetapowy proces uczenia
Publikacja
- A. Kwasigroch
- Rok 2024
W pracy doktorskiej podjęto problem realizacji algorytmów głębokiego uczenia w warunkach deficytu danych uczących. Głównym celem było opracowanie podejścia optymalizującego strukturę sieci neuronowej oraz zastosowanie uczeniu dwuetapowym, w celu uzyskania mniejszych struktur, zachowując przy tym dokładności. Proponowane rozwiązania poddano testom na zadaniu klasyfikacji znamion skórnych na znamiona złośliwe i łagodne. W pierwszym...

Pełny tekst do pobrania w portalu
Resource constrained neural network training
Publikacja
- M. Pietrołaj
- M. Blok
- Scientific Reports - Rok 2024
Modern applications of neural-network-based AI solutions tend to move from datacenter backends to low-power edge devices. Environmental, computational, and power constraints are inevitable consequences of such a shift. Limiting the bit count of neural network parameters proved to be a valid technique for speeding up and increasing efficiency of the inference process. Hence, it is understandable that a similar approach is gaining...

Pełny tekst do pobrania w portalu
Neural network training with limited precision and asymmetric exponent
Publikacja
- M. Blok
- M. Pietrołaj
- Journal of Big Data - Rok 2022
Along with an extremely increasing number of mobile devices, sensors and other smart utilities, an unprecedented growth of data can be observed in today’s world. In order to address multiple challenges facing the big data domain, machine learning techniques are often leveraged for data analysis, filtering and classification. Wide usage of artificial intelligence with large amounts of data creates growing demand not only for storage...

Pełny tekst do pobrania w portalu
A Bayesian regularization-backpropagation neural network model for peeling computations
Publikacja
- S. Gouravaraju
- J. Narayan
- R. Sauer
- S. S. Gautam
- JOURNAL OF ADHESION - Rok 2023
A Bayesian regularization-backpropagation neural network (BRBPNN) model is employed to predict some aspects of the gecko spatula peeling, viz. the variation of the maximum normal and tangential pull-off forces and the resultant force angle at detachment with the peeling angle. K-fold cross validation is used to improve the effectiveness of the model. The input data is taken from finite element (FE) peeling results. The neural network...

Pełny tekst do pobrania w portalu
Deep neural network architecture search using network morphism
Publikacja
- Rok 2019
The paper presents the results of the research on neural architecture search (NAS) algorithm. We utilized the hill climbing algorithm to search for well-performing structures of deep convolutional neural network. Moreover, we used the function preserving transformations which enabled the effective operation of the algorithm in a short period of time. The network obtained with the advantage of NAS was validated on skin lesion classification...

Pełny tekst do pobrania w serwisie zewnętrznym

Wyszukiwarka

Nie znaleźliśmy wyników w zadanych kryteriach!

Filtry

Katalog

Wyniki wyszukiwania dla: DEEP NEURAL NETWORK TRAINING BENCHMARKING PARALLEL COMPUTATIONS CAFFE MKL

Paweł Rościszewski dr inż.