Wyniki wyszukiwania dla: DEEP NEURAL NETWORK TRAINING BENCHMARKING PARALLEL COMPUTATIONS CAFFE MKL - MOST Wiedzy

Wyszukiwarka

Wyniki wyszukiwania dla: DEEP NEURAL NETWORK TRAINING BENCHMARKING PARALLEL COMPUTATIONS CAFFE MKL
Przykład wyników znalezionych w innych katalogach

Wyniki wyszukiwania dla: DEEP NEURAL NETWORK TRAINING BENCHMARKING PARALLEL COMPUTATIONS CAFFE MKL

  • Benchmarking Deep Neural Network Training Using Multi- and Many-Core Processors

    In the paper we provide thorough benchmarking of deep neural network (DNN) training on modern multi- and many-core Intel processors in order to assess performance differences for various deep learning as well as parallel computing parameters. We present performance of DNN training for Alexnet, Googlenet, Googlenet_v2 as well as Resnet_50 for various engines used by the deep learning framework, for various batch sizes. Furthermore,...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Modeling and Simulation for Exploring Power/Time Trade-off of Parallel Deep Neural Network Training

    In the paper we tackle bi-objective execution time and power consumption optimization problem concerning execution of parallel applications. We propose using a discrete-event simulation environment for exploring this power/time trade-off in the form of a Pareto front. The solution is verified by a case study based on a real deep neural network training application for automatic speech recognition. A simulation lasting over 2 hours...

    Pełny tekst do pobrania w portalu

  • The impact of the AC922 Architecture on Performance of Deep Neural Network Training

    Publikacja

    - Rok 2020

    Practical deep learning applications require more and more computing power. New computing architectures emerge, specifically designed for the artificial intelligence applications, including the IBM Power System AC922. In this paper we confront an AC922 (8335-GTG) server equipped with 4 NVIDIA Volta V100 GPUs with selected deep neural network training applications, including four convolutional and one recurrent model. We report...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Performance and Energy Aware Training of a Deep Neural Network in a Multi-GPU Environment with Power Capping

    In this paper we demonstrate that it is possible to obtain considerable improvement of performance and energy aware metrics for training of deep neural networks using a modern parallel multi-GPU system, by enforcing selected, non-default power caps on the GPUs. We measure the power and energy consumption of the whole node using a professional, certified hardware power meter. For a high performance workstation with 8 GPUs, we were...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Paweł Rościszewski dr inż.

    Osoby

    Paweł Rościszewski received his PhD in Computer Science at Gdańsk University of Technology in 2018 based on PhD thesis entitled: "Optimization of hybrid parallel application execution in heterogeneous high performance computing systems considering execution time and power consumption". Currently, he is an Assistant Professor at the Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology, Poland....

  • Poprawa jakości klasyfikacji głębokich sieci neuronowych poprzez optymalizację ich struktury i dwuetapowy proces uczenia

    Publikacja

    - Rok 2024

    W pracy doktorskiej podjęto problem realizacji algorytmów głębokiego uczenia w warunkach deficytu danych uczących. Głównym celem było opracowanie podejścia optymalizującego strukturę sieci neuronowej oraz zastosowanie uczeniu dwuetapowym, w celu uzyskania mniejszych struktur, zachowując przy tym dokładności. Proponowane rozwiązania poddano testom na zadaniu klasyfikacji znamion skórnych na znamiona złośliwe i łagodne. W pierwszym...

    Pełny tekst do pobrania w portalu

  • Resource constrained neural network training

    Publikacja

    Modern applications of neural-network-based AI solutions tend to move from datacenter backends to low-power edge devices. Environmental, computational, and power constraints are inevitable consequences of such a shift. Limiting the bit count of neural network parameters proved to be a valid technique for speeding up and increasing efficiency of the inference process. Hence, it is understandable that a similar approach is gaining...

    Pełny tekst do pobrania w portalu

  • Neural network training with limited precision and asymmetric exponent

    Publikacja

    Along with an extremely increasing number of mobile devices, sensors and other smart utilities, an unprecedented growth of data can be observed in today’s world. In order to address multiple challenges facing the big data domain, machine learning techniques are often leveraged for data analysis, filtering and classification. Wide usage of artificial intelligence with large amounts of data creates growing demand not only for storage...

    Pełny tekst do pobrania w portalu

  • A Bayesian regularization-backpropagation neural network model for peeling computations

    Publikacja
    • S. Gouravaraju
    • J. Narayan
    • R. Sauer
    • S. S. Gautam

    - JOURNAL OF ADHESION - Rok 2023

    A Bayesian regularization-backpropagation neural network (BRBPNN) model is employed to predict some aspects of the gecko spatula peeling, viz. the variation of the maximum normal and tangential pull-off forces and the resultant force angle at detachment with the peeling angle. K-fold cross validation is used to improve the effectiveness of the model. The input data is taken from finite element (FE) peeling results. The neural network...

    Pełny tekst do pobrania w portalu

  • Deep neural network architecture search using network morphism

    The paper presents the results of the research on neural architecture search (NAS) algorithm. We utilized the hill climbing algorithm to search for well-performing structures of deep convolutional neural network. Moreover, we used the function preserving transformations which enabled the effective operation of the algorithm in a short period of time. The network obtained with the advantage of NAS was validated on skin lesion classification...

    Pełny tekst do pobrania w serwisie zewnętrznym