Search results for: DEEP NEURAL NETWORK TRAINING

total: 567

The impact of the AC922 Architecture on Performance of Deep Neural Network Training
Publication
- P. Rościszewski
- M. Iwański
- P. Czarnul
- Year 2020
Practical deep learning applications require more and more computing power. New computing architectures emerge, specifically designed for the artificial intelligence applications, including the IBM Power System AC922. In this paper we confront an AC922 (8335-GTG) server equipped with 4 NVIDIA Volta V100 GPUs with selected deep neural network training applications, including four convolutional and one recurrent model. We report...

Full text to download in external service
Benchmarking Deep Neural Network Training Using Multi- and Many-Core Processors
Publication
- P. Czarnul
- K. Jabłońska
- International Journal of Computer Information Systems and Industrial Management Applications - Year 2020
In the paper we provide thorough benchmarking of deep neural network (DNN) training on modern multi- and many-core Intel processors in order to assess performance differences for various deep learning as well as parallel computing parameters. We present performance of DNN training for Alexnet, Googlenet, Googlenet_v2 as well as Resnet_50 for various engines used by the deep learning framework, for various batch sizes. Furthermore,...

Full text to download in external service
Modeling and Simulation for Exploring Power/Time Trade-off of Parallel Deep Neural Network Training
Publication
- P. Rościszewski
- Procedia Computer Science - Year 2017
In the paper we tackle the bi-objective execution time and power consumption optimization problem concerning execution of parallel applications. We propose using a discrete-event simulation environment for exploring this power/time trade-off in the form of a Pareto front. The solution is verified by a case study based on a real deep neural network training application for automatic speech recognition. A simulation lasting over 2 hours...

Full text available to download
Resource constrained neural network training
Publication
- M. Pietrołaj
- M. Blok
- Scientific Reports - Year 2024
Modern applications of neural-network-based AI solutions tend to move from datacenter backends to low-power edge devices. Environmental, computational, and power constraints are inevitable consequences of such a shift. Limiting the bit count of neural network parameters proved to be a valid technique for speeding up and increasing efficiency of the inference process. Hence, it is understandable that a similar approach is gaining...

Full text available to download
Performance and Energy Aware Training of a Deep Neural Network in a Multi-GPU Environment with Power Capping
Publication
- Year 2024
In this paper we demonstrate that it is possible to obtain considerable improvement of performance and energy aware metrics for training of deep neural networks using a modern parallel multi-GPU system, by enforcing selected, non-default power caps on the GPUs. We measure the power and energy consumption of the whole node using a professional, certified hardware power meter. For a high performance workstation with 8 GPUs, we were...

Full text to download in external service
Neural network training with limited precision and asymmetric exponent
Publication
- M. Blok
- M. Pietrołaj
- Journal of Big Data - Year 2022
Along with a rapidly increasing number of mobile devices, sensors and other smart utilities, an unprecedented growth of data can be observed in today’s world. In order to address multiple challenges facing the big data domain, machine learning techniques are often leveraged for data analysis, filtering and classification. Wide usage of artificial intelligence with large amounts of data creates growing demand not only for storage...

Full text available to download
Comparative study of methods for artificial neural network training.
Publication
- H. Tiliouine
- S. Zieliński
- Year 2004
The paper presents the results of a comparative study of the following neural network training methods: error backpropagation, the recursive least squares method, Zangwill's method, and evolutionary algorithms. The study concerned the design of an adaptive neural voltage controller for a synchronous generator.
Deep neural network architecture search using network morphism
Publication
- Year 2019
The paper presents the results of the research on neural architecture search (NAS) algorithm. We utilized the hill climbing algorithm to search for well-performing structures of deep convolutional neural network. Moreover, we used the function preserving transformations which enabled the effective operation of the algorithm in a short period of time. The network obtained with the advantage of NAS was validated on skin lesion classification...

Full text to download in external service
Limitation of Floating-Point Precision for Resource Constrained Neural Network Training
Publication
- M. Pietrołaj
- Year 2024
Insufficient availability of computational power and runtime memory is a major concern when it comes to experiments in the field of artificial intelligence. One of the promising solutions for this problem is an optimization of internal neural network’s calculations and its parameters’ representation. This work focuses on the mentioned issue by the application of neural network training with limited precision. Based on this research,...

Full text available to download
Creating neural models using an adaptive algorithm for optimal size of neural network and training set.
Publication
- Ł. Balewski
- M. Mrozowski
- Year 2004
The paper presents an adaptive algorithm that generates neural models of linear microwave circuits and can estimate the optimal size of both the training set and the neural network. Several models of waveguide and microstrip discontinuities were created, and their correctness was then verified by comparing the results of mode-matching and method-of-moments analyses of band-pass filters.
Categorization of emotions in dog behavior based on the deep neural network
Publication
- COMPUTATIONAL INTELLIGENCE - Year 2022
The aim of this article is to present a neural system based on stock architecture for recognizing emotional behavior in dogs. Our considerations are inspired by the original work of Franzoni et al. on recognizing dog emotions. An appropriate set of photographic data has been compiled taking into account five classes of emotional behavior in dogs of one breed, including joy, anger, licking, yawning, and sleeping. Focusing on a particular...

Full text available to download
Deep convolutional neural network for predicting kidney tumour malignancy
Publication
- A. Obuchowski
- B. Klaudel
- R. Karski
- B. Rydziński
- M. Glembin
- P. Jasik
- P. Syty
- Year 2021
Purpose: According to the statistics, up to 15-20% of removed solid kidney tumors turn out to be benign in postoperative histopathological examination, despite having been identified as malignant by a radiologist. The aim of the research was to limit the number of unnecessary nephrectomies of benign tumors. Methods or Background: We propose a machine-aided diagnostic system for kidney...

Full text to download in external service
Leveraging Training Strategies of Artificial Neural Network for Classification of Multiday Electromyography Signals
Publication
- M. Akmal
- S. Khalid
- M. Moiz
- M. Abbass
- M. Qureshi
- Z. Mushtaq
- Year 2022
Full text to download in external service
Selection of an artificial pre-training neural network for the classification of inland vessels based on their images
Publication
- K. Bobkowska
- I. Bodus-Olkowska
- Zeszyty Naukowe Akademii Morskiej w Szczecinie - Year 2021
Artificial neural networks (ANN) are the most commonly used algorithms for image classification problems. An image classifier takes an image or video as input and classifies it into one of the possible categories that it was trained to identify. They are applied in various areas such as security, defense, healthcare, biology, forensics, communication, etc. There is no need to create one’s own ANN because there are several pre-trained...

Full text available to download
Speaker Recognition Using Convolutional Neural Network with Minimal Training Data for Smart Home Solutions
Publication
- M. Wang
- T. Sirlapu
- A. Kwaśniewska
- M. Szankin
- M. Bartscherer
- R. Nicolas
- Year 2018
With the technology advancements in the smart home sector, voice control and automation are key components that can make a real difference in people's lives. The voice recognition technology market continues to evolve rapidly as almost all smart home devices are providing speaker recognition capability today. However, most of them provide cloud-based solutions or use very deep Neural Networks for the speaker recognition task, which are...

Full text to download in external service
Comparison of Deep Neural Network Learning Algorithms for Mars Terrain Image Segmentation
Publication
- Year 2024
This paper is dedicated to the topic of terrain recognition on Mars using advanced techniques based on the convolutional neural networks (CNN). The work on the project was conducted based on the set of 18K images collected by the Curiosity, Opportunity and Spirit rovers. The data were later processed by the model operating in a Python environment, utilizing Keras and Tensorflow repositories. The model benefits from the pretrained...

Full text available to download
Dataset Related Experimental Investigation of Chess Position Evaluation Using a Deep Neural Network
Publication
- D. Wieczerzak
- P. Czarnul
- Year 2023
The idea of training Artificial Neural Networks to evaluate chess positions has been widely explored in the last ten years. In this paper we investigated dataset impact on chess position evaluation. We created two datasets with over 1.6 million unique chess positions each. In one of those we also included randomly generated positions resulting from consideration of potentially unpredictable chess moves. Each position was evaluated...

Full text available to download
GPU Power Capping for Energy-Performance Trade-Offs in Training of Deep Convolutional Neural Networks for Image Recognition
Publication
- Year 2022
In the paper we present performance-energy trade-off investigation of training Deep Convolutional Neural Networks for image recognition. Several representative and widely adopted network models, such as Alexnet, VGG-19, Inception V3, Inception V4, Resnet50 and Resnet152 were tested using systems with Nvidia Quadro RTX 6000 as well as Nvidia V100 GPUs. Using GPU power capping we found other than default configurations minimizing...

Full text available to download
Hybrid Inception-embedded deep neural network ResNet for short and medium-term PV-Wind forecasting
Publication
- A. Feroz
- M. Mansoor
- M. Usman
- Q. Ling
- ENERGY CONVERSION AND MANAGEMENT - Year 2023
Full text to download in external service
Zastosowanie algorytmu ewolucyjnego do uczenia neuronowego regulatora napięcia generatora synchronicznego. Evolutionary algorithm for training a neural network of synchronous generator voltage controller
Publication
- H. Tiliouine
- S. Zieliński
- Year 2005
The most popular method of training multilayer neural networks, error backpropagation, suffers from poor efficiency. For this reason, attempts are being made to use other methods for training networks. The paper presents the results of training a network that implements a neural controller by means of an evolutionary algorithm. Simulation results confirmed good convergence of the evolutionary algorithm in this application.
Using Deep Neural Network Methods for Forecasting Energy Productivity Based on Comparison of Simulation and DNN Results for Central Poland—Swietokrzyskie Voivodeship
Publication
- M. Pikus
- J. Wąs
- ENERGIES - Year 2023
Full text to download in external service
Using Deep Neural Network Methods for Forecasting Energy Productivity Based on Comparison of Simulation and DNN Results for Central Poland – Swietokrzyskie Voivodeship
Publication
- M. Pikus
- J. Wąs
- Year 2023
Full text to download in external service
Neural networks and deep learning
Publication
- A. P. López-Monroy
- J. S. Garcia Salinas
- Year 2022
In this chapter we will provide the general and fundamental background related to Neural Networks and Deep Learning techniques. Specifically, we divide the fundamentals of deep learning in three parts, the first one introduces Deep Feed Forward Networks and the main training algorithms in the context of optimization. The second part covers Convolutional Neural Networks (CNN) and discusses their main advantages and shortcomings...

Full text to download in external service
Deep Learning Optimization for Edge Devices: Analysis of Training Quantization Parameters
Publication
- A. Kwaśniewska
- M. Szankin
- M. Ozga
- J. Wolfe
- A. Das
- A. Zajac
- J. Rumiński
- P. Rad
- Year 2019
This paper focuses on the convolutional neural network quantization problem. The quantization has a distinct stage of data conversion from floating-point into integer-point numbers. In general, the process of quantization is associated with the reduction of the matrix dimension via limited precision of the numbers. However, the training and inference stages of deep learning neural network are limited by the space of the memory and a...

Full text available to download
Training of Deep Learning Models Using Synthetic Datasets
Publication
- Z. Kowalczuk
- J. Glinko
- Year 2022
In order to solve increasingly complex problems, the complexity of Deep Neural Networks also needs to be constantly increased, and therefore training such networks requires more and more data. Unfortunately, obtaining such massive real world training data to optimize neural network parameters is a challenging and time-consuming task. To solve this problem, we propose an easy-to-use and general approach to training deep learning...

Full text to download in external service
From Linear Classifier to Convolutional Neural Network for Hand Pose Recognition
Publication
- P. Rościszewski
- Computer Science - Year 2017
Recently gathered image datasets and the new capabilities of high-performance computing systems have allowed developing new artificial neural network models and training algorithms. Using the new machine learning models, computer vision tasks can be accomplished based on the raw values of image pixels instead of specific features. The principle of operation of deep neural networks resembles more and more what we believe to be happening...

Full text available to download
Selected Technical Issues of Deep Neural Networks for Image Classification Purposes
Publication
- Bulletin of the Polish Academy of Sciences-Technical Sciences - Year 2019
In recent years, deep learning and especially Deep Neural Networks (DNN) have obtained amazing performance on a variety of problems, in particular in classification or pattern recognition. Among many kinds of DNNs, the Convolutional Neural Networks (CNN) are most commonly used. However, due to their complexity, there are many problems related but not limited to optimizing network parameters, avoiding overfitting and ensuring good...

Full text available to download
Neural network model of ship magnetic signature for different measurement depths
Publication
- K. Zielonacki
- J. Tarnawski
- Year 2024
This paper presents the development of a model of a corvette-type ship’s magnetic signature using an artificial neural network (ANN). The capabilities of ANNs to learn complex relationships between the vessel’s characteristics and the magnetic field at different depths are proposed as an alternative to a multi-dipole model. A training dataset, consisting of signatures prepared in finite element method (FEM) environment Simulia...

Full text to download in external service
Deep neural networks for human pose estimation from a very low resolution depth image
Publication
- P. Szczuko
- MULTIMEDIA TOOLS AND APPLICATIONS - Year 2019
The work presented in the paper is dedicated to determining and evaluating the most efficient neural network architecture applied as a multiple regression network localizing human body joints in 3D space based on a single low resolution depth image. The main challenge was to deal with a noisy and coarse representation of the human body, as observed by a depth sensor from a large distance, and to achieve high localization precision....

Full text available to download
An Improved Convolutional Neural Network for Steganalysis in the Scenario of Reuse of the Stego-Key
Publication
- B. Czaplewski
- Year 2019
The topic of this paper is the use of deep learning techniques, more specifically convolutional neural networks, for steganalysis of digital images. The steganalysis scenario of the repeated use of the stego-key is considered. Firstly, a study of the influence of the depth and width of the convolution layers on the effectiveness of classification was conducted. Next, a study on the influence of depth and width of fully connected...

Full text to download in external service
Minimizing Distribution and Data Loading Overheads in Parallel Training of DNN Acoustic Models with Frequent Parameter Averaging
Publication
- P. Rościszewski
- J. Kaliski
- Year 2017
In the paper we investigate the performance of parallel deep neural network training with parameter averaging for acoustic modeling in Kaldi, a popular automatic speech recognition toolkit. We describe experiments based on training a recurrent neural network with 4 layers of 800 LSTM hidden states on a 100-hour corpus of annotated Polish speech data. We propose an MPI-based modification of the training program which minimizes the...

Full text to download in external service
A Comprehensive Analysis of Deep Neural-Based Cerebral Microbleeds Detection System
Publication
- M. Ferlin
- M. Grochowski
- A. Kwasigroch
- A. Mikołajczyk-Bareła
- E. Szurowska
- M. Grzywińska
- A. Sabisz
- Electronics - Year 2021
Machine learning-based systems are gaining interest in the field of medicine, mostly in medical imaging and diagnosis. In this paper, we address the problem of automatic cerebral microbleeds (CMB) detection in magnetic resonance images. It is challenging due to difficulty in distinguishing a true CMB from its mimics; however, if successfully solved, it would streamline radiologists' work. To deal with this complex three-dimensional...

Full text available to download
Exploring Neural Networks for Musical Instrument Identification in Polyphonic Audio
Publication
- M. Blaszke
- G. Korvel
- B. Kostek
- IEEE INTELLIGENT SYSTEMS - Year 2024
The purpose of this paper is to introduce neural network-based methods that surpass state-of-the-art (SOTA) models, either by training faster or having simpler architecture, while maintaining comparable effectiveness in musical instrument identification in polyphonic music. Several approaches are presented, including two authors’ proposals, i.e., spiking neural networks (SNN) and a modular deep learning model named FMCNN (Fully...

Full text to download in external service
Paweł Rościszewski dr inż.

People

Paweł Rościszewski received his PhD in Computer Science at Gdańsk University of Technology in 2018 based on PhD thesis entitled: "Optimization of hybrid parallel application execution in heterogeneous high performance computing systems considering execution time and power consumption". Currently, he is an Assistant Professor at the Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology, Poland....
Musical Instrument Identification Using Deep Learning Approach
Publication
- M. Blaszke
- B. Kostek
- SENSORS - Year 2022
The work aims to propose a novel approach for automatically identifying all instruments present in an audio excerpt using sets of individual convolutional neural networks (CNNs) per tested instrument. The paper starts with a review of tasks related to musical instrument identification. It focuses on tasks performed, input type, algorithms employed, and metrics used. The paper starts with the background presentation, i.e., metadata...

Full text available to download
Data augmentation for improving deep learning in image classification problem
Publication
- A. Mikołajczyk-Bareła
- M. Grochowski
- Year 2018
These days deep learning is the fastest-growing area in the field of Machine Learning (ML) and Deep Neural Networks (DNN). Among the many DNN structures, Convolutional Neural Networks (CNN) are currently the main tool used for image analysis and classification purposes. Despite great achievements and perspectives, deep neural networks and accompanying learning algorithms have some relevant challenges to tackle. In this...

Full text to download in external service
Accurate Modeling of Antenna Structures by Means of Domain Confinement and Pyramidal Deep Neural Networks
Publication
- S. Kozieł
- N. Calik
- P. Mahouti
- M. Belen
- IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION - Year 2022
The importance of surrogate modeling techniques has been gradually increasing in the design of antenna structures over the recent years. Perhaps the most important reason is a high cost of full-wave electromagnetic (EM) analysis of antenna systems. Although imperative in ensuring evaluation reliability, it entails considerable computational expenses. These are especially pronounced when carrying out EM-driven design tasks such...

Full text available to download
Self-Supervised Learning to Increase the Performance of Skin Lesion Classification
Publication
- Electronics - Year 2020
To successfully train a deep neural network, a large amount of human-labeled data is required. Unfortunately, in many areas, collecting and labeling data is a difficult and tedious task. Several ways have been developed to mitigate the problem associated with the shortage of data, the most common of which is transfer learning. However, in many cases, the use of transfer learning as the only remedy is insufficient. In this study,...

Full text available to download
Efficiency of Artificial Intelligence Methods for Hearing Loss Type Classification: an Evaluation
Publication
- M. Kassjański
- M. Kulawiak
- T. Przewoźny
- D. Tretiakow
- J. Kuryłowicz
- A. Molisz
- K. Koźmiński
- A. Kwaśniewska
- P. Mierzwińska-Dolny
- M. Grono
- Journal of Automation, Mobile Robotics and Intelligent Systems - JAMRIS - Year 2024
The evaluation of hearing loss is primarily conducted by pure tone audiometry testing, which is often regarded as the gold standard for assessing auditory function. If the presence of hearing loss is determined, it is possible to differentiate between three types of hearing loss: sensorineural, conductive, and mixed. This study presents a comprehensive comparison of a variety of AI classification models, performed on 4007 pure tone...

Full text to download in external service
Optimized Deep Learning Model for Flood Detection Using Satellite Images
Publication
- A. Stateczny
- H. D. Praveena
- R. H. Krishnappa
- K. R. Chythanya
- B. B. Babysarojam
- Remote Sensing - Year 2023
The increasing amount of rain produces a number of issues in Kerala, particularly in urban regions where the drainage system is frequently unable to handle a significant amount of water in such a short duration. Meanwhile, standard flood detection results are inaccurate for complex phenomena and cannot handle enormous quantities of data. In order to overcome those drawbacks and enhance the outcomes of conventional flood detection...

Full text available to download
Tagged images with bees
Open Research Data
open access
- T. Boiński
- J. Szymański
- series: Bees
Images taken from a bee hive with tagged bees. The images are prepared for training a YOLOv5 deep neural network (supplied with the data).
How to Sort Them? A Network for LEGO Bricks Classification
Publication
- Year 2022
LEGO bricks are highly popular due to the ability to build almost any type of creation. This is possible thanks to the availability of multiple shapes and colors of the bricks. For a smooth build process the bricks need to be properly sorted and arranged. In our work we aim at creating an automated LEGO bricks sorter. With over 3700 different LEGO parts, brick classification has to be done with deep neural networks. The question arises...

Full text available to download
Deep neural networks for data analysis 24/25
e-Learning Courses
- J. Cychnerski
- K. Draszawka
This course covers introduction to supervised machine learning, construction of basic artificial deep neural networks (DNNs) and basic training algorithms, as well as the overview of popular DNNs architectures (convolutional networks, recurrent networks, transformers). The course introduces students to popular regularization techniques for deep models. Besides theory, large part of the course is the project in which students apply...
MP3vec: A Reusable Machine-Constructed Feature Representation for Protein Sequences
Publication
- S. R. Gupte
- D. S. Jain
- A. Srinivasan
- R. Aduri
- Year 2020
Machine Learning (ML) methods have been used with varying degrees of success on protein prediction tasks, with two inherent limitations. First, prediction performance often depends upon the features extracted from the proteins. Second, experimental data may be insufficient to construct reliable ML models. Here we introduce MP3vec, a transferable representation for protein sequences that is designed to be used specifically for sequence-to-sequence...

Full text to download in external service
Adaptive Hounsfield Scale Windowing in Computed Tomography Liver Segmentation
Publication
- Year 2024
In computed tomography (CT) imaging, the Hounsfield Unit (HU) scale quantifies radiodensity, but its nonlinear nature across organs and lesions complicates machine learning analysis. This paper introduces an automated method for adaptive HU scale windowing in deep learning-based CT liver segmentation. We propose a new neural network layer that optimizes HU scale window parameters during training. Experiments on the Liver Tumor...

Full text to download in external service
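Poprawa jakości klasyfikacji głębokich sieci neuronowych poprzez optymalizację ich struktury i dwuetapowy proces uczenia [Improving the classification quality of deep neural networks through optimization of their structure and a two-stage training process]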
Publication
- A. Kwasigroch
- Year 2024
This doctoral thesis addresses the problem of implementing deep learning algorithms when training data are scarce. The main goal was to develop an approach that optimizes the neural network structure and applies two-stage training in order to obtain smaller structures while preserving accuracy. The proposed solutions were tested on the task of classifying skin lesions as malignant or benign. In the first...

Full text available to download
Deep Video Multi-task Learning Towards Generalized Visual Scene Enhancement and Understanding
Publication
- E. Katsaros
- Year 2024
The goal of this thesis was to develop efficient video multi-task convolutional architectures for a range of diverse vision tasks, on RGB scenes, leveraging i) task relationships and ii) motion information to improve multi-task performance. The approach we take starts from the integration of diverse tasks within video multi-task learning networks. We present the first two datasets of their kind in the existing literature, featuring...

Full text available to download
Method for Clustering of Brain Activity Data Derived from EEG Signals
Publication
- FUNDAMENTA INFORMATICAE - Year 2019
A method for assessing separability of EEG signals associated with three classes of brain activity is proposed. The EEG signals are acquired from 23 subjects, gathered from a headset consisting of 14 electrodes. Data are processed by applying Discrete Wavelet Transform (DWT) for the signal analysis and an autoencoder neural network for the brain activity separation. Processing involves 74 wavelets from 3 DWT families: Coiflets,...

Full text available to download
Assessing the attractiveness of human face based on machine learning
Publication
- Year 2023
The attractiveness of the face plays an important role in everyday life, especially in the modern world where social media and the Internet surround us. In this study, an attempt to assess the attractiveness of a face by machine learning is shown. Attractiveness is determined by three deep models whose sum of predictions is the final score. Two annotated datasets available in the literature are employed for training and testing...

Full text available to download
Buried Object Characterization Using Ground Penetrating Radar Assisted by Data-Driven Surrogate-Models
Publication
- R. Yurt
- H. Torpi
- P. Mahouti
- A. Kizilay
- S. Kozieł
- IEEE Access - Year 2023
This work addresses artificial-intelligence-based buried object characterization using 3-D full-wave electromagnetic simulations of a ground penetrating radar (GPR). The task is to characterize a cylindrical, perfect electric conductor (PEC) object buried in various dispersive soil media, and in different positions. The main contributions of this work are (i) development of a fast and accurate data driven surrogate modeling...

Full text available to download
