Search results for: deep convolutional neural network

CNN Architectures for Human Pose Estimation from a Very Low Resolution Depth Image

Publication

P. Szczuko

- Year 2018

The paper is dedicated to proposing and evaluating a number of convolutional neural network architectures for calculating a multiple regression on 3D coordinates of human body joints tracked in a single low resolution depth image. The main challenge was to obtain a high precision in case of a noisy and coarse scan of the body, as observed by a depth sensor from a large distance. The regression network was expected to reason about...

Full text to download in external service

Speech Analytics Based on Machine Learning

Publication

- Year 2019

In this chapter, the process of speech data preparation for machine learning is discussed in detail. Examples of speech analytics methods applied to phonemes and allophones are shown. Further, an approach to automatic phoneme recognition involving optimized parametrization and a classifier belonging to machine learning algorithms is discussed. Feature vectors are built on the basis of descriptors coming from the music information...

Full text to download in external service

Machine Learning and Deep Learning Methods for Fast and Accurate Assessment of Transthoracic Echocardiogram Image Quality

Publication

W. Nazar
K. Nazar
L. Daniłowicz-Szymanowicz

- Life - Year 2024

High-quality echocardiogram images are the cornerstone of accurate and reliable measurements of the heart. Therefore, this study aimed to develop, validate and compare machine learning and deep learning algorithms for accurate and automated assessment of transthoracic echocardiogram image quality. In total, 4090 single-frame two-dimensional transthoracic echocardiogram...

Full text to download in external service

Machine Learning Applied to Aspirated and Non-Aspirated Allophone Classification—An Approach Based on Audio "Fingerprinting"

Publication

- Year 2018

The purpose of this study is to involve both Convolutional Neural Networks and a typical learning algorithm in the allophone classification process. A list of words including aspirated and non-aspirated allophones pronounced by native and non-native English speakers is recorded and then edited and analyzed. Allophones extracted from English speakers’ recordings are presented in the form of two-dimensional spectrogram images and...

Full text to download in external service

Intelligent Autonomous Robot Supporting Small Pets in Domestic Environment

Publication

- IFAC-PapersOnLine - Year 2019

In this contribution, we present preliminary results of the student project aimed at the development of an intelligent autonomous robot supporting small pets in a domestic environment. The main task of this robot is to protect a freely moving small pets against accidental stepping on them by home residents. For this purpose, we have developed the mobile robot which follows a pet and makes an alarm signal when a human is approaching....

Full text available to download

Architektury klasyfikatorów obrazów

Publication

K. Zawora

- Year 2022

Klasyfikacja obrazów jest zagadnieniem z dziedziny widzenia komputerowego. Polega na całościowej analizie obrazu i przypisaniu go do jednej lub wielu kategorii (klas). Współczesne rozwiązania tego problemu są w znacznej części realizowane z wykorzystaniem konwolucyjnych głębokich sieci neuronowych (convolutional neural network, CNN). W tym rozdziale opisano przełomowe architektury CNN oraz ewolucję state-of-the-art w klasyfikacji...

Full text to download in external service

Using Long-Short term Memory networks with Genetic Algorithm to predict engine condition

Publication

S. Erpolat Tasabat
O. Aydin

- Gazi University Journal of Science - Year 2022

Predictive maintenance (PdM) is a type of approach for maintenance processes, allowing maintenance actions to be managed depending on the machine's current condition. Maintenance is therefore carried out before failures occur. The approach doesn’t only help avoid abrupt failures but also helps lower maintenance cost and provides possibilities to manufacturers to manage maintenance budgets in a more efficient way. A new deep neural...

Full text to download in external service

Open-Set Speaker Identification Using Closed-Set Pretrained Embeddings

Publication

- Year 2022

The paper proposes an approach for extending deep neural networks-based solutions to closed-set speaker identification toward the open-set problem. The idea is built on the characteristics of deep neural networks trained for the classification tasks, where there is a layer consisting of a set of deep features extracted from the analyzed inputs. By extracting this vector and performing anomaly detection against the set of known...

Full text available to download

Towards Cancer Patients Classification Using Liquid Biopsy

Publication

- Year 2021

Liquid biopsy is a useful, minimally invasive diagnostic and monitoring tool for cancer disease. Yet, developing accurate methods, given the potentially large number of input features, and usually small datasets size remains very challenging. Recently, a novel feature parameterization based on the RNA-sequenced platelet data which uses the biological knowledge from the Kyoto Encyclopedia of Genes and Genomes, combined with a classifier...

Full text to download in external service

Driver fatigue detection method based on facial image analysis

Publication

- Year 2024

Nowadays, ensuring road safety is a crucial issue that demands continuous development and measures to minimize the risk of accidents. This paper presents the development of a driver fatigue detection method based on the analysis of facial images. To monitor the driver's condition in real-time, a video camera was used. The method of detection is based on analyzing facial features related to the mouth area and eyes, such as...

Full text to download in external service

LSTM-based method for LOS/NLOS identification in an indoor environment

Publication

- Year 2020

Due to the multipath propagation, harsh indoor environment significantly impacts transmitted signals which may adversely affect the quality of the radiocommunication services, with focus on the real-time ones. This negative effect may be significantly reduced (e.g. resources management and allocation) or compensated (e.g. correction of position estimation in radiolocalisation) by the LOS/NLOS identification algorithm. This paper...

When Neural Networks Meet Decisional DNA: A Promising New Perspective for Knowledge Representation and Sharing

Publication

H. Zhang
C. Sanin
E. Szczerbicki

- CYBERNETICS AND SYSTEMS - Year 2016

ABSTRACT In this article, we introduce a novel concept combining neural network technology and Decisional DNA for knowledge representation and sharing. Instead of using traditional machine learning and knowledge discovery methods, this approach explores the way of knowledge extraction through deep learning processes based on a domain’s past decisional events captured by Decisional DNA. We compare our approach with kNN (k-nearest...

Full text available to download

Sign Language Recognition Using Convolution Neural Networks

Publication

- Year 2024

The objective of this work was to provide an app that can automatically recognize hand gestures from the American Sign Language (ASL) on mobile devices. The app employs a model based on Convolutional Neural Network (CNN) for gesture classification. Various CNN architectures and optimization strategies suitable for devices with limited resources were examined. InceptionV3 and VGG-19 models exhibited negligibly higher accuracy than...

Full text available to download

Investigating Feature Spaces for Isolated Word Recognition

Publication

G. Korvel
G. Tamulevicus
P. Treigys
J. Bernataviciene
B. Kostek

- Year 2018

Much attention is given by researchers to the speech processing task in automatic speech recognition (ASR) over the past decades. The study addresses the issue related to the investigation of the appropriateness of a two-dimensional representation of speech feature spaces for speech recognition tasks based on deep learning techniques. The approach combines Convolutional Neural Networks (CNNs) and timefrequency signal representation...

Data Acquisition and Processing for GeoAI Models to Support Sustainable Agricultural Practices

Publication

A. G. Pereira
A. Ojo
C. Edward
L. Porwol

- Year 2020

There are growing opportunities to leverage new technologies and data sources to address global problems related to sustainability, climate change, and biodiversity loss. The emerging discipline of GeoAI resulting from the convergence of AI and Geospatial science (Geo-AI) is enabling the possibility to harness the increasingly available open Earth Observation data collected from different constellations of satellites and sensors...

Full text available to download

Deep Learning: A Case Study for Image Recognition Using Transfer Learning

Publication

S. Erpolat Tasabat
O. Aydin

- Year 2021

Deep learning (DL) is a rising star of machine learning (ML) and artificial intelligence (AI) domains. Until 2006, many researchers had attempted to build deep neural networks (DNN), but most of them failed. In 2006, it was proven that deep neural networks are one of the most crucial inventions for the 21st century. Nowadays, DNN are being used as a key technology for many different domains: self-driven vehicles, smart cities,...

Full text to download in external service

Deep Learning

Publication

S. Erpolat Tasabat
O. Aydin

- Year 2021

Deep learning (DL) is a rising star of machine learning (ML) and artificial intelligence (AI) domains. Until 2006, many researchers had attempted to build deep neural networks (DNN), but most of them failed. In 2006, it was proven that deep neural networks are one of the most crucial inventions for the 21st century. Nowadays, DNN are being used as a key technology for many different domains: self-driven vehicles, smart cities,...

Full text to download in external service

Deep learning techniques for biometric security: A systematic review of presentation attack detection systems

Publication

K. Shaheed
P. Szczuko
M. Kumar
I. Qureshi
Q. Abbas
I. Ullah

- ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE - Year 2024

Biometric technology, including finger vein, fingerprint, iris, and face recognition, is widely used to enhance security in various devices. In the past decade, significant progress has been made in improving biometric sys- tems, thanks to advancements in deep convolutional neural networks (DCNN) and computer vision (CV), along with large-scale training datasets. However, these systems have become targets of various attacks, with...

Full text to download in external service

Bees Detection on Images: Study of Different Color Models for Neural Networks

Publication

- Year 2019

This paper presents an approach to bee detection in video streams using a neural network classifier. We describe the motivation for our research and the methodology of data acquisition. The main contribution to this work is a comparison of different color models used as an input format for a feedforward convolutional architecture applied to bee detection. The detection process has is based on a neural binary classifier that classifies...

Full text available to download

Visual Content Learning in a Cognitive Vision Platform for Hazard Control (CVP-HC)

Publication

C. Silva de Oliveira
C. Sanin
E. Szczerbicki

- CYBERNETICS AND SYSTEMS - Year 2019

This work is part of an effort for the development of a Cognitive Vision Platform for Hazard Control (CVP-HC) for applications in industrial workplaces, adaptable to a wide range of environments. The paper focuses on hazards resulted from the nonuse of personal protective equipment (PPE). Given the results of previous analysis of supervised techniques for the problem of classification of a few PPE (boots, hard hats, and gloves...

Full text available to download

Playback detection using machine learning with spectrogram features approach

Publication

- Year 2017

This paper presents 2D image processing approach to playback detection in automatic speaker verification (ASV) systems using spectrograms as speech signal representation. Three feature extraction and classification methods: histograms of oriented gradients (HOG) with support vector machines (SVM), HAAR wavelets with AdaBoost classifier and deep convolutional neural networks (CNN) were compared on different data partitions in respect...

Full text available to download

A Novel IoT-Perceptive Human Activity Recognition (HAR) Approach Using Multi-Head Convolutional Attention

Publication

H. Zhang
Z. Xiao
J. Wang
F. Li
E. Szczerbicki

- IEEE Internet of Things Journal - Year 2019

Together with fast advancement of the Internet of Things (IoT), smart healthcare applications and systems are equipped with increasingly more wearable sensors and mobile devices. These sensors are used not only to collect data, but also, and more importantly, to assist in daily activity tracking and analyzing of their users. Various human activity recognition (HAR) approaches are used to enhance such tracking. Most of the existing...

Full text available to download

Adaptive Hounsfield Scale Windowing in Computed Tomography Liver Segmentation

Publication

- Year 2024

In computed tomography (CT) imaging, the Hounsfield Unit (HU) scale quantifies radiodensity, but its nonlinear nature across organs and lesions complicates machine learning analysis. This paper introduces an automated method for adaptive HU scale windowing in deep learning-based CT liver segmentation. We propose a new neural network layer that optimizes HU scale window parameters during training. Experiments on the Liver Tumor...

Full text to download in external service

Assessing the attractiveness of human face based on machine learning

Publication

- Year 2023

The attractiveness of the face plays an important role in everyday life, especially in the modern world where social media and the Internet surround us. In this study, an attempt to assess the attractiveness of a face by machine learning is shown. Attractiveness is determined by three deep models whose sum of predictions is the final score. Two annotated datasets available in the literature are employed for training and testing...

Full text available to download

Deep learning-enabled integration of renewable energy sources through photovoltaics in buildings

Publication

M. Arun
T. T. Le
D. Barik
P. Sharma
S. M. Osman
V. K. Huynh
J. Kowalski
V. H. Dong
V. V. Le

- Case Studies in Thermal Engineering - Year 2024

Installing photovoltaic (PV) systems in buildings is one of the most effective strategies for achieving sustainable energy goals and reducing carbon emissions. However, the requirement for efficient energy management, the fluctuating energy demands, and the intermittent nature of solar power are a few of the obstacles to the seamless integration of PV systems into buildings. These complexities surpass the capabilities of rule-based...

Full text available to download

Ranking Speech Features for Their Usage in Singing Emotion Classification

Publication

- Year 2020

This paper aims to retrieve speech descriptors that may be useful for the classification of emotions in singing. For this purpose, Mel Frequency Cepstral Coefficients (MFCC) and selected Low-Level MPEG 7 descriptors were calculated based on the RAVDESS dataset. The database contains recordings of emotional speech and singing of professional actors presenting six different emotions. Employing the algorithm of Feature Selection based...

Full text available to download

A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces

Publication

G. Tamulevicius
G. Korvel
A. B. Yayak
P. Treigys
J. Bernataviciene
B. Kostek

- Electronics - Year 2020

In this research, a study of cross-linguistic speech emotion recognition is performed. For this purpose, emotional data of different languages (English, Lithuanian, German, Spanish, Serbian, and Polish) are collected, resulting in a cross-linguistic speech emotion dataset with the size of more than 10.000 emotional utterances. Despite the bi-modal character of the databases gathered, our focus is on the acoustic representation...

Full text available to download

Vehicle detector training with labels derived from background subtraction algorithms in video surveillance

Publication

- Year 2018

Vehicle detection in video from a miniature station- ary closed-circuit television (CCTV) camera is discussed in the paper. The camera provides one of components of the intelligent road sign developed in the project concerning the traffic control with the use of autonomous devices being developed. Modern Convolutional Neural Network (CNN) based detectors need big data input, usually demanding their manual labeling. In the presented...

Improving Accuracy of Respiratory Rate Estimation by Restoring High Resolution Features With Transformers and Recursive Convolutional Models

Publication

A. Kwaśniewska
M. Szankin
J. Rumiński
A. Sarah
D. Gamba

- Year 2021

Non-contact evaluation of vital signs has been becoming increasingly important, especially in light of the COVID- 19 pandemic, which is causing the whole world to examine people’s interactions in public places at a scale never seen before. However, evaluating one’s vital signs can be a relatively complex procedure, which requires both time and physical contact between examiner and examinee. These re- quirements limit the number...

Full text available to download

Deep-Learning-Based Precise Characterization of Microwave Transistors Using Fully-Automated Regression Surrogates

Publication

N. Calik
F. Gunes
S. Kozieł
A. Pietrenko-Dąbrowska
M. Belen
P. Mahouti

- Scientific Reports - Year 2023

Accurate models of scattering and noise parameters of transistors are instrumental in facilitating design procedures of microwave devices such as low-noise amplifiers. Yet, data-driven modeling of transistors is a challenging endeavor due to complex relationships between transistor characteristics and its designable parameters, biasing conditions, and frequency. Artificial neural network (ANN)-based methods, including deep learning...

Full text available to download

Towards bees detection on images: study of different color models for neural networks

Publication

- Year 2019

This paper presents an approach to bee detection in videostreams using a neural network classifier. We describe the motivationfor our research and the methodology of data acquisition. The maincontribution to this work is a comparison of different color models usedas an input format for a feedforward convolutional architecture appliedto bee detection. The detection process has is based on a neural...

Deep Learning-Based Intrusion System for Vehicular Ad Hoc Networks

Publication

L. Fei
Z. Jiayan
S. Jiaqi
E. Szczerbicki

- CMC-Computers Materials & Continua - Year 2020

The increasing use of the Internet with vehicles has made travel more convenient. However, hackers can attack intelligent vehicles through various technical loopholes, resulting in a range of security issues. Due to these security issues, the safety protection technology of the in-vehicle system has become a focus of research. Using the advanced autoencoder network and recurrent neural network in deep learning, we investigated...

Full text available to download

MP3vec: A Reusable Machine-Constructed Feature Representation for Protein Sequences

Publication

S. R. Gupte
D. S. Jain
A. Srinivasan
R. Aduri

- Year 2020

—Machine Learning (ML) methods have been used with varying degrees of success on protein prediction tasks, with two inherent limitations. First, prediction performance often depends upon the features extracted from the proteins. Second, experimental data may be insufficient to construct reliable ML models. Here we introduce MP3vec, a transferable representation for protein sequences that is designed to be used specifically for sequence-to-sequence...

Full text to download in external service

Deep learning for recommending subscription-limited documents

Publication

- Year 2020

Documents recommendation for a commercial, subscription-based online platform is important due to the difficulty in navigation through a large volume and diversity of content available to clients. However, this is also a challenging task due to the number of new documents added every day and decreasing relevance of older contents. To solve this problem, we propose deep neural network architecture that combines autoencoder with...

Full text available to download

Method for Clustering of Brain Activity Data Derived from EEG Signals

Publication

- FUNDAMENTA INFORMATICAE - Year 2019

A method for assessing separability of EEG signals associated with three classes of brain activity is proposed. The EEG signals are acquired from 23 subjects, gathered from a headset consisting of 14 electrodes. Data are processed by applying Discrete Wavelet Transform (DWT) for the signal analysis and an autoencoder neural network for the brain activity separation. Processing involves 74 wavelets from 3 DWT families: Coiflets,...

Full text available to download

Improvement of speech intelligibility in the presence of noise interference using the Lombard effect and an automatic noise interference profiling based on deep learning

Publication

K. Kąkol

- Year 2023

The Lombard effect is a phenomenon that results in speech intelligibility improvement when applied to noise. There are many distinctive features of Lombard speech that were recalled in this dissertation. This work proposes the creation of a system capable of improving speech quality and intelligibility in real-time measured by objective metrics and subjective tests. This system consists of three main components: speech type detection,...

Full text available to download

Vehicle Detection with Self-Training for Adaptative Video Processing Embedded Platform

Publication

- Applied Sciences-Basel - Year 2020

Traffic monitoring from closed-circuit television (CCTV) cameras on embedded systems is the subject of the performed experiments. Solving this problem encounters difficulties related to the hardware limitations, and possible camera placement in various positions which affects the system performance. To satisfy the hardware requirements, vehicle detection is performed using a lightweight Convolutional Neural Network (CNN), named...

Full text available to download

INVESTIGATION OF THE LOMBARD EFFECT BASED ON A MACHINE LEARNING APPROACH

Publication

G. Korvel
P. Treigys
K. Kąkol
B. Kostek

- International Journal of Applied Mathematics and Computer Science - Year 2023

The Lombard effect is an involuntary increase in the speaker’s pitch, intensity, and duration in the presence of noise. It makes it possible to communicate in noisy environments more effectively. This study aims to investigate an efficient method for detecting the Lombard effect in uttered speech. The influence of interfering noise, room type, and the gender of the person on the detection process is examined. First, acoustic parameters...

Full text available to download

Poprawa jakości klasyfikacji głębokich sieci neuronowych poprzez optymalizację ich struktury i dwuetapowy proces uczenia

Publication

A. Kwasigroch

- Year 2024

W pracy doktorskiej podjęto problem realizacji algorytmów głębokiego uczenia w warunkach deficytu danych uczących. Głównym celem było opracowanie podejścia optymalizującego strukturę sieci neuronowej oraz zastosowanie uczeniu dwuetapowym, w celu uzyskania mniejszych struktur, zachowując przy tym dokładności. Proponowane rozwiązania poddano testom na zadaniu klasyfikacji znamion skórnych na znamiona złośliwe i łagodne. W pierwszym...

Full text available to download

Optimal selection of input features and an acompanying neural network structure for the classification purposes - skin lesions case study

Publication

- Year 2018

Malignant melanomas are the most deadly type of skin cancers however detected early enough give a high chances for successful treatment. The last years saw the dynamic growth of interest of automatic computer-aided skin cancer diagnosis. Every month brings new research results on new approaches to this problem, new methods of preprocessing, new classifiers, new ideas to follow etc. In particular, the rapid development of dermatoscopy,...

Full text to download in external service

IFE: NN-aided Instantaneous Pitch Estimation

Publication

- Year 2021

Pitch estimation is still an open issue in contemporary signal processing research. Nowadays, growing momentum of machine learning techniques application in the data-driven society allows for tackling this problem from a new perspective. This work leverages such an opportunity to propose a refined Instantaneous Frequency and power based pitch Estimator method called IFE. It incorporates deep neural network based pitch estimation...

Full text available to download

Study of Statistical Text Representation Methods for Performance Improvement of a Hierarchical Attention Network

Publication

- Applied Sciences-Basel - Year 2021

To effectively process textual data, many approaches have been proposed to create text representations. The transformation of a text into a form of numbers that can be computed using computers is crucial for further applications in downstream tasks such as document classification, document summarization, and so forth. In our work, we study the quality of text representations using statistical methods and compare them to approaches...

Full text available to download

BP-EVD: Forward Block-Output Propagation for Efficient Video Denoising

Publication

- IEEE TRANSACTIONS ON IMAGE PROCESSING - Year 2022

Denoising videos in real-time is critical in many applications, including robotics and medicine, where varying light conditions, miniaturized sensors, and optics can substantially compromise image quality. This work proposes the first video denoising method based on a deep neural network that achieves state-of-the-art performance on dynamic scenes while running in real-time on VGA video resolution with no frame latency. The backbone...

Full text to download in external service

Detecting Objects of Various Categories in Optical Remote Sensing Imagery Using Neural Networks

Publication

A. Madajczak
M. Ciecholewski

- Year 2024

The effective detection of objects in remote sensing images is of great research importance, so recent years have seen a significant progress in deep learning techniques in this field. However, despite much valuable research being conducted, many challenges still remain. A lot of research projects focus on detecting objects of a single category (class), while correctly detecting objects of different categories is much harder. The...

Full text to download in external service

Toward Intelligent Recommendations Using the Neural Knowledge DNA

Publication

G. Ning
C. Wu
H. Zhang
E. Szczerbicki

- CYBERNETICS AND SYSTEMS - Year 2021

In this paper we propose a novel recommendation approach using past news click data and the Neural Knowledge DNA (NK-DNA). The Neural Knowledge DNA is a novel knowledge representation method designed to support discovering, storing, reusing, improving, and sharing knowledge among machines and computing systems. We examine our approach for news recommendation tasks on the MIND benchmark dataset. By taking advantages of NK-DNA, deep...

Full text available to download

Urban scene semantic segmentation using the U-Net model

Publication

M. Ciecholewski

- Year 2023

Vision-based semantic segmentation of complex urban street scenes is a very important function during autonomous driving (AD), which will become an important technology in industrialized countries in the near future. Today, advanced driver assistance systems (ADAS) improve traffic safety thanks to the application of solutions that enable detecting objects, recognising road signs, segmenting the road, etc. The basis for these functionalities...

Full text to download in external service

How to Sort Them? A Network for LEGO Bricks Classification

Publication

- Year 2022

LEGO bricks are highly popular due to the ability to build almost any type of creation. This is possible thanks to availability of multiple shapes and colors of the bricks. For the smooth build process the bricks need to properly sorted and arranged. In our work we aim at creating an automated LEGO bricks sorter. With over 3700 different LEGO parts bricks classification has to be done with deep neural networks. The question arises...

Full text available to download

Satellite Image Classification Using a Hierarchical Ensemble Learning and Correlation Coefficient-Based Gravitational Search Algorithm

Publication

K. Thiagarajan
M. Manapakkam Anandan
A. Stateczny
P. Bidare Divakarachari
H. Kivudujogappa Lingappa

- Remote Sensing - Year 2021

Satellite image classification is widely used in various real-time applications, such as the military, geospatial surveys, surveillance and environmental monitoring. Therefore, the effective classification of satellite images is required to improve classification accuracy. In this paper, the combination of Hierarchical Framework and Ensemble Learning (HFEL) and optimal feature selection is proposed for the precise identification...

Full text available to download

Evaluation of aspiration problems in L2 English pronunciation employing machine learning

Publication

M. Piotrowska
A. Czyżewski
T. Ciszewski
G. Korvel
A. Kurowski
B. Kostek

- Journal of the Acoustical Society of America - Year 2021

The approach proposed in this study includes methods specifically dedicated to the detection of allophonic variation in English. This study aims to find an efficient method for automatic evaluation of aspiration in the case of Polish second-language (L2) English speakers’ pronunciation when whole words are analyzed instead of particular allophones extracted from words. Sample words including aspirated and unaspirated allophones...

Full text available to download

Search

Filters

Catalog

Search results for: deep convolutional neural network

Olgun Aydin dr