Search results for: CONVOLUTIONAL NEURAL NETWORK - Bridge of Knowledge

  • Exploring Neural Networks for Musical Instrument Identification in Polyphonic Audio

    Publication

    - IEEE INTELLIGENT SYSTEMS - Year 2024

    The purpose of this paper is to introduce neural network-based methods that surpass state-of-the-art (SOTA) models, either by training faster or by having a simpler architecture, while maintaining comparable effectiveness in musical instrument identification in polyphonic music. Several approaches are presented, including two proposals by the authors, i.e., spiking neural networks (SNN) and a modular deep learning model named FMCNN (Fully...

    Full text to download in external service

  • Neural networks and deep learning

    Publication

    - Year 2022

    In this chapter we will provide the general and fundamental background related to Neural Networks and Deep Learning techniques. Specifically, we divide the fundamentals of deep learning in three parts, the first one introduces Deep Feed Forward Networks and the main training algorithms in the context of optimization. The second part covers Convolutional Neural Networks (CNN) and discusses their main advantages and shortcomings...

    Full text to download in external service

  • 1D convolutional context-aware architectures for acoustic sensing and recognition of passing vehicle type

    Publication

    The paper proposes a network architecture that may be employed for sensing and recognizing the type of a vehicle on the basis of audio recordings made in the proximity of a road. The analyzed road traffic consists of both passenger cars and heavier vehicles. Excerpts from recordings that do not contain the sounds of passing vehicles are also taken into account and marked as containing silence....

  • Selected Technical Issues of Deep Neural Networks for Image Classification Purposes

    In recent years, deep learning and especially Deep Neural Networks (DNN) have obtained amazing performance on a variety of problems, in particular in classification or pattern recognition. Among many kinds of DNNs, the Convolutional Neural Networks (CNN) are most commonly used. However, due to their complexity, there are many problems related to, but not limited to, optimizing network parameters, avoiding overfitting and ensuring good...

    Full text available to download

  • Deep neural networks for human pose estimation from a very low resolution depth image

    Publication

    The work presented in the paper is dedicated to determining and evaluating the most efficient neural network architecture applied as a multiple regression network localizing human body joints in 3D space based on a single low resolution depth image. The main challenge was to deal with a noisy and coarse representation of the human body, as observed by a depth sensor from a large distance, and to achieve high localization precision....

    Full text available to download

  • Sign Language Recognition Using Convolution Neural Networks

    Publication

    The objective of this work was to provide an app that can automatically recognize hand gestures from the American Sign Language (ASL) on mobile devices. The app employs a model based on Convolutional Neural Network (CNN) for gesture classification. Various CNN architectures and optimization strategies suitable for devices with limited resources were examined. InceptionV3 and VGG-19 models exhibited negligibly higher accuracy than...

    Full text available to download

  • Fusion-based Representation Learning Model for Multimode User-generated Social Network Content

    As mobile networks and APPs are developed, user-generated content (UGC), which includes multi-source heterogeneous data like user reviews, tags, scores, images, and videos, has become an essential basis for improving the quality of personalized services. Due to the multi-source heterogeneous nature of the data, big data fusion offers both promise and drawbacks. With the rise of mobile networks and applications, UGC, which includes...

    Full text to download in external service

  • Bees Detection on Images: Study of Different Color Models for Neural Networks

    Publication

    This paper presents an approach to bee detection in video streams using a neural network classifier. We describe the motivation for our research and the methodology of data acquisition. The main contribution of this work is a comparison of different color models used as an input format for a feedforward convolutional architecture applied to bee detection. The detection process is based on a neural binary classifier that classifies...

    Full text available to download

  • An Intelligent Approach to Short-Term Wind Power Prediction Using Deep Neural Networks

    Publication

    - Journal of Artificial Intelligence and Soft Computing Research - Year 2023

    In this paper, an intelligent approach to the Short-Term Wind Power Prediction (STWPP) problem is considered, with the use of various types of Deep Neural Networks (DNNs). The impact of the prediction time horizon length on accuracy, and the influence of temperature on prediction effectiveness have been analyzed. Three types of DNNs have been implemented and tested, including: CNN (Convolutional Neural Networks), GRU (Gated Recurrent...

    Full text available to download

  • Towards bees detection on images: study of different color models for neural networks

    Publication

    This paper presents an approach to bee detection in video streams using a neural network classifier. We describe the motivation for our research and the methodology of data acquisition. The main contribution of this work is a comparison of different color models used as an input format for a feedforward convolutional architecture applied to bee detection. The detection process is based on a neural...

  • Analysis of 2D Feature Spaces for Deep Learning-based Speech Recognition

    Publication

    - JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2018

    convolutional neural network (CNN) which is a class of deep, feed-forward artificial neural network. We decided to analyze audio signal feature maps, namely spectrograms, linear and Mel-scale cepstrograms, and chromagrams. The choice was made upon the fact that CNN performs well in 2D data-oriented processing contexts. Feature maps were employed in the Lithuanian word recognition task. The spectral analysis led to the highest word...

  • Efficiency of Artificial Intelligence Methods for Hearing Loss Type Classification: an Evaluation

    Publication
    • M. Kassjański
    • M. Kulawiak
    • T. Przewoźny
    • D. Tretiakow
    • J. Kuryłowicz
    • A. Molisz
    • K. Koźmiński
    • A. Kwaśniewska
    • P. Mierzwińska-Dolny
    • M. Grono

    - Journal of Automation, Mobile Robotics and Intelligent Systems - JAMRIS - Year 2024

    The evaluation of hearing loss is primarily conducted by pure tone audiometry testing, which is often regarded as the gold standard for assessing auditory function. If the presence of hearing loss is determined, it is possible to differentiate between three types of hearing loss: sensorineural, conductive, and mixed. This study presents a comprehensive comparison of a variety of AI classification models, performed on 4007 pure tone...

    Full text to download in external service

  • Comparison of Deep Learning Approaches in Classification of Glacial Landforms

    Publication

    - International Journal of Electronics and Telecommunications - Year 2024

    Glacial landforms, created by the continuous movements of glaciers over millennia, are crucial topics in geomorphological research. Their systematic analysis affords invaluable insights into past climatic oscillations and augments understanding of long-term climate change dynamics. The classification of these types of terrain traditionally depends on labor-intensive manual or semi-automated methods. However, the emergence of automated...

    Full text to download in external service

  • Detecting type of hearing loss with different AI classification methods: a performance review

    Publication
    • M. Kassjański
    • M. Kulawiak
    • T. Przewoźny
    • D. Tretiakow
    • J. Kuryłowicz
    • A. Molisz
    • K. Koźmiński
    • A. Kwaśniewska
    • P. Mierzwińska-Dolny
    • M. Grono

    - Year 2023

    Hearing is one of the most crucial senses for all humans. It allows people to hear and connect with the environment, the people they can meet and the knowledge they need to live their lives to the fullest. Hearing loss can have a detrimental impact on a person's quality of life in a variety of ways, ranging from fewer educational and job opportunities due to impaired communication to social withdrawal in severe situations. Early...

    Full text to download in external service

  • Classifying Emotions in Film Music - A Deep Learning Approach

    The paper presents an application for automatically classifying emotions in film music. A model of emotions is proposed, which is also associated with colors. The model created has nine emotional states, to which colors are assigned according to the color theory in film. Subjective tests are carried out to check the correctness of the assumptions behind the adopted emotion model. For that purpose, a statistical analysis of the...

    Full text available to download

  • Musical Instrument Identification Using Deep Learning Approach

    Publication

    - SENSORS - Year 2022

    The work aims to propose a novel approach for automatically identifying all instruments present in an audio excerpt using sets of individual convolutional neural networks (CNNs) per tested instrument. The paper starts with a review of tasks related to musical instrument identification. It focuses on tasks performed, input type, algorithms employed, and metrics used. The paper starts with the background presentation, i.e., metadata...

    Full text available to download

  • Deep Video Multi-task Learning Towards Generalized Visual Scene Enhancement and Understanding

    Publication

    - Year 2024

    The goal of this thesis was to develop efficient video multi-task convolutional architectures for a range of diverse vision tasks, on RGB scenes, leveraging i) task relationships and ii) motion information to improve multi-task performance. The approach we take starts from the integration of diverse tasks within video multi-task learning networks. We present the first two datasets of their kind in the existing literature, featuring...

    Full text available to download

  • Style Transfer for Detecting Vehicles with Thermal Camera

    Publication

    - Year 2019

    In this work we focus on nighttime vehicle detection for intelligent traffic monitoring from the thermal camera. To train a Convolutional Neural Network (CNN) detector we create a stylized version of COCO (Common Objects in Context) dataset using Style Transfer technique that imitates images obtained from thermal cameras. This new dataset is further used for fine-tuning of the model and as a result detection accuracy on images...

  • Semantic segmentation training using imperfect annotations and loss masking

    One of the most significant factors affecting supervised neural network training is the precision of the annotations. Also, in the case of an expert group, the problem of inconsistent data annotations is an integral part of real-world supervised learning processes, well-known to researchers. One practical example is a weak ground truth delineation for medical image segmentation. In this paper, we have developed a new method of accurate...

    Full text to download in external service

  • CNN Architectures for Human Pose Estimation from a Very Low Resolution Depth Image

    Publication

    - Year 2018

    The paper is dedicated to proposing and evaluating a number of convolutional neural network architectures for calculating a multiple regression on 3D coordinates of human body joints tracked in a single low resolution depth image. The main challenge was to obtain a high precision in case of a noisy and coarse scan of the body, as observed by a depth sensor from a large distance. The regression network was expected to reason about...

    Full text to download in external service

  • Pose-Invariant Face Detection by Replacing Deep Neurons with Capsules for Thermal Imagery in Telemedicine

    The aim of this work was to examine the potential of thermal imaging as a cost-effective tool for convenient, non-intrusive remote monitoring of elderly people in different possible head orientations, without imposing specific behavior on users, e.g. looking toward the camera. Illumination and pose invariant head tracking is important for many medical applications as it can provide information, e.g. about vital signs, sensory...

    Full text available to download

  • Deep learning approach on surface EEG based Brain Computer Interface

    Publication

    - Year 2022

    In this work we analysed the application of convolutional neural networks in motor imagery classification for Brain Computer Interface (BCI) purposes. To increase the accuracy of classification we proposed a solution that combines the Common Spatial Pattern (CSP) with a convolutional network (ConvNet). Electroencephalography (EEG) is one of the modalities we try to use for controlling the prosthetic arm. Therefore in this...

    Full text to download in external service

  • User Orientation Detection in Relation to Antenna Geometry in Ultra-Wideband Wireless Body Area Networks Using Deep Learning

    Publication

    - SENSORS - Year 2024

    In this paper, the issue of detecting a user’s position in relation to the antenna geometry in ultra-wideband (UWB) off-body wireless body area network (WBAN) communication using deep learning methods is presented. To measure the impulse response of the channel, a measurement stand consisting of EVB1000 devices and DW1000 radio modules was developed and indoor static measurement scenarios were performed. It was proven that for...

    Full text available to download

  • Image Classifier Architectures (Architektury klasyfikatorów obrazów)

    Publication

    - Year 2022

    Image classification is a problem in the field of computer vision. It consists of analyzing an image as a whole and assigning it to one or more categories (classes). Contemporary solutions to this problem are largely implemented using deep convolutional neural networks (CNN). This chapter describes breakthrough CNN architectures and the evolution of the state of the art in classification...

    Full text to download in external service

  • Intelligent Autonomous Robot Supporting Small Pets in Domestic Environment

    In this contribution, we present preliminary results of a student project aimed at the development of an intelligent autonomous robot supporting small pets in a domestic environment. The main task of this robot is to protect freely moving small pets from being accidentally stepped on by home residents. For this purpose, we have developed a mobile robot which follows a pet and emits an alarm signal when a human is approaching....

    Full text available to download

  • Piotr Szczuko dr hab. inż.

    Piotr Szczuko received his M.Sc. degree in 2002. His thesis was dedicated to the examination of correlation phenomena between the perception of sound and vision for surround sound and digital image. He finished his Ph.D. studies in 2007 and one year later completed a dissertation "Application of Fuzzy Rules in Computer Character Animation" that received the award of the Prime Minister of Poland. His interests include: processing of audio and video, computer...

  • Thermal Images Analysis Methods using Deep Learning Techniques for the Needs of Remote Medical Diagnostics

    Publication

    - Year 2020

    Remote medical diagnostic solutions have recently gained more importance due to global demographic shifts and play a key role in the evaluation of health status during an epidemic. Contactless estimation of vital signs with image processing techniques is especially important since it allows for obtaining health status without the use of additional sensors. Thermography enables us to reveal additional details, imperceptible in images acquired...

    Full text available to download

  • Driver fatigue detection method based on facial image analysis

    Publication

    Nowadays, ensuring road safety is a crucial issue that demands continuous development and measures to minimize the risk of accidents. This paper presents the development of a driver fatigue detection method based on the analysis of facial images. To monitor the driver's condition in real-time, a video camera was used. The method of detection is based on analyzing facial features related to the mouth area and eyes, such as...

    Full text to download in external service

  • Towards Cancer Patients Classification Using Liquid Biopsy

    Liquid biopsy is a useful, minimally invasive diagnostic and monitoring tool for cancer disease. Yet, developing accurate methods, given the potentially large number of input features and the usually small dataset sizes, remains very challenging. Recently, a novel feature parameterization based on the RNA-sequenced platelet data which uses the biological knowledge from the Kyoto Encyclopedia of Genes and Genomes, combined with a classifier...

    Full text to download in external service

  • Underground Water Level Prediction in Remote Sensing Images Using Improved Hydro Index Value with Ensemble Classifier

    Publication
    • A. Stateczny
    • S. C. Narahari
    • P. Vurubindi
    • N. S. Guptha
    • K. Srinivas

    - Remote Sensing - Year 2023

    The economic sustainability of aquifers across the world relies on accurate and rapid estimates of groundwater storage changes, but this becomes difficult due to the absence of in situ groundwater surveys in most areas. By closing the water balance, hydrologic remote sensing measures offer a possible method for quantifying changes in groundwater storage. However, it is uncertain to what extent remote sensing data can provide an...

    Full text available to download

  • An automated learning model for twitter sentiment analysis using Ranger AdaBelief optimizer based Bidirectional Long Short Term Memory

    Publication

    - EXPERT SYSTEMS - Year 2024

    Sentiment analysis is an automated approach used to analyse textual data in order to describe public opinion. Sentiment analysis plays a major role in the day-to-day life of individuals. However, precise interpretation of text still remains a major concern in classifying sentiment. So, this research introduced Bidirectional Long Short Term Memory with Ranger AdaBelief Optimizer (Bi-LSTM RAO)...

    Full text to download in external service

  • Deep neural networks for data analysis 24/25

    e-Learning Courses
    • J. Cychnerski
    • K. Draszawka

    This course covers an introduction to supervised machine learning, the construction of basic artificial deep neural networks (DNNs) and basic training algorithms, as well as an overview of popular DNN architectures (convolutional networks, recurrent networks, transformers). The course introduces students to popular regularization techniques for deep models. Besides theory, a large part of the course is a project in which students apply...

  • Vehicle detector training with labels derived from background subtraction algorithms in video surveillance

    Publication

    - Year 2018

    Vehicle detection in video from a miniature stationary closed-circuit television (CCTV) camera is discussed in the paper. The camera is one of the components of the intelligent road sign developed in the project concerning traffic control with the use of autonomous devices. Modern Convolutional Neural Network (CNN) based detectors need big data input, usually demanding manual labeling. In the presented...

  • Buried Object Characterization Using Ground Penetrating Radar Assisted by Data-Driven Surrogate-Models

    Publication
    • R. Yurt
    • H. Torpi
    • P. Mahouti
    • A. Kizilay
    • S. Kozieł

    - IEEE Access - Year 2023

    This work addresses artificial-intelligence-based buried object characterization using 3-D full-wave electromagnetic simulations of a ground penetrating radar (GPR). The task is to characterize a cylindrical perfect electric conductor (PEC) object buried in various dispersive soil media, and in different positions. The main contributions of this work are (i) development of a fast and accurate data driven surrogate modeling...

    Full text available to download

  • A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces

    Publication
    • G. Tamulevicius
    • G. Korvel
    • A. B. Yayak
    • P. Treigys
    • J. Bernataviciene
    • B. Kostek

    - Electronics - Year 2020

    In this research, a study of cross-linguistic speech emotion recognition is performed. For this purpose, emotional data of different languages (English, Lithuanian, German, Spanish, Serbian, and Polish) are collected, resulting in a cross-linguistic speech emotion dataset of more than 10,000 emotional utterances. Despite the bi-modal character of the databases gathered, our focus is on the acoustic representation...

    Full text available to download

  • Ranking Speech Features for Their Usage in Singing Emotion Classification

    Publication

    - Year 2020

    This paper aims to retrieve speech descriptors that may be useful for the classification of emotions in singing. For this purpose, Mel Frequency Cepstral Coefficients (MFCC) and selected Low-Level MPEG 7 descriptors were calculated based on the RAVDESS dataset. The database contains recordings of emotional speech and singing of professional actors presenting six different emotions. Employing the algorithm of Feature Selection based...

    Full text available to download

  • Data augmentation for improving deep learning in image classification problem

    Publication

    These days deep learning is the fastest-growing area in the field of Machine Learning (ML) and Deep Neural Networks (DNN). Among many DNN structures, Convolutional Neural Networks (CNN) are currently the main tool used for image analysis and classification purposes. Despite great achievements and prospects, deep neural networks and the accompanying learning algorithms have some relevant challenges to tackle. In this...

    Full text to download in external service

  • Spatiotemporal Assessment of Satellite Image Time Series for Land Cover Classification Using Deep Learning Techniques: A Case Study of Reunion Island, France

    Publication
    • N. N. Navnath
    • K. Chandrasekaran
    • A. Stateczny
    • V. M. Sundaram
    • P. Panneer

    - Remote Sensing - Year 2022

    Current Earth observation systems generate massive amounts of satellite image time series (SITS) to keep track of geographical areas over time and to monitor and identify environmental and climate change. Efficiently analyzing such data remains an unresolved issue in remote sensing. In classifying land cover, utilizing SITS rather than one image might help differentiate between classes because of their varied temporal patterns. The aim...

    Full text available to download

  • Urban scene semantic segmentation using the U-Net model

    Publication

    - Year 2023

    Vision-based semantic segmentation of complex urban street scenes is a very important function during autonomous driving (AD), which will become an important technology in industrialized countries in the near future. Today, advanced driver assistance systems (ADAS) improve traffic safety thanks to the application of solutions that enable detecting objects, recognising road signs, segmenting the road, etc. The basis for these functionalities...

    Full text to download in external service

  • Optimized Deep Learning Model for Flood Detection Using Satellite Images

    Publication
    • A. Stateczny
    • H. D. Praveena
    • R. H. Krishnappa
    • K. R. Chythanya
    • B. B. Babysarojam

    - Remote Sensing - Year 2023

    The increasing amount of rain produces a number of issues in Kerala, particularly in urban regions where the drainage system is frequently unable to handle a significant amount of water in such a short duration. Meanwhile, standard flood detection results are inaccurate for complex phenomena and cannot handle enormous quantities of data. In order to overcome those drawbacks and enhance the outcomes of conventional flood detection...

    Full text available to download

  • Mask Detection and Classification in Thermal Face Images

    Publication

    Face masks are recommended to reduce the transmission of many viruses, especially SARS-CoV-2. Therefore, the automatic detection of whether there is a mask on the face, what type of mask is worn, and how it is worn is an important research topic. In this work, the use of thermal imaging was considered to analyze the possibility of detecting (localizing) a mask on the face, as well as to check whether it is possible to classify...

    Full text available to download

  • INVESTIGATION OF THE LOMBARD EFFECT BASED ON A MACHINE LEARNING APPROACH

    Publication

    The Lombard effect is an involuntary increase in the speaker’s pitch, intensity, and duration in the presence of noise. It makes it possible to communicate in noisy environments more effectively. This study aims to investigate an efficient method for detecting the Lombard effect in uttered speech. The influence of interfering noise, room type, and the gender of the person on the detection process is examined. First, acoustic parameters...

    Full text available to download

  • Vehicle Detection with Self-Training for Adaptative Video Processing Embedded Platform

    Publication

    Traffic monitoring from closed-circuit television (CCTV) cameras on embedded systems is the subject of the performed experiments. Solving this problem encounters difficulties related to hardware limitations and to the possible placement of the camera in various positions, which affects the system performance. To satisfy the hardware requirements, vehicle detection is performed using a lightweight Convolutional Neural Network (CNN), named...

    Full text available to download

  • Equal Baseline Camera Array—Calibration, Testbed and Applications

    Publication

    - Applied Sciences-Basel - Year 2021

    This paper presents research on 3D scanning by taking advantage of a camera array consisting of up to five adjacent cameras. Such an array makes it possible to make a disparity map with a higher precision than a stereo camera; however, it preserves the advantages of a stereo camera, such as the ability to operate in a wide range of distances and in highly illuminated areas. In an outdoor environment, the array is a competitive alternative...

    Full text available to download

  • Training of Deep Learning Models Using Synthetic Datasets

    Publication

    - Year 2022

    In order to solve increasingly complex problems, the complexity of Deep Neural Networks also needs to be constantly increased, and therefore training such networks requires more and more data. Unfortunately, obtaining such massive real world training data to optimize neural network parameters is a challenging and time-consuming task. To solve this problem, we propose an easy-to-use and general approach to training deep learning...

    Full text to download in external service

  • CNN-CLFFA: Support Mobile Edge Computing in Transportation Cyber Physical System

    Publication
    • A. Bhansali
    • R. Kumar Patra
    • P. Bidare Divakarachari
    • P. Falkowski-Gilski
    • G. Shivakanth
    • S. N. Patil

    - IEEE Access - Year 2024

    In the present scenario, the transportation Cyber Physical System (CPS) improves the reliability and efficiency of the transportation systems by enhancing the interactions between the physical and cyber systems. With the provision of better storage ability and enhanced computing, cloud computing extends transportation CPS in Mobile Edge Computing (MEC). According to the existing literature, cloud computing cannot fulfill...

    Full text available to download

  • A Novel Spatio–Temporal Deep Learning Vehicle Turns Detection Scheme Using GPS-Only Data

    Publication

    - IEEE Access - Year 2023

    Whether the computer is driving your car or you are, advanced driver assistance systems (ADAS) come into play on all levels, from weather monitoring to safety. These modern-day ADASs use various assisting tools for drivers to keep the journey safe; these sophisticated tools provide early signals of numerous events, such as road conditions, emerging traffic scenarios, and weather warnings. Many urban applications, such as car-sharing...

    Full text available to download

  • Deep Learning Basics 2023/24

    e-Learning Courses
    • K. Draszawka

    A course about the basics of deep learning intended for students of Computer Science. It includes an introduction to supervised machine learning, the architecture of basic artificial neural networks and their training algorithms, as well as more advanced architectures (convolutional networks, recurrent networks, transformers) and regularization and optimization techniques.

  • A Novel IoT-Perceptive Human Activity Recognition (HAR) Approach Using Multi-Head Convolutional Attention

    Publication

    - IEEE Internet of Things Journal - Year 2019

    With the fast advancement of the Internet of Things (IoT), smart healthcare applications and systems are equipped with increasingly more wearable sensors and mobile devices. These sensors are used not only to collect data, but also, and more importantly, to assist in tracking and analyzing the daily activity of their users. Various human activity recognition (HAR) approaches are used to enhance such tracking. Most of the existing...

    Full text available to download

  • Balanced Spider Monkey Optimization with Bi-LSTM for Sustainable Air Quality Prediction

    Publication

    - Sustainability - Year 2023

    A reliable air quality prediction model is required for pollution control, human health monitoring, and sustainability. The existing air quality prediction models lack efficiency due to overfitting in the prediction model and the local optima trap in feature selection. This study proposes the Balanced Spider Monkey Optimization (BSMO) technique for effective feature selection to overcome the local optima trap and overfitting problems....

    Full text available to download