Filtry
wszystkich: 474
Wyniki wyszukiwania dla: DATASET QUALITY
-
ECG measurement in the bathtub - drl on one side of the bathtub, the other side of the bathtub is grounded and the system ground is grounded- men
Dane BadawczeThe measurement data shows the measurement of the ECG signal in water in the bathtub. The data includes the measurement time, the reference ECG signal from the chest, and the ECG signal measured by electrodes placed in the bathtub without contact with the human body. Using the presented data, it is possible to estimate the optimal arrangement of measuring...
-
Closer Look at the Uncertainty Estimation in Semantic Segmentation under Distributional Shift
PublikacjaWhile recent computer vision algorithms achieve impressive performance on many benchmarks, they lack robustness - presented with an image from a different distribution, (e.g. weather or lighting conditions not considered during training), they may produce an erroneous prediction. Therefore, it is desired that such a model will be able to reliably predict its confidence measure. In this work, uncertainty estimation for the task...
-
Consumer Bankruptcy Prediction Using Balanced and Imbalanced Data
PublikacjaThis paper examines the usefulness of logit regression in forecasting the consumer bankruptcy of households using an imbalanced dataset. The research on consumer bankruptcy prediction is of paramount importance as it aims to build statistical models that can identify consumers in a difficult financial situation that may lead to consumer bankruptcy. In the face of the current global pandemic crisis, the future of household finances...
-
A comprehensive review of open data platforms, prevalent technologies, and functionalities
PublikacjaOpen data can play a crucial role in different sectors of the world,such as government, science, research, technology, culture, andfinance. There are several necessary measures that every organiza-tion needs to consider before opening data. There are three majorsteps to opening the data: (1) Preparation stage, and (2) launchingthe open data initiative (3) In this case, the feedback mechanismstudy such as expand and sustain stage,...
-
Medical Image Dataset Annotation Service (MIDAS)
PublikacjaMIDAS (Medical Image Dataset Annotation Service) is a custom-tailored tool for creating and managing datasets either for deep learning, as well as machine learning or any form of statistical research. The aim of the project is to provide one-fit-all platform for creating medical image datasets that could easily blend in hospital's workflow. In our work, we focus on the importance of medical data anonimization, discussing the...
-
Investigating Feature Spaces for Isolated Word Recognition
PublikacjaMuch attention is given by researchers to the speech processing task in automatic speech recognition (ASR) over the past decades. The study addresses the issue related to the investigation of the appropriateness of a two-dimensional representation of speech feature spaces for speech recognition tasks based on deep learning techniques. The approach combines Convolutional Neural Networks (CNNs) and timefrequency signal representation...
-
Minimizing Distribution and Data Loading Overheads in Parallel Training of DNN Acoustic Models with Frequent Parameter Averaging
PublikacjaIn the paper we investigate the performance of parallel deep neural network training with parameter averaging for acoustic modeling in Kaldi, a popular automatic speech recognition toolkit. We describe experiments based on training a recurrent neural network with 4 layers of 800 LSTM hidden states on a 100-hour corpora of annotated Polish speech data. We propose a MPI-based modification of the training program which minimizes the...
-
Semantic URL Analytics to Support Efficient Annotation of Large Scale Web Archives
PublikacjaLong-term Web archives comprise Web documents gathered over longer time periods and can easily reach hundreds of terabytes in size. Semantic annotations such as named entities can facilitate intelligent access to the Web archive data. However, the annotation of the entire archive content on this scale is often infeasible. The most efficient way to access the documents within Web archives is provided through their URLs, which are...
-
Going all in or spreading your bet: a configurational perspective on open innovation interaction channels in production sectors
PublikacjaUsing different interaction channels within open innovation partnerships holds the potential to enhance the chance of success in production sectors. However, our comprehension of how open innovation partnerships are affected by varying combinations of interaction channels, and how this corelates with their level of open innovation output, remains limited. There are discrepancies in the current literature regarding the individual...
-
Prediction of fracture toughness in fibre-reinforced concrete, mortar, and rocks using various Machine learning techniques
PublikacjaMachine Learning (ML) method is widely used in engineering applications such as fracture mechanics. In this study, twenty different ML algorithms were employed and compared for the prediction of the fracture toughness and fracture load in modes I, II, and mixed-mode (I-II) of various materials, including fibre-reinforced concrete, cement mortar, sandstone, white travertine, marble, and granite. A set of 401 specimens of “Brazilian...
-
Towards application of uncertainty quantification procedure combined with experimental procedure for assessment of the accuracy of the DEM approach dedicated for granular flow modeling
PublikacjaThere is a high demand for accurate and fast numerical models for dense granular flows found in many industrial applications. Nevertheless, before numerical model can be used its need to be always validated against experimental data. During the validation, it is important to consider how the measurement data sets, as well as the numerical models, are affected by errors and uncertainties. In this study, the uncertainty quantification...
-
Urban scene semantic segmentation using the U-Net model
PublikacjaVision-based semantic segmentation of complex urban street scenes is a very important function during autonomous driving (AD), which will become an important technology in industrialized countries in the near future. Today, advanced driver assistance systems (ADAS) improve traffic safety thanks to the application of solutions that enable detecting objects, recognising road signs, segmenting the road, etc. The basis for these functionalities...
-
The influence of external interference on AFM imaging, the use of a protective helmet
Dane BadawczeThis collection is of purely practical importance, showing how the presence of external disturbances can adversely affect the quality of imaging with an atomic force microscope. For this reason, it is also advisable to provide a link to a workshop-like study [1] as well as a huge number of commercial solutions available after entering the keyword "AFM...
-
Systematic Literature Review for Emotion Recognition from EEG Signals
PublikacjaResearchers have recently become increasingly interested in recognizing emotions from electroencephalogram (EEG) signals and many studies utilizing different approaches have been conducted in this field. For the purposes of this work, we performed a systematic literature review including over 40 articles in order to identify the best set of methods for the emotion recognition problem. Our work collects information about the most...
-
Systematic Literature Review for Emotion Recognition from EEG Signals
PublikacjaResearchers have recently become increasingly interested in recognizing emotions from electroencephalogram (EEG) signals and many studies utilizing different approaches have been conducted in this field. For the purposes of this work, we performed a systematic literature review including over 40 articles in order to identify the best set of methods for the emotion recognition problem. Our work collects information about the most...
-
Categorization of emotions in dog behavior based on the deep neural network
PublikacjaThe aim of this article is to present a neural system based on stock architecture for recognizing emotional behavior in dogs. Our considerations are inspired by the original work of Franzoni et al. on recognizing dog emotions. An appropriate set of photographic data has been compiled taking into account five classes of emotional behavior in dogs of one breed, including joy, anger, licking, yawning, and sleeping. Focusing on a particular...
-
Independent dynamics of low, intermediate, and high frequency spectral intracranial EEG activities during human memory formation
PublikacjaA wide spectrum of brain rhythms are engaged throughout the human cortex in cognitive functions. How the rhythms of various frequency ranges are coordinated across the space of the human cortex and time of memory processing is inconclusive. They can either be coordinated together across the frequency spectrum at the same cortical site and time or induced independently in particular bands. We used a large dataset of human intracranial...
-
Solubility Characteristics of Acetaminophen and Phenacetin in Binary Mixtures of Aqueous Organic Solvents: Experimental and Deep Machine Learning Screening of Green Dissolution Media
PublikacjaThe solubility of active pharmaceutical ingredients is a mandatory physicochemical characteristic in pharmaceutical practice. However, the number of potential solvents and their mixtures prevents direct measurements of all possible combinations for finding environmentally friendly, operational and cost-effective solubilizers. That is why support from theoretical screening seems to be valuable. Here, a collection of acetaminophen...
-
Selected Technical Issues of Deep Neural Networks for Image Classification Purposes
PublikacjaIn recent years, deep learning and especially Deep Neural Networks (DNN) have obtained amazing performance on a variety of problems, in particular in classification or pattern recognition. Among many kinds of DNNs, the Convolutional Neural Networks (CNN) are most commonly used. However, due to their complexity, there are many problems related but not limited to optimizing network parameters, avoiding overfitting and ensuring good...
-
CNN-CLFFA: Support Mobile Edge Computing in Transportation Cyber Physical System
PublikacjaIn the present scenario, the transportation Cyber Physical System (CPS) improves the reliability and efficiency of the transportation systems by enhancing the interactions between the physical and cyber systems. With the provision of better storage ability and enhanced computing, cloud computing extends transportation CPS in Mobile Edge Computing (MEC). By inspecting the existing literatures, the cloud computing cannot fulfill...
-
Transfer learning in imagined speech EEG-based BCIs
PublikacjaThe Brain–Computer Interfaces (BCI) based on electroencephalograms (EEG) are systems which aim is to provide a communication channel to any person with a computer, initially it was proposed to aid people with disabilities, but actually wider applications have been proposed. These devices allow to send messages or to control devices using the brain signals. There are different neuro-paradigms which evoke brain signals of interest...
-
ELECTRICAL CONDUCTIVITY AND pH IN SURFACE WATER AS TOOL FOR IDENTIFICATION OF CHEMICAL DIVERSITY
PublikacjaIn the present study, the creeks and lakes located at the western shore of Admiralty Bay were analysed. The impact of various sources of water supply was considered, based on the parameters of temperature, pH and specific electrolytic conductivity (SEC25). All measurements were conducted during a field campaign in January-February 2017. A multivariate dataset was also created and a biplot of SEC25 and pH of the investigated waters...
-
Deep learning-based waste detection in natural and urban environments
PublikacjaWaste pollution is one of the most significant environmental issues in the modern world. The importance of recycling is well known, both for economic and ecological reasons, and the industry demands high efficiency. Current studies towards automatic waste detection are hardly comparable due to the lack of benchmarks and widely accepted standards regarding the used metrics and data. Those problems are addressed in this article by...
-
Magnetic Signature Description of Ellipsoid-Shape Vessel Using 3D Multi-Dipole Model Fitted on Cardinal Directions
PublikacjaThe article presents a continuation of the research on the 3D multi-dipole model applied to the reproduction of magnetic signatures of ferromagnetic objects. The model structure has been modified to improve its flexibility - model parameters determined by optimization can now be located in the cuboid contour representing the object's hull. To stiffen the model, the training dataset was expanded to data collected from all four cardinal...
-
Know your safety indicator – A determination of merchant vessels Bow Crossing Range based on big data analytics
PublikacjaEven in the era of automatization maritime safety constantly needs improvements. Regardless of the presence of crew members on board, both manned and autonomous ships should follow clear guidelines (no matter as bridge procedures or algorithms). To date, many safety indicators, especially in collision avoidance have been proposed. One of such parameters commonly used in day-to-day navigation but usually omitted by researchers is...
-
Interrelations between Travel Patterns and Urban Spatial Structure of the Largest Russian Cities
PublikacjaThe study presented within this dissertation involves the analysis of the relationship between urban spatial structure and travel patterns in the largest Russian cities. It is an empirical investigation of how the spatial structure, formed during the Soviet and post-Soviet periods, affects the travel patterns in the largest cities of contemporary Russia. It aims to determine what measures, both urban structure and transportation...
-
The use of fast molecular descriptors and artificial neural networks approach in organochlorine compounds electron ionization mass spectra classification
PublikacjaDeveloping of theoretical tools can be very helpful for supporting new pollutant detection. Nowadays, a combination of mass spectrometry and chromatographic techniques are the most basic environmental monitoring methods. In this paper, two organochlorine compound mass spectra classification systems were proposed. The classification models were developed within the framework of artificial neural networks (ANNs) and fast 1D and...
-
An Analysis of Neural Word Representations for Wikipedia Articles Classification
PublikacjaOne of the current popular methods of generating word representations is an approach based on the analysis of large document collections with neural networks. It creates so-called word-embeddings that attempt to learn relationships between words and encode this information in the form of a low-dimensional vector. The goal of this paper is to examine the differences between the most popular embedding models and the typical bag-of-words...
-
Machine learning-based prediction of preplaced aggregate concrete characteristics
PublikacjaPreplaced-Aggregate Concrete (PAC) is a type of preplaced concrete where coarse aggregate is placed in the mold and a Portland cement-sand grout with admixtures is injected to fill the voids. Due to the complex nature of PAC, many studies were conducted to determine the effects of admixtures and the compressive and tensile strengths of PAC. Considering that a prediction tool is needed to estimate the compressive and tensile...
-
Exploring the Solubility Limits of Edaravone in Neat Solvents and Binary Mixtures: Experimental and Machine Learning Study
PublikacjaThis study explores the edaravone solubility space encompassing both neat and binary dissolution media. Efforts were made to reveal the inherent concentration limits of common pure and mixed solvents. For this purpose, the published solubility data of the title drug were scrupulously inspected and cured, which made the dataset consistent and coherent. However, the lack of some important types of solvents in the collection called...
-
Super-resolved Thermal Imagery for High-accuracy Facial Areas Detection and Analysis
PublikacjaIn this study, we evaluate various Convolutional Neural Networks based Super-Resolution (SR) models to improve facial areas detection in thermal images. In particular, we analyze the influence of selected spatiotemporal properties of thermal image sequences on detection accuracy. For this purpose, a thermal face database was acquired for 40 volunteers. Contrary to most of existing thermal databases of faces, we publish our dataset...
-
Impact of optimization of ALS point cloud on classification
PublikacjaAirborne laser scanning (ALS) is one of the LIDAR technologies (Light Detection and Ranging). It provides information about the terrain in form of a point cloud. During measurement is acquired: spatial data (object’s coordinates X, Y, Z) and collateral data such as intensity of reflected signal. The obtained point cloud is typically applied for generating a digital terrain model (DTM) and a digital surface model (DSM). For DTM...
-
Fault diagnosis of marine 4-stroke diesel engines using a one-vs-one extreme learning ensemble
PublikacjaThis paper proposes a novel approach for intelligent fault diagnosis for stroke Diesel marine engines, which are commonly used in on-road and marine transportation. The safety and reliability of a ship's work rely strongly on the performance of such an engine; therefore, early detection of any type of failure that affects the engine is of crucial importance. Automatic diagnostic systems are of special importance because they can...
-
Emissions and toxic units of solvent, monomer and additive residues released to gaseous phase from latex balloons
PublikacjaThis study describes the VOCs emissions from commercially available latex balloons. Nine compounds are determined to be emitted from 13 types of balloons of different colors and imprints in 30 and 60°C. The average values of total volatile organic compounds (TVOCs) emitted from studied samples ranged from 0.054 up to 7.18 μg∙g-1 and from 0.27 up to 36.13 μg∙g-1 for 30oC and 60oC, respectively. The dataset is treated with principal...
-
Graph Representation Integrating Signals for Emotion Recognition and Analysis
PublikacjaData reusability is an important feature of current research, just in every field of science. Modern research in Affective Computing, often rely on datasets containing experiments-originated data such as biosignals, video clips, or images. Moreover, conducting experiments with a vast number of participants to build datasets for Affective Computing research is time-consuming and expensive. Therefore, it is extremely important to...
-
imPlatelet classifier: image‐converted RNA biomarker profiles enable blood‐based cancer diagnostics
PublikacjaLiquid biopsies offer a minimally invasive sample collection, outperforming traditional biopsies employed for cancer evaluation. The widely used material is blood, which is the source of tumor-educated platelets. Here, we developed the imPlatelet classifier, which converts RNA-sequenced platelet data into images in which each pixel corresponds to the expression level of a certain gene. Biological knowledge from the Kyoto Encyclopedia...
-
Quantitative Risk Assessment in Construction Disputes Based on Machine Learning Tools
PublikacjaA high monetary value of the construction projects is one of the reasons of frequent disputes between a general contractor (GC) and a client. A construction site is a unique, one-time, and single-product factory with many parties involved and dependent on each other. The organizational dependencies and their complexity make any fault or mistake propagate and influence the final result (delays, cost overruns). The constant will...
-
Machine learning-based seismic fragility and seismic vulnerability assessment of reinforced concrete structures
PublikacjaMany studies have been performed to put quantifying uncertainties into the seismic risk assessment of reinforced concrete (RC) buildings. This paper provides a risk-assessment support tool for purpose of retrofitting and potential design strategies of RC buildings. Machine Learning (ML) algorithms were developed in Python software by innovative methods of hyperparameter optimization, such as halving search, grid search, random...
-
Dependent self-employed individuals: are they different from paid employees?
PublikacjaThis study focuses on dependent self-employment, which covers a situation where a person works for the same employer as a typical worker while on a self-employment contractual basis, i.e., without a traditional employment contract and without certain rights granted to "regular" employees. The research exploits the individual-level dataset of 35 European countries extracted from the 2017 edition of the European Labour Force Survey...
-
Machine learning-based prediction of preplaced aggregate concrete characteristics
PublikacjaPreplaced-Aggregate Concrete (PAC) is a type of preplaced concrete where coarse aggregate is placed in the mold and a Portland cement-sand grout with admixtures is injected to fill the voids. Due to the complex nature of PAC, many studies were conducted to determine the effects of admixtures and the compressive and tensile strengths of PAC. Considering that a prediction tool is needed to estimate the compressive and tensile strengths...
-
Platelet RNA Sequencing Data Through the Lens of Machine Learning
PublikacjaLiquid biopsies offer minimally invasive diagnosis and monitoring of cancer disease. This biosource is often analyzed using sequencing, which generates highly complex data that can be used using machine learning tools. Nevertheless, validating the clinical applications of such methods is challenging. It requires: (a) using data from many patients; (b) verifying potential bias concerning sample collection; and (c) adding interpretability...
-
Towards High-Value Datasets Determination for Data-Driven Development: A Systematic Literature Review
PublikacjaOpen government data (OGD) is seen as a political and socio-economic phenomenon that promises to promote civic engagement and stimulate public sector innovations in various areas of public life. To bring the expected benefits, data must be reused and transformed into value-added products or services. This, in turn, sets another precondition for data that are expected to not only be available and comply with open data principles,...
-
Diurnal variability of atmospheric water vapour, precipitation and cloud top temperature across the global tropics derived from satellite observations and GNSS technique
PublikacjaThe diurnal cycle of convection plays an important role in clouds and water vapour distribution across the global tropics. In this study, we utilize integrated moisture derived from the global navigation satellite system (GNSS), satellite precipitation estimates from TRMM and merged infrared dataset to investigate links between variability in tropospheric moisture, clouds development and precipitation at a diurnal time scale. Over...
-
The surface of the sensor used in the analysis of odorous substances
Dane BadawczeHuman industrial activity usually leads to smaller or larger interference with the ecosystem, contributing to changes affecting the quality of life. An example may be the emission of gaseous substances, not necessarily toxic, but due to their intense smell, they can cause discomfort to people exposed to their inhalation. The problem is so important...
-
Towards Scalable Simulation of Federated Learning
PublikacjaFederated learning (FL) allows to train models on decentralized data while maintaining data privacy, which unlocks the availability of large and diverse datasets for many practical applications. The ongoing development of aggregation algorithms, distribution architectures and software implementations aims for enabling federated setups employing thousands of distributed devices, selected from millions. Since the availability of...
-
Comparative study on total nitrogen prediction in wastewater treatment 1 plant and effect of various feature selection methods on machine learning algorithms performance
PublikacjaWastewater characteristics prediction in wastewater treatment plants (WWTPs) is valuable and can reduce the number of sampling, energy, and cost. Feature Selection (FS) methods are used in the pre-processing section for enhancing the model performance. This study aims to evaluate the effect of seven different FS methods (filter, wrapper, and embedded methods) on enhancing the prediction accuracy for total nitrogen (TN) in the WWTP...
-
Metoda OptD do redukcji danych w opracowaniu wyników pomiarów linii elektroenergetycznych
PublikacjaSkaning laserowy to technologia dostarczająca we względnie krótkim czasie dużą ilość danych pomiarowych. Jest to zarazem pozytywna jak i negatywna cecha tej technologii. Z jednej strony w wyniku skaningu otrzymuje się dane, które szczegółowo odzwierciedlają pomierzony obiekt. Z drugiej strony trudność sprawia przetwarzanie takiej ilości danych i nie zawsze wszystkie dane ze skaningu są niezbędne do realizacji wybranego zadania....
-
Detecting Apples in the Wild: Potential for Harvest Quantity Estimation
PublikacjaKnowing the exact number of fruits and trees helps farmers to make better decisions in their orchard production management. The current practice of crop estimation practice often involves manual counting of fruits (before harvesting), which is an extremely time-consuming and costly process. Additionally, this is not practicable for large orchards. Thanks to the changes that have taken place in recent years in the field of image...
-
LDNet: A Robust Hybrid Approach for Lie Detection Using Deep Learning Techniques
PublikacjaDeception detection is regarded as a concern for everyone in their daily lives and affects social interactions. The human face is a rich source of data that offers trustworthy markers of deception. The deception or lie detection systems are non-intrusive, cost-effective, and mobile by identifying facial expressions. Over the last decade, numerous studies have been conducted on deception detection using several advanced techniques....
-
An automated learning model for twitter sentiment analysis using Ranger AdaBelief optimizer based Bidirectional Long Short Term Memory
PublikacjaSentiment analysis is an automated approach which is utilized in process of analysing textual data to describe public opinion. The sentiment analysis has major role in creating impact in the day-to-day life of individuals. However, a precise interpretation of text still relies as a major concern in classifying sentiment. So, this research introduced Bidirectional Long Short Term Memory with Ranger AdaBelief Optimizer (Bi-LSTM RAO)...