Filtry
wszystkich: 467
Wyniki wyszukiwania dla: DATASET QUALITY
-
Methodology of Constructing and Analyzing the Hierarchical Contextually-Oriented Corpora
PublikacjaMethodology of Constructing and Analyzing the Hierarchical structure of the Contextually-Oriented Corpora was developed. The methodology contains the following steps: Contextual Component of the Corpora’s Structure Building; Text Analysis of the Contextually-Oriented Hierarchical Corpus. Main contribution of this study is the following: hierarchical structure of the Corpus provides advanced possibilities for identification of the...
-
Processing of LiDAR and Multibeam Sonar Point Cloud Data for 3D Surface and Object Shape Reconstruction
PublikacjaUnorganised point cloud dataset, as a transitional data model in several applications, usually contains a considerable amount of undesirable irregularities, such as strong variability of local point density, missing data, overlapping points and noise caused by scattering characteristics of the environment. For these reasons, further processing of such data, e.g. for construction of higher order geometric models of the topography...
-
Material for Automatic Phonetic Transcription of Speech Recorded in Various Conditions
PublikacjaAutomatic speech recognition (ASR) is under constant development, especially in cases when speech is casually produced or it is acquired in various environment conditions, or in the presence of background noise. Phonetic transcription is an important step in the process of full speech recognition and is discussed in the presented work as the main focus in this process. ASR is widely implemented in mobile devices technology, but...
-
Driver’s Condition Detection System Using Multimodal Imaging and Machine Learning Algorithms
PublikacjaTo this day, driver fatigue remains one of the most significant causes of road accidents. In this paper, a novel way of detecting and monitoring a driver’s physical state has been proposed. The goal of the system was to make use of multimodal imaging from RGB and thermal cameras working simultaneously to monitor the driver’s current condition. A custom dataset was created consisting of thermal and RGB video samples. Acquired data...
-
Regeneration Project of Market Places GOSPOSTRATEG – “Polanki” Market in Gdańsk-Oliwa Pilot Project Monitoring Dataset
PublikacjaThe dataset entitled Monitoring of activities carried out as part of prototyping and implementation of the pilot project in the area of the “Polanki” market and its direct neighbourhood, in the Gdańsk-Oliwa district, step1; stage from July 2020 year contains tabular monitoring lists (quantitative and qualitative documentation report in the form of tables) of activities carried out as part of the prototyping and implementation of...
-
Global value chains and wages under different wage setting mechanisms
PublikacjaThis study examines whether, and how, differences in wage bargaining schemes shape the relationship between global value chains (GVCs) and the wages of workers while considering both GVC participation and position in GVC. Our dataset is derived from the European Structure of Earnings Survey (SES), containing employee–employer data from 18 European countries, merged with sectoral data from the World Input-Output Database (WIOD)....
-
Intelligent Audio Signal Processing − Do We Still Need Annotated Datasets?
PublikacjaIn this paper, intelligent audio signal processing examples are shortly described. The focus is, however, on the machine learning approach and datasets needed, especially for deep learning models. Years of intense research produced many important results in this area; however, the goal of fully intelligent signal processing, characterized by its autonomous acting, is not yet achieved. Therefore, a review of state-of-the-art concerning...
-
Analysis of the Capability of Deep Learning Algorithms for EEG-based Brain-Computer Interface Implementation
PublikacjaMachine learning models have received significant attention for their exceptional performance in classifying electroencephalography (EEG) data. They have proven to be highly effective in extracting intricate patterns and features from the raw signal data, thereby contributing to their success in EEG classification tasks. In this study, we explore the possibilities of utilizing contemporary machine learning algorithms in decoding...
-
Focus on Misinformation: Improving Medical Experts’ Efficiency of Misinformation Detection
PublikacjaFighting medical disinformation in the era of the global pandemic is an increasingly important problem. As of today, automatic systems for assessing the credibility of medical information do not offer sufficient precision to be used without human supervision, and the involvement of medical expert annotators is required. Thus, our work aims to optimize the utilization of medical experts’ time. We use the dataset of sentences taken...
-
Multi-task Video Enhancement for Dental Interventions
PublikacjaA microcamera firmly attached to a dental handpiece allows dentists to continuously monitor the progress of conservative dental procedures. Video enhancement in video-assisted dental interventions alleviates low-light, noise, blur, and camera handshakes that collectively degrade visual comfort. To this end, we introduce a novel deep network for multi-task video enhancement that enables macro-visualization of dental scenes. In particular,...
-
Ranking Speech Features for Their Usage in Singing Emotion Classification
PublikacjaThis paper aims to retrieve speech descriptors that may be useful for the classification of emotions in singing. For this purpose, Mel Frequency Cepstral Coefficients (MFCC) and selected Low-Level MPEG 7 descriptors were calculated based on the RAVDESS dataset. The database contains recordings of emotional speech and singing of professional actors presenting six different emotions. Employing the algorithm of Feature Selection based...
-
Investigating Feature Spaces for Isolated Word Recognition
PublikacjaThe study addresses the issues related to the appropriateness of a two-dimensional representation of speech signal for speech recognition tasks based on deep learning techniques. The approach combines Convolutional Neural Networks (CNNs) and time-frequency signal representation converted to the investigated feature spaces. In particular, waveforms and fractal dimension features of the signal were chosen for the time domain, and...
-
A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces
PublikacjaIn this research, a study of cross-linguistic speech emotion recognition is performed. For this purpose, emotional data of different languages (English, Lithuanian, German, Spanish, Serbian, and Polish) are collected, resulting in a cross-linguistic speech emotion dataset with the size of more than 10.000 emotional utterances. Despite the bi-modal character of the databases gathered, our focus is on the acoustic representation...
-
Improving Traffic Light Recognition Methods using Shifting Time-Windows
PublikacjaWe propose a novel method of improving algorithms recognizing traffic lights in video sequences. Our focus is on algorithms for applications which notify the driver of a light in sight. Many existing methods process images in the recording separately. Our method bases on the observation that real-life videos depict underlying continuous processes. We named our method FSA (Frame Sequence Analyzed). It is applicable for any underlying...
-
Improving methods for detecting people in video recordings using shifting time-windows
PublikacjaWe propose a novel method for improving algorithms which detect the presence of people in video sequences. Our focus is on algorithms for applications which require reporting and analyzing all scenes with detected people in long recordings. Therefore one of the target qualities of the classification result is its stability, understood as a low number of invalid scene boundaries. Many existing methods process images in the recording...
-
Vehicle detector training with labels derived from background subtraction algorithms in video surveillance
PublikacjaVehicle detection in video from a miniature station- ary closed-circuit television (CCTV) camera is discussed in the paper. The camera provides one of components of the intelligent road sign developed in the project concerning the traffic control with the use of autonomous devices being developed. Modern Convolutional Neural Network (CNN) based detectors need big data input, usually demanding their manual labeling. In the presented...
-
A Triplet-Learnt Coarse-to-Fine Reranking for Vehicle Re-identification
PublikacjaVehicle re-identification refers to the task of matching the same query vehicle across non-overlapping cameras and diverse viewpoints. Research interest on the field emerged with intelligent transportation systems and the necessity for public security maintenance. Compared to person, vehicle re-identification is more intricate, facing the challenges of lower intra-class and higher inter-class similarities. Motivated by deep...
-
Areas of Updraft Air Motion in an Idealised Weather Research and Forecasting Model Simulation of Atmospheric Boundary Layer Response to Different Floe Size Distributions
PublikacjaPresented dataset is part of a numerical modelling study focusing on the analysis of the influence of sea ice floe size distribution (FSD) on the horizontal and vertical structure of convection in the atmosphere. The total area and spatial arrangement of the up-drafts indicates that the FSD affects the total moisture content and the values of area averaged turbulent fluxes in the model domain. In fact, while convective updrafts...
-
Simulations of the Derecho Event in Poland of 11th August 2017 Using WRF Model
PublikacjaThis series contains datasets related to the forecasting of a severe weather event, a derecho, in Poland on 11 August 2017. The simulations were conducted using the Weather Research and Forecasting (WRF) model version 4.2.1 with different initial and boundary conditions of the pressure and model levels derived from 5 global models: Global Forecast System (GFS), Global Data Assimilation System (GDAS), European Centre for Medium-Range...
-
Optimized Computational Intelligence Model for Estimating the Flexural Behavior of Composite Shear Walls
PublikacjaThis article presents a novel approach to estimate the flexural capacity of reinforced concrete-filled composite plate shear walls using an optimized computational intelligence model. The proposed model was developed and validated based on 47 laboratory data points and the Transit Search (TS) optimization algorithm. Using 80% of the experimental dataset, the optimized model was selected by determining the unknown coefficients of...
-
Intracranial electrophysiological recordings from the human brain during memory tasks with pupillometry
PublikacjaData comprise intracranial EEG (iEEG) brain activity represented by stereo EEG (sEEG) signals, recorded from over 100 electrode channels implanted in any one patient across various brain regions. The iEEG signals were recorded in epilepsy patients (N=10) undergoing invasive monitoring and localization of seizures when they were performing a battery of four memory tasks lasting approx. 1 hour in total. Gaze tracking on the task...
-
Pursuing the Deep-Learning-Based Classification of Exposed and Imagined Colors from EEG
PublikacjaEEG-based brain-computer interfaces are systems aiming to integrate disabled people into their environments. Nevertheless, their control could not be intuitive or depend on an active external stimulator to generate the responses for interacting with it. Targeting the second issue, a novel paradigm is explored in this paper, which depends on a passive stimulus by measuring the EEG responses of a subject to the primary colors (red,...
-
Predicting sulfanilamide solubility in the binary mixtures using a reference solvent approach
PublikacjaBackground. Solubility is a fundamental physicochemical property of active pharmaceutical ingredients. The optimization of a dissolution medium aims not only to increase solubility and other aspects are to be included such as environmental impact, toxicity degree, availability, and costs. Obtaining comprehensive...
-
A Novel IoT-Perceptive Human Activity Recognition (HAR) Approach Using Multi-Head Convolutional Attention
PublikacjaTogether with fast advancement of the Internet of Things (IoT), smart healthcare applications and systems are equipped with increasingly more wearable sensors and mobile devices. These sensors are used not only to collect data, but also, and more importantly, to assist in daily activity tracking and analyzing of their users. Various human activity recognition (HAR) approaches are used to enhance such tracking. Most of the existing...
-
Combining Road Network Data from OpenStreetMap with an Authoritative Database
PublikacjaComputer modeling of road networks requires detailed and up-to-date dataset. This paper proposes a method of combining authoritative databases with OpenStreetMap (OSM) system. The complete route is established by finding paths in the graph constructed from partial data obtained from OSM. In order to correlate data from both sources, a method of coordinate conversion is proposed. The algorithm queries road data from OSM and provides...
-
Toward Robust Pedestrian Detection With Data Augmentation
PublikacjaIn this article, the problem of creating a safe pedestrian detection model that can operate in the real world is tackled. While recent advances have led to significantly improved detection accuracy on various benchmarks, existing deep learning models are vulnerable to invisible to the human eye changes in the input image which raises concerns about its safety. A popular and simple technique for improving robustness is using data...
-
Genre-Based Music Language Modeling with Latent Hierarchical Pitman-Yor Process Allocation
PublikacjaIn this work we present a new Bayesian topic model: latent hierarchical Pitman-Yor process allocation (LHPYA), which uses hierarchical Pitman-Yor pr ocess priors for both word and topic distributions, and generalizes a few of the existing topic models, including the latent Dirichlet allocation (LDA), the bi- gram topic model and the hierarchical Pitman-Yor topic model. Using such priors allows for integration of -grams with a topic model,...
-
Genre-Based Music Language Modeling with Latent Hierarchical Pitman-Yor Process Allocation
PublikacjaIn this work we present a new Bayesian topic model: latent hierarchical Pitman-Yor process allocation (LHPYA), which uses hierarchical Pitman-Yor pr ocess priors for both word and topic distributions, and generalizes a few of the existing topic models, including the latent Dirichlet allocation (LDA), the bi- gram topic model and the hierarchical Pitman-Yor topic model. Using such priors allows for integration of -grams with a topic...
-
The Specific Nature of Chemical Composition of Water from Volcanic Lakes Based on Bali Case Study
PublikacjaThe research area was localized in the Indonesian Archipelago, at the latitude of eight and nine degrees S on the one of the Lesser Sunda group island provinces, Bali (563,3 km2). Two massive calderas (Mount Batur 1717 m above sea level.; Mount Sangiyang 2093 m above sea level) are one of the most prominent landforms in the chain of volcanic mountain ranges of the Bali Island. Lake Batur (17,18 km2) and Batur Spring (which are...
-
Bimodal Emotion Recognition Based on Vocal and Facial Features
PublikacjaEmotion recognition is a crucial aspect of human communication, with applications in fields such as psychology, education, and healthcare. Identifying emotions accurately is challenging, as people use a variety of signals to express and perceive emotions. In this study, we address the problem of multimodal emotion recognition using both audio and video signals, to develop a robust and reliable system that can recognize emotions...
-
Food Classification from Images Using a Neural Network Based Approach with NVIDIA Volta and Pascal GPUs
PublikacjaIn the paper we investigate the problem of food classification from images, for the Food-101 dataset extended with 31 additional food classes from Polish cuisine. We adopted transfer learning and firstly measured training times for models such as MobileNet, MobileNetV2, ResNet50, ResNet50V2, ResNet101, ResNet101V2, InceptionV3, InceptionResNetV2, Xception, NasNetMobile and DenseNet, for systems with NVIDIA Tesla V100 (Volta) and...
-
How to Sort Them? A Network for LEGO Bricks Classification
PublikacjaLEGO bricks are highly popular due to the ability to build almost any type of creation. This is possible thanks to availability of multiple shapes and colors of the bricks. For the smooth build process the bricks need to properly sorted and arranged. In our work we aim at creating an automated LEGO bricks sorter. With over 3700 different LEGO parts bricks classification has to be done with deep neural networks. The question arises...
-
Mask Detection and Classification in Thermal Face Images
PublikacjaFace masks are recommended to reduce the transmission of many viruses, especially SARS-CoV-2. Therefore, the automatic detection of whether there is a mask on the face, what type of mask is worn, and how it is worn is an important research topic. In this work, the use of thermal imaging was considered to analyze the possibility of detecting (localizing) a mask on the face, as well as to check whether it is possible to classify...
-
Methodology for Performing Bathymetric Measurements of Shallow Waterbodies Using an UAV, and their Processing Based on the SVR Algorithm
PublikacjaState-of-art methods of bathymetric measurements for shallow waterbodies use Global Navigation Satellite System (GNSS) receiver, bathymetric Light Detection and Ranging (LiDAR) sensor or satellite imagery. Currently, photogrammetric methods with the application of Unmanned Aerial Vehicles (UAV) are gathering great importance. This publication aims to present step-by-step methodology for carrying out the bathymetric measurements...
-
Methodology for Performing Bathymetric Measurements of Shallow Waterbodies Using an UAV, and their Processing Based on the SVR Algorithm
PublikacjaState-of-art methods of bathymetric measurements for shallow waterbodies use Global Navigation Satellite System (GNSS) receiver, bathymetric Light Detection and Ranging (LiDAR) sensor or satellite imagery. Currently, photogrammetric methods with the application of Unmanned Aerial Vehicles (UAV) are gathering great importance. This publication aims to present step-by-step methodology for carrying out the bathymetric measurements...
-
Ontological Model for Contextual Data Defining Time Series for Emotion Recognition and Analysis
PublikacjaOne of the major challenges facing the field of Affective Computing is the reusability of datasets. Existing affective-related datasets are not consistent with each other, they store a variety of information in different forms, different formats, and the terms used to describe them are not unified. This paper proposes a new ontology, ROAD, as a solution to this problem, by formally describing the datasets and unifying the terms...
-
Atomic force microscopy images of copper electrical contacts wear under the influence of friction
Dane BadawczeMeasurement of wear of copper electrical contacts under the influence of friction. Imaging in contact mode in the variant of scanning spreading resistance microscopy. Additionally, there are spectroscopic current-voltage curves showing local changes in electrical conductivity. NTEGRA Prima (NT-MDT) device. Probe NSG 01Pt.
-
Long term measurements of PM1, PM2.5, PM10 and NO2 in open-air at Gdansk (Poland) area using low-cost sensors together with the reference results
Dane BadawczeThe measurements results of open-air measurements made using the following low-cost sensors: particulate matter (PM) sensor SPS30 from Sensirion, NO2 electrochemical sensor SGX-7NO2 from SGX Sensortech, NO2 electrochemical sensor 7E4-NO2 from SemaTech, compact MOS air quality sensor MiCS 2714 from SGX Sensortech, BME280 (Bosch) environmental sensor...
-
ECG measurement in the bathtub - getting into the bathtub- men
Dane BadawczeThe measurement data shows the measurement of the ECG signal in water in the bathtub. The data includes the measurement time, the reference ECG signal from the chest, and the ECG signal measured by electrodes placed in the bathtub without contact with the human body. Using the presented data, it is possible to estimate the optimal arrangement of measuring...
-
EXTREME RAINFALLS AS A CAUSE OF URBAN FLASH FLOODS; A CASE STUDY OF THE ERBIL-KURDISTAN REGION OF IRAQ
PublikacjaAim of the study The current paper aims to give a detailed evaluation and analysis of some extreme rainfall events that happened in the last decade in terms of spatial and temporal rainfall distribution, intensity rate, and exceedance probability. Moreover, it examines the effects of each analysed aspect on the resulting flash floods in the studied area. Material and methods In their glossary of meteorology, American Meteorology...
-
Speech Analytics Based on Machine Learning
PublikacjaIn this chapter, the process of speech data preparation for machine learning is discussed in detail. Examples of speech analytics methods applied to phonemes and allophones are shown. Further, an approach to automatic phoneme recognition involving optimized parametrization and a classifier belonging to machine learning algorithms is discussed. Feature vectors are built on the basis of descriptors coming from the music information...
-
Increasing K-Means Clustering Algorithm Effectivity for Using in Source Code Plagiarism Detection
PublikacjaThe problem of plagiarism is becoming increasingly more significant with the growth of Internet technologies and the availability of information resources. Many tools have been successfully developed to detect plagiarisms in textual documents, but the situation is more complicated in the field of plagiarism of source codes, where the problem is equally serious. At present, there are no complex tools available to detect plagiarism...
-
Information Extraction from Polish Radiology Reports using Language Models
PublikacjaRadiology reports are vital elements of directing patient care. They are usually delivered in free text form, which makes them prone to errors, such as omission in reporting radiological findings and using difficult-to-comprehend mental shortcuts. Although structured reporting is the recommended method, its adoption continues to be limited. Radiologists find structured reports too limiting and burdensome. In this paper, we propose...
-
Characterizing the Scalability of Graph Convolutional Networks on Intel® PIUMA
PublikacjaLarge-scale Graph Convolutional Network (GCN) inference on traditional CPU/GPU systems is challenging due to a large memory footprint, sparse computational patterns, and irregular memory accesses with poor locality. Intel’s Programmable Integrated Unffied Memory Architecture (PIUMA) is designed to address these challenges for graph analytics. In this paper, a detailed characterization of GCNs is presented using the Open-Graph Benchmark...
-
Comparison of Selected Neural Network Models Used for Automatic Liver Tumor Segmentation
PublikacjaAutomatic and accurate segmentation of liver tumors is crucial for the diagnosis and treatment of hepatocellular carcinoma or metastases. However, the task remains challenging due to imprecise boundaries and significant variations in the shape, size, and location of tumors. The present study focuses on tumor segmentation as a more critical aspect from a medical perspective, compared to liver parenchyma segmentation, which is the...
-
Improving Accuracy of Respiratory Rate Estimation by Restoring High Resolution Features With Transformers and Recursive Convolutional Models
PublikacjaNon-contact evaluation of vital signs has been becoming increasingly important, especially in light of the COVID- 19 pandemic, which is causing the whole world to examine people’s interactions in public places at a scale never seen before. However, evaluating one’s vital signs can be a relatively complex procedure, which requires both time and physical contact between examiner and examinee. These re- quirements limit the number...
-
Information and communication technologies versus diffusion and substitution of financial innovations. The case of exchange-traded funds in Japan and South Korea
PublikacjaThe substitution between financial innovations, exchange-traded funds (ETFs), and stock index derivatives (i.e. index financial instruments) is one of the relatively understudied topics of the financial sciences. The current study aims to verify empirically the diffusion and substitution of ETFs in the market for index financial instruments. It presents in-depth analysis of the development of index financial instruments traded...
-
Detection of the acoustic interferences during AFM operation
Dane BadawczeAtomic force microscopy is a particularly complicated surface imaging technique due to the large number of factors that affect the quality of the resulting images. They are obviously difficult and sometimes even impossible to control at the same time. One of such factors may even be the seismological location of the building or the influence of mechanical...
-
ECG measurement in the bathtub - drl on the outside of the bathtub on one side- men
Dane BadawczeThe measurement data shows the measurement of the ECG signal in water in the bathtub. The data includes the measurement time, the reference ECG signal from the chest, and the ECG signal measured by electrodes placed in the bathtub without contact with the human body. Using the presented data, it is possible to estimate the optimal arrangement of measuring...
-
ECG measurement in the bathtub - electrodes at the feet, drl behind the back - women
Dane BadawczeThe measurement data shows the measurement of the ECG signal in water in the bathtub. The data includes the measurement time, the reference ECG signal from the chest, and the ECG signal measured by electrodes placed in the bathtub without contact with the human body. Using the presented data, it is possible to estimate the optimal arrangement of measuring...