Filtry
wszystkich: 2267
wybranych: 266
Wyniki wyszukiwania dla: DATASET FEATURES, DATASET PROFILING VOCABULARIES
-
RDF dataset profiling - a survey of features, methods, vocabularies and applications
PublikacjaThe Web of Data, and in particular Linked Data, has seen tremendous growth over the past years. However, reuse and take-up of these rich data sources is often limited and focused on a few well-known and established RDF datasets. This can be partially attributed to the lack of reliable and up-to-date information about the characteristics of available datasets. While RDF datasets vary heavily with respect to the features related...
-
Noise profiling for speech enhancement employing machine learning models
PublikacjaThis paper aims to propose a noise profiling method that can be performed in near real-time based on machine learning (ML). To address challenges related to noise profiling effectively, we start with a critical review of the literature background. Then, we outline the experiment performed consisting of two parts. The first part concerns the noise recognition model built upon several baseline classifiers and noise signal features...
-
Applying the Lombard Effect to Speech-in-Noise Communication
PublikacjaThis study explored how the Lombard effect, a natural or artificial increase in speech loudness in noisy environments, can improve speech-in-noise communication. This study consisted of several experiments that measured the impact of different types of noise on synthesizing the Lombard effect. The main steps were as follows: first, a dataset of speech samples with and without the Lombard effect was collected in a controlled setting;...
-
Non-Contact Temperature Measurements Dataset
PublikacjaThe dataset titled The influence of the distance of the pyrometer from the surface of the radiating object on the accuracy of measurements contains temperature measurements using a selection of four commercially available pyrometers (CHY 314P, TM-F03B, TFA 31.1125 and AB-8855) as a function of the measuring distance. The dataset allows a comparison of the accuracy and measuring precision of the devices, which are very important...
-
AITP - AI Thermal Pedestrians Dataset
PublikacjaEfficient pedestrian detection is a very important task in ensuring safety within road conditions, especially after sunset. One way to achieve this goal is to use thermal imaging in conjunction with deep learning methods and an annotated dataset for models training. In this work, such a dataset has been created by capturing thermal images of pedestrians in different weather and traffic conditions. All images were manually annotated...
-
The Optimum Dataset method – examples of the application
PublikacjaData reduction is a procedure to decrease the dataset in order to make their analysis more effective and easier. Reduction of the dataset is an issue that requires proper planning, so after reduction it meets all the user’s expectations. Evidently, it is better if the result is an optimal solution in terms of adopted criteria. Within reduction methods, which provide the optimal solution there is the Optimum Dataset method (OptD)...
-
AC Motor Voltage and Audible Noise Dataset
PublikacjaThe dataset titled AC motor voltage and audible noise waveforms in ship’s electrical drive systems with frequency converters contains the voltage and sound measurement results recorded in a marine frequency controlled AC drive system. The dataset is part of research focussing on the impact of the ship’s electrical drive systems with frequency converters on vibrations and the level of audible noise on ships. The dataset allows the...
-
DevEmo—Software Developers’ Facial Expression Dataset
PublikacjaThe COVID-19 pandemic has increased the relevance of remote activities and digital tools for education, work, and other aspects of daily life. This reality has highlighted the need for emotion recognition technology to better understand the emotions of computer users and provide support in remote environments. Emotion recognition can play a critical role in improving the remote experience and ensuring that individuals are able...
-
Long-Term Measurement of Physiological Parameters – Child Dataset
PublikacjaThe dataset titled “Long-term measurement of physiological parameters – child is one dataset” of the bigger series named Long-term measurement of physiological parameters. The dataset contains physiological parameter measurements such as skin temperature and resistance, blood pulse, as well as the stress detection marker, which can have a value of 0 when there is no stress detected or 1 when stress appeared. Additionally, the dataset...
-
Video of LEGO Bricks on Conveyor Belt Dataset Series
PublikacjaThe dataset series titled Video of LEGO bricks on conveyor belt is composed of 14 datasets containing video recordings of a moving white conveyor belt. The recordings were created using a smartphone camera in Full HD resolution. The dataset allows for the preparation of data for neural network training, and building of a LEGO sorting machine that can help builders to organise their collections.
-
Macrophytobenthos in the Puck Bay in 2010–2018 Dataset
PublikacjaThe dataset titled Biomass of macrophytobenthos in the Puck Bay in 2010-2018 con-tains data on the qualitative composition and biomass of macrophytobenthos (flow-er plants and macroalgae) in samples collected in the Puck Bay area (Gulf of Gdańsk, southern Baltic Sea) at 20 stations between 2010–2018. The data was supplemented with additional information: values of measured parameters of water and sediment, e.g. tem-perature...
-
Application of the Optimum Dataset Method in Archeological Studies on Barrows
PublikacjaLight Detection and Ranging (LiDAR) became one of the technologies used in archaeological research. It allows for relatively easy detection of archaeological sites that have their own field form, e.g.: barrows, fortresses, tracts, ancient fields [1]. As a result of the scanning, the so-called point cloud is obtained, often consisting of millions of points. Such large measurement datasets are very time-consuming and labor-intensive...
-
The Central European GNSS Research Network (CEGRN) dataset
PublikacjaThe Central European GNSS Research Network (CEGRN) collects GNSS data since 1994 from contributors which today include 42 Institutions in 33 Countries. CEGRN returns a dataset of coordinates and velocities computed according to international standards and the most recent processing procedures and recommendations. We provide a dataset of 1229 positions and velocities resulting from 3 or more repetitions of coordinate measurements...
-
Educational Dataset of Handheld Doppler Blood Flow Recordings
PublikacjaVital signals registration plays a significant role in biomedical engineering and education process. Well acquired data allow future engineers to observe certain physical phenomena as well learn how to correctly process and interpret the data. This dataset was designed for students to learn about Doppler phenomena and to demonstrate correctly and incorrectly acquired signals as well as the basic methods of signal processing. This...
-
Description of the Dataset Hanow – Praecepta de Arte Disputandi – Transcription and Photographs
PublikacjaThis article briefly characterises the “Hanow – Praecepta de arte disputandi – transcription and photographs” research dataset. The dataset was created based on photographs and transcriptions of the manuscript of the Latin lectures on the rules of effective discussion (the title of the manuscript: Praecepta de arte disputandi) by Michael Chris-toph Hanow (1695–1773), professor of Gdańsk Academic Gymnasium. The original document...
-
Medical Image Dataset Annotation Service (MIDAS)
PublikacjaMIDAS (Medical Image Dataset Annotation Service) is a custom-tailored tool for creating and managing datasets either for deep learning, as well as machine learning or any form of statistical research. The aim of the project is to provide one-fit-all platform for creating medical image datasets that could easily blend in hospital's workflow. In our work, we focus on the importance of medical data anonimization, discussing the...
-
Crack Mouth Opening Displacement for EH36 Shipbuilding Steel Measurements Dataset
PublikacjaThe dataset titled EH36 steel for shipbuilding (plate thickness 50 mm) – CMOD – force record, a0/W=0.6 contains a CMOD (Crack Mouth Opening Displacement) – Force record which is the base for evaluation of the fracture toughness of structural steel. Bend specimens with a Bx2B section (B = 50 mm), and relative initial crack length a0/W=0.60 were used. The test was carried out at ambient temperature in accordance with the ISO 12135...
-
Impedance Spectra of RC Model as a Result of Testing Pulse Excitation Measurement Method Dataset
PublikacjaThe dataset titled Impedance spectra of RC model as a result of testing pulse excitation measurement method contains the impedance spectrum of an exemplary test RC model obtained using pulse excitation. The dataset allows presentation of the accuracy of the impedance spectroscopy measuring instrument, which uses the pulse excitation method to shorten the time of the whole spectrum acquisition.
-
Down-Sampling of Large LiDAR Dataset in the Context of Off-Road Objects Extraction
PublikacjaNowadays, LiDAR (Light Detection and Ranging) is used in many fields, such as transportation. Thanks to the recent technological improvements, the current generation of LiDAR mapping instruments available on the market allows to acquire up to millions of three-dimensional (3D) points per second. On the one hand, such improvements allowed the development of LiDAR-based systems with increased productivity, enabling the quick acquisition...
-
Measurement of the Temporal and Spatial Temperature Distribution on the Surface of PVCP Tissue Phantom Illuminated by Laser Dataset
PublikacjaThe dataset entitled Measurement of the temporal and spatial temperature distribution on the surface of PVCP tissue phantom illuminated by laser was obtained with a laboratory set-up for characterisation of the thermal properties of optical tissue phantoms during laser irradiation. The dataset contains a single image file representing the spatial temperature distribution on the surface of a PVCP tissue phantom. This thermal image...
-
Dataset Related Experimental Investigation of Chess Position Evaluation Using a Deep Neural Network
PublikacjaThe idea of training Articial Neural Networks to evaluate chess positions has been widely explored in the last ten years. In this paper we investigated dataset impact on chess position evaluation. We created two datasets with over 1.6 million unique chess positions each. In one of those we also included randomly generated positions resulting from consideration of potentially unpredictable chess moves. Each position was evaluated...
-
The molecular entities in linked data dataset
Publikacja -
G2DC-PL+: a gridded 2 km daily climate dataset for the union of the Polish territory and the Vistula and Odra basins
PublikacjaG2DC-PL+, a gridded 2 km daily climate dataset for the union of the Polish territory and the Vistula and Odra basins, is an update and extension of the CHASE-PL Forcing Data – Gridded Daily Precipitation and Temperature Dataset – 5 km (CPLFD-GDPT5). The latter was the first publicly available, high-resolution climate forcing dataset in Poland, used for a range of purposes including hydrological modelling and bias correction of...
-
Description of the Dataset Rhetoric at School – a Selection of the Syllabi from the Academic Gymnasium in Gdańsk – Transcription and Photographs
PublikacjaThe research dataset described in the article was based on photographs and transcription of a textual record from Latin syllabi for classes at the Gdańsk Academic Gymnasium. The syllabi concern the years 1645/1648/1652/1653. The original document is held in the collection of the Gdańsk Library of the Polish Academy of Sciences [reference number: Ma 3920 8o]. The collected research material can be used for studying the practical...
-
Constructing a Dataset of Speech Recordingswith Lombard Effect
PublikacjaThepurpose of therecordings was to create a speech corpus based on the ISLEdataset, extended with video and Lombard speech. Selected from a set of 165sentences, 10, evaluatedas having thehighest possibility to occur in the context ofthe Lombard effect,were repeated in the presence of the so-called babble speech to obtain Lombard speech features. Altogether,15speakers were recorded, and speech parameterswere...
-
Using Synchronously Registered Biosignals Dataset for Teaching Basics of Medical Data Analysis – Case Study
PublikacjaMedical data analysis and processing strongly relies on the data quality itself. The correct data registration allows many unnecessary steps in data processing to be avoided. Moreover, it takes a certain amount of experience to acquire data that can produce replicable results. Because consistency is crucial in the teaching process, students have access to pre-recorded real data without the necessity of using additional equipment...
-
AGAR a Microbial Colony Dataset for Deep Learning Detection
Publikacja -
Regeneration Project of Market Places GOSPOSTRATEG – “Polanki” Market in Gdańsk-Oliwa Pilot Project Monitoring Dataset
PublikacjaThe dataset entitled Monitoring of activities carried out as part of prototyping and implementation of the pilot project in the area of the “Polanki” market and its direct neighbourhood, in the Gdańsk-Oliwa district, step1; stage from July 2020 year contains tabular monitoring lists (quantitative and qualitative documentation report in the form of tables) of activities carried out as part of the prototyping and implementation of...
-
Dataset Relating Collective Angst, Identifications, Essentialist Continuity and Collective Action for Progressive City Policy among Gdańsk Residents
PublikacjaThis dataset contains the individual responses of 456 residents of Gdańsk who participated in the study. The study was conducted before the second term of the presidential election in Poland in 2020. Demographic variables as well as psychological measures of angst, place attachment, identification in-group continuity and willingness to engage in collective action were collected. We also measured the perception of the risk of...
-
Generation of microbial colonies dataset with deep learning style transfer
Publikacja -
Process of Medical Dataset Construction for Machine Learning-Multifield Study and Guidelines
PublikacjaThe acquisition of high-quality data and annotations is essential for the training of efficient machine learning algorithms, while being an expensive and time-consuming process. Although the process of data processing and training and testing of machine learning models is well studied and considered in the literature, the actual procedures of obtaining data and their annotations in collaboration with physicians are in most cases...
-
A European Multi Lake Survey dataset of environmental variables, phytoplankton pigments and cyanotoxins
Publikacja -
Segmentation Quality Refinement in Large-Scale Medical Image Dataset with Crowd-Sourced Annotations
PublikacjaDeployment of different techniques of deep learning including Convolutional Neural Networks (CNN) in image classification systems has accomplished outstanding results. However, the advantages and potential impact of such a system can be completely negated if it does not reach a target accuracy. To achieve high classification accuracy with low variance in medical image classification system, there is needed the large size of the...
-
Identification of High-Value Dataset determinants: is there a silver bullet for efficient sustainability-oriented data-driven development?
PublikacjaOpen Government Data (OGD) are seen as one of the trends that has the potential to benefit the economy, improve the quality, efficiency, and transparency of public administration, and change the lives of citizens, and the society as a whole facilitating efficient sustainability-oriented data-driven services. However, the quick achievement of these benefits is closely related to the “value” of the OGD, i.e., how useful, and reusable...
-
Effective Air Quality Prediction Using Reinforced Swarm Optimization and Bi-Directional Gated Recurrent Unit
PublikacjaIn the present scenario, air quality prediction (AQP) is a complex task due to high variability, volatility, and dynamic nature in space and time of particulates and pollutants. Recently, several nations have had poor air quality due to the high emission of particulate matter (PM2.5) that affects human health conditions, especially in urban areas. In this research, a new optimization-based regression model was implemented for effective...
-
Induction of the common-sense hierarchies in lexical data
PublikacjaUnsupervised organization of a set of lexical concepts that captures common-sense knowledge inducting meaningful partitioning of data is described. Projection of data on principal components allow for dentification of clusters with wide margins, and the procedure is recursively repeated within each cluster. Application of this idea to a simple dataset describing animals created hierarchical partitioning with each clusters related...
-
Thermal imaging in automatic rodent’s social behaviour analysis
PublikacjaLaboratory rodent social behaviour analysis is an extremely important task for biological, medical and pharmacological researches. In this work thermal images features that facilitate analysis are presented. Methods to distinguish objects on the basis of thermal distribution are tested. Actions of grooming or biting one rodent by another - important social behaviour incidents - are clearly visible...
-
High-Resolution Wind Wave Parameters in the Area of the Gulf of Gdańsk During 21 Extreme Storms
PublikacjaThis dataset contains the results of wind-wave parameter modelling in the area of the Gulf of Gdańsk (Southern Baltic). For the simulations, a high resolution SWAN model was used. The dataset consists of the significant wave height, the direction of the wave approaching the shore and the wave period during 21 historical, extreme storms. The storms were selected by an automatic search over the 44-year-long significant wave height...
-
Mechanical Properties of Human Stomach Tissue
PublikacjaThe dataset entitled Determination of mechanical properties of human stomach tissues subjected to uniaxial stretching contains: the length of the sample as a function of the corresponding load (tensile force) and the initial values of the average width and average thickness of the sample. All tests were conducted in a self-developed tensile test machine: PG TissueTester. The dataset allows the coefficients of various models of...
-
A Triplet-Learnt Coarse-to-Fine Reranking for Vehicle Re-identification
PublikacjaVehicle re-identification refers to the task of matching the same query vehicle across non-overlapping cameras and diverse viewpoints. Research interest on the field emerged with intelligent transportation systems and the necessity for public security maintenance. Compared to person, vehicle re-identification is more intricate, facing the challenges of lower intra-class and higher inter-class similarities. Motivated by deep...
-
Methodology of Constructing and Analyzing the Hierarchical Contextually-Oriented Corpora
PublikacjaMethodology of Constructing and Analyzing the Hierarchical structure of the Contextually-Oriented Corpora was developed. The methodology contains the following steps: Contextual Component of the Corpora’s Structure Building; Text Analysis of the Contextually-Oriented Hierarchical Corpus. Main contribution of this study is the following: hierarchical structure of the Corpus provides advanced possibilities for identification of the...
-
Selection of Visual Descriptors for the Purpose of Multi-camera Object Re-identification
PublikacjaA comparative analysis of various visual descriptors is presented in this chapter. The descriptors utilize many aspects of image data: colour, texture, gradient, and statistical moments. The descriptor list is supplemented with local features calculated in close vicinity of key points found automatically in the image. The goal of the analysis is to find descriptors that are best suited for particular task, i.e. re-identification...
-
Vehicle Detection and Speed Estimation Using Millimetre Wave Radar
PublikacjaThe dataset titled Data from 76- to 81-GHz mmWave Sensor located at S7 road contains data recorded employing an IWR1642 mmWave sensor from Texas Instruments. The data comes from two sessions lasting 24h each. The dataset provides the possibility to perform analyses related to car traffic intensity on one of the carriageways of the motorway heading to the Gdańsk metropolitan area. Based on the gathered data, it is possible to calculate...
-
Reduction of measurement data before Digital Terrain Model generation vs. DTM generalisation
PublikacjaModern data acquisition technologies provide large datasets that are not always necessary in its entirety to properly accomplish the goal of the study. In addition, such datasets are often cumbersome for rational processing, and their processing is time and labour consuming. Therefore, methods that enable to reduce the size of the measurement dataset, such as the generalization of the Digital Terrain Model (DTM) or the reduction...
-
Balanced Spider Monkey Optimization with Bi-LSTM for Sustainable Air Quality Prediction
PublikacjaA reliable air quality prediction model is required for pollution control, human health monitoring, and sustainability. The existing air quality prediction models lack efficiency due to overfitting in prediction model and local optima trap in feature selection. This study proposes the Balanced Spider Monkey Optimization (BSMO) technique for effective feature selection to overcome the local optima trap and overfitting problems....
-
Using Convolutional Neural Networks for Corneal Arcus Detection Towards Familial Hypercholesterolemia Screening
PublikacjaFamilial hypercholesterolemia (FH) is a highly undiagnosed disease. Among FH patients, the onset of premature coronary artery disease is 13 times higher than in the general population. Early diagnosis and treatment is essential to prevent cardiovascular diseases and their complications, and to prolong life. One of the clinical criteria of FH is the occurrence of a corneal arcus (CA) among patients, especially those under 45 years...
-
Real-Time Facial Features Detection from Low Resolution Thermal Images with Deep Classification Models
PublikacjaDeep networks have already shown a spectacular success for object classification and detection for various applications from everyday use cases to advanced medical problems. The main advantage of the classification models over the detection models is less time and effort needed for dataset preparation, because classification networks do not require bounding box annotations, but labels at the image level only. Yet, after passing...
-
Personalized prediction of the secondary oocytes number after ovarian stimulation: A machine learning model based on clinical and genetic data
PublikacjaControlled ovarian stimulation is tailored to the patient based on clinical parameters but estimating the number of retrieved metaphase II (MII) oocytes is a challenge. Here, we have developed a model that takes advantage of the patient’s genetic and clinical characteristics simultaneously for predicting the stimulation outcome. Sequence variants in reproduction-related genes identified by next-generation sequencing were matched...
-
The OptD-multi method in LiDAR processing
PublikacjaNew and constantly developing technology for acquiring spatial data, such as LiDAR (light detection and ranging), is a source for large volume of data. However, such amount of data is not always needed for developing the most popular LiDAR products: digital terrain model (DTM) or digital surface model. Therefore, in many cases, the number of contained points are reduced in the pre-processing stage. The degree of reduction is determined...
-
Style Transfer for Detecting Vehicles with Thermal Camera
PublikacjaIn this work we focus on nighttime vehicle detection for intelligent traffic monitoring from the thermal camera. To train a Convolutional Neural Network (CNN) detector we create a stylized version of COCO (Common Objects in Context) dataset using Style Transfer technique that imitates images obtained from thermal cameras. This new dataset is further used for fine-tuning of the model and as a result detection accuracy on images...