Search results for: DATASET FEATURES, DATASET PROFILING VOCABULARIES - Bridge of Knowledge

Search

Search results for: DATASET FEATURES, DATASET PROFILING VOCABULARIES

Filters

total: 2302
filtered: 269

clear all filters


Chosen catalog filters

  • Category

  • Year

  • Options

clear Chosen catalog filters disabled

Search results for: DATASET FEATURES, DATASET PROFILING VOCABULARIES

  • RDF dataset profiling - a survey of features, methods, vocabularies and applications

    Publication
    • M. B. Ellefi
    • B. Zohra
    • J. G. Breslin
    • E. Demidova
    • S. Dietze
    • K. Todorov
    • J. Szymański

    - Semantic Web - Year 2018

    The Web of Data, and in particular Linked Data, has seen tremendous growth over the past years. However, reuse and take-up of these rich data sources is often limited and focused on a few well-known and established RDF datasets. This can be partially attributed to the lack of reliable and up-to-date information about the characteristics of available datasets. While RDF datasets vary heavily with respect to the features related...

  • Noise profiling for speech enhancement employing machine learning models

    Publication

    - Journal of the Acoustical Society of America - Year 2022

    This paper aims to propose a noise profiling method that can be performed in near real-time based on machine learning (ML). To address challenges related to noise profiling effectively, we start with a critical review of the literature background. Then, we outline the experiment performed consisting of two parts. The first part concerns the noise recognition model built upon several baseline classifiers and noise signal features...

    Full text available to download

  • Applying the Lombard Effect to Speech-in-Noise Communication

    Publication

    - Electronics - Year 2023

    This study explored how the Lombard effect, a natural or artificial increase in speech loudness in noisy environments, can improve speech-in-noise communication. This study consisted of several experiments that measured the impact of different types of noise on synthesizing the Lombard effect. The main steps were as follows: first, a dataset of speech samples with and without the Lombard effect was collected in a controlled setting;...

    Full text available to download

  • Non-Contact Temperature Measurements Dataset

    Publication

    - Year 2022

    The dataset titled The influence of the distance of the pyrometer from the surface of the radiating object on the accuracy of measurements contains temperature measurements using a selection of four commercially available pyrometers (CHY 314P, TM-F03B, TFA 31.1125 and AB-8855) as a function of the measuring distance. The dataset allows a comparison of the accuracy and measuring precision of the devices, which are very important...

    Full text available to download

  • AITP - AI Thermal Pedestrians Dataset

    Efficient pedestrian detection is a very important task in ensuring safety within road conditions, especially after sunset. One way to achieve this goal is to use thermal imaging in conjunction with deep learning methods and an annotated dataset for models training. In this work, such a dataset has been created by capturing thermal images of pedestrians in different weather and traffic conditions. All images were manually annotated...

    Full text to download in external service

  • The Optimum Dataset method – examples of the application

    Publication

    - Year 2018

    Data reduction is a procedure to decrease the dataset in order to make their analysis more effective and easier. Reduction of the dataset is an issue that requires proper planning, so after reduction it meets all the user’s expectations. Evidently, it is better if the result is an optimal solution in terms of adopted criteria. Within reduction methods, which provide the optimal solution there is the Optimum Dataset method (OptD)...

    Full text to download in external service

  • AC Motor Voltage and Audible Noise Dataset

    Publication

    - Year 2022

    The dataset titled AC motor voltage and audible noise waveforms in ship’s electrical drive systems with frequency converters contains the voltage and sound measurement results recorded in a marine frequency controlled AC drive system. The dataset is part of research focussing on the impact of the ship’s electrical drive systems with frequency converters on vibrations and the level of audible noise on ships. The dataset allows the...

    Full text available to download

  • DevEmo—Software Developers’ Facial Expression Dataset

    The COVID-19 pandemic has increased the relevance of remote activities and digital tools for education, work, and other aspects of daily life. This reality has highlighted the need for emotion recognition technology to better understand the emotions of computer users and provide support in remote environments. Emotion recognition can play a critical role in improving the remote experience and ensuring that individuals are able...

    Full text available to download

  • Long-Term Measurement of Physiological Parameters – Child Dataset

    Publication

    - Year 2022

    The dataset titled “Long-term measurement of physiological parameters – child is one dataset” of the bigger series named Long-term measurement of physiological parameters. The dataset contains physiological parameter measurements such as skin temperature and resistance, blood pulse, as well as the stress detection marker, which can have a value of 0 when there is no stress detected or 1 when stress appeared. Additionally, the dataset...

    Full text available to download

  • Video of LEGO Bricks on Conveyor Belt Dataset Series

    Publication

    - Year 2022

    The dataset series titled Video of LEGO bricks on conveyor belt is composed of 14 datasets containing video recordings of a moving white conveyor belt. The recordings were created using a smartphone camera in Full HD resolution. The dataset allows for the preparation of data for neural network training, and building of a LEGO sorting machine that can help builders to organise their collections.

    Full text available to download

  • Macrophytobenthos in the Puck Bay in 2010–2018 Dataset

    Publication

    - Year 2022

    The dataset titled Biomass of macrophytobenthos in the Puck Bay in 2010-2018 con-tains data on the qualitative composition and biomass of macrophytobenthos (flow-er plants and macroalgae) in samples collected in the Puck Bay area (Gulf of Gdańsk, southern Baltic Sea) at 20 stations between 2010–2018. The data was supplemented with additional information: values of measured parameters of water and sediment, e.g. tem-perature...

    Full text available to download

  • Application of the Optimum Dataset Method in Archeological Studies on Barrows

    Publication

    - Year 2018

    Light Detection and Ranging (LiDAR) became one of the technologies used in archaeological research. It allows for relatively easy detection of archaeological sites that have their own field form, e.g.: barrows, fortresses, tracts, ancient fields [1]. As a result of the scanning, the so-called point cloud is obtained, often consisting of millions of points. Such large measurement datasets are very time-consuming and labor-intensive...

    Full text to download in external service

  • The Central European GNSS Research Network (CEGRN) dataset

    Publication
    • J. Zurutuza
    • A. Caporali
    • M. Bertocco
    • M. Ishchenko
    • O. Khoda
    • H. Steffen
    • M. Figurski
    • E. Parseliunas
    • S. Berk
    • G. Nykiel

    - Data in Brief - Year 2019

    The Central European GNSS Research Network (CEGRN) collects GNSS data since 1994 from contributors which today include 42 Institutions in 33 Countries. CEGRN returns a dataset of coordinates and velocities computed according to international standards and the most recent processing procedures and recommendations. We provide a dataset of 1229 positions and velocities resulting from 3 or more repetitions of coordinate measurements...

    Full text available to download

  • Educational Dataset of Handheld Doppler Blood Flow Recordings

    Publication

    - Year 2022

    Vital signals registration plays a significant role in biomedical engineering and education process. Well acquired data allow future engineers to observe certain physical phenomena as well learn how to correctly process and interpret the data. This dataset was designed for students to learn about Doppler phenomena and to demonstrate correctly and incorrectly acquired signals as well as the basic methods of signal processing. This...

    Full text available to download

  • Description of the Dataset Hanow – Praecepta de Arte Disputandi – Transcription and Photographs

    Publication

    - Year 2022

    This article briefly characterises the “Hanow – Praecepta de arte disputandi – transcription and photographs” research dataset. The dataset was created based on photographs and transcriptions of the manuscript of the Latin lectures on the rules of effective discussion (the title of the manuscript: Praecepta de arte disputandi) by Michael Chris-toph Hanow (1695–1773), professor of Gdańsk Academic Gymnasium. The original document...

    Full text available to download

  • Medical Image Dataset Annotation Service (MIDAS)

    Publication

    - Year 2020

    MIDAS (Medical Image Dataset Annotation Service) is a custom-tailored tool for creating and managing datasets either for deep learning, as well as machine learning or any form of statistical research. The aim of the project is to provide one-fit-all platform for creating medical image datasets that could easily blend in hospital's workflow. In our work, we focus on the importance of medical data anonimization, discussing the...

    Full text to download in external service

  • Crack Mouth Opening Displacement for EH36 Shipbuilding Steel Measurements Dataset

    Publication

    - Year 2022

    The dataset titled EH36 steel for shipbuilding (plate thickness 50 mm) – CMOD – force record, a0/W=0.6 contains a CMOD (Crack Mouth Opening Displacement) – Force record which is the base for evaluation of the fracture toughness of structural steel. Bend specimens with a Bx2B section (B = 50 mm), and relative initial crack length a0/W=0.60 were used. The test was carried out at ambient temperature in accordance with the ISO 12135...

    Full text available to download

  • Impedance Spectra of RC Model as a Result of Testing Pulse Excitation Measurement Method Dataset

    Publication

    - Year 2022

    The dataset titled Impedance spectra of RC model as a result of testing pulse excitation measurement method contains the impedance spectrum of an exemplary test RC model obtained using pulse excitation. The dataset allows presentation of the accuracy of the impedance spectroscopy measuring instrument, which uses the pulse excitation method to shorten the time of the whole spectrum acquisition.

    Full text available to download

  • Down-Sampling of Large LiDAR Dataset in the Context of Off-Road Objects Extraction

    Publication

    - Geosciences - Year 2020

    Nowadays, LiDAR (Light Detection and Ranging) is used in many fields, such as transportation. Thanks to the recent technological improvements, the current generation of LiDAR mapping instruments available on the market allows to acquire up to millions of three-dimensional (3D) points per second. On the one hand, such improvements allowed the development of LiDAR-based systems with increased productivity, enabling the quick acquisition...

    Full text available to download

  • Measurement of the Temporal and Spatial Temperature Distribution on the Surface of PVCP Tissue Phantom Illuminated by Laser Dataset

    Publication

    - Year 2022

    The dataset entitled Measurement of the temporal and spatial temperature distribution on the surface of PVCP tissue phantom illuminated by laser was obtained with a laboratory set-up for characterisation of the thermal properties of optical tissue phantoms during laser irradiation. The dataset contains a single image file representing the spatial temperature distribution on the surface of a PVCP tissue phantom. This thermal image...

    Full text available to download

  • Dataset Related Experimental Investigation of Chess Position Evaluation Using a Deep Neural Network

    Publication

    The idea of training Articial Neural Networks to evaluate chess positions has been widely explored in the last ten years. In this paper we investigated dataset impact on chess position evaluation. We created two datasets with over 1.6 million unique chess positions each. In one of those we also included randomly generated positions resulting from consideration of potentially unpredictable chess moves. Each position was evaluated...

    Full text to download in external service

  • The molecular entities in linked data dataset

    Publication

    - Data in Brief - Year 2020

    Full text to download in external service

  • G2DC-PL+: a gridded 2 km daily climate dataset for the union of the Polish territory and the Vistula and Odra basins

    Publication

    - Earth System Science Data - Year 2021

    G2DC-PL+, a gridded 2 km daily climate dataset for the union of the Polish territory and the Vistula and Odra basins, is an update and extension of the CHASE-PL Forcing Data – Gridded Daily Precipitation and Temperature Dataset – 5 km (CPLFD-GDPT5). The latter was the first publicly available, high-resolution climate forcing dataset in Poland, used for a range of purposes including hydrological modelling and bias correction of...

    Full text available to download

  • Dataset Characteristics and Their Impact on Offline Policy Learning of Contextual Multi-Armed Bandits

    Publication

    The Contextual Multi-Armed Bandits (CMAB) framework is pivotal for learning to make decisions. However, due to challenges in deploying online algorithms, there is a shift towards offline policy learning, which relies on pre-existing datasets. This study examines the relationship between the quality of these datasets and the performance of offline policy learning algorithms, specifically, Neural Greedy and NeuraLCB. Our results...

    Full text available to download

  • Description of the Dataset Rhetoric at School – a Selection of the Syllabi from the Academic Gymnasium in Gdańsk – Transcription and Photographs

    Publication

    - Year 2022

    The research dataset described in the article was based on photographs and transcription of a textual record from Latin syllabi for classes at the Gdańsk Academic Gymnasium. The syllabi concern the years 1645/1648/1652/1653. The original document is held in the collection of the Gdańsk Library of the Polish Academy of Sciences [reference number: Ma 3920 8o]. The collected research material can be used for studying the practical...

    Full text available to download

  • Constructing a Dataset of Speech Recordingswith Lombard Effect

    Publication

    - Year 2020

    Thepurpose of therecordings was to create a speech corpus based on the ISLEdataset, extended with video and Lombard speech. Selected from a set of 165sentences, 10, evaluatedas having thehighest possibility to occur in the context ofthe Lombard effect,were repeated in the presence of the so-called babble speech to obtain Lombard speech features. Altogether,15speakers were recorded, and speech parameterswere...

  • Using Synchronously Registered Biosignals Dataset for Teaching Basics of Medical Data Analysis – Case Study

    Publication

    - Year 2022

    Medical data analysis and processing strongly relies on the data quality itself. The correct data registration allows many unnecessary steps in data processing to be avoided. Moreover, it takes a certain amount of experience to acquire data that can produce replicable results. Because consistency is crucial in the teaching process, students have access to pre-recorded real data without the necessity of using additional equipment...

    Full text available to download

  • AGAR a Microbial Colony Dataset for Deep Learning Detection

    Publication
    • S. Majchrowska
    • J. Pawlowski
    • G. Gula
    • T. Bonus
    • A. Hanas
    • A. Loch
    • A. Pawlak
    • J. Roszkowiak
    • T. Golan
    • Z. Drulis-Kawa

    - Year 2021

    Full text to download in external service

  • Regeneration Project of Market Places GOSPOSTRATEG – “Polanki” Market in Gdańsk-Oliwa Pilot Project Monitoring Dataset

    Publication

    - Year 2022

    The dataset entitled Monitoring of activities carried out as part of prototyping and implementation of the pilot project in the area of the “Polanki” market and its direct neighbourhood, in the Gdańsk-Oliwa district, step1; stage from July 2020 year contains tabular monitoring lists (quantitative and qualitative documentation report in the form of tables) of activities carried out as part of the prototyping and implementation of...

    Full text available to download

  • Dataset Relating Collective Angst, Identifications, Essentialist Continuity and Collective Action for Progressive City Policy among Gdańsk Residents

    Publication

    - Year 2022

    This dataset contains the individual responses of 456 residents of Gdańsk who participated in the study. The study was conducted before the second term of the presidential election in Poland in 2020. Demographic variables as well as psychological measures of angst, place attachment, identification in-group continuity and willingness to engage in collective action were collected. We also measured the perception of the risk of...

    Full text available to download

  • Generation of microbial colonies dataset with deep learning style transfer

    Publication

    - Scientific Reports - Year 2022

    Full text to download in external service

  • Process of Medical Dataset Construction for Machine Learning-Multifield Study and Guidelines

    Publication

    The acquisition of high-quality data and annotations is essential for the training of efficient machine learning algorithms, while being an expensive and time-consuming process. Although the process of data processing and training and testing of machine learning models is well studied and considered in the literature, the actual procedures of obtaining data and their annotations in collaboration with physicians are in most cases...

  • A European Multi Lake Survey dataset of environmental variables, phytoplankton pigments and cyanotoxins

    Publication
    • E. Mantzouki
    • J. Campbell
    • E. van
    • P. Visser
    • I. Konstantinou
    • M. Antoniou
    • G. Giuliani
    • D. Machado-Vieira
    • A. Gurjão
    • D. Maronić... and 196 others

    - Scientific Data - Year 2018

    Full text to download in external service

  • Segmentation Quality Refinement in Large-Scale Medical Image Dataset with Crowd-Sourced Annotations

    Publication

    Deployment of different techniques of deep learning including Convolutional Neural Networks (CNN) in image classification systems has accomplished outstanding results. However, the advantages and potential impact of such a system can be completely negated if it does not reach a target accuracy. To achieve high classification accuracy with low variance in medical image classification system, there is needed the large size of the...

    Full text to download in external service

  • Identification of High-Value Dataset determinants: is there a silver bullet for efficient sustainability-oriented data-driven development?

    Publication

    - Year 2023

    Open Government Data (OGD) are seen as one of the trends that has the potential to benefit the economy, improve the quality, efficiency, and transparency of public administration, and change the lives of citizens, and the society as a whole facilitating efficient sustainability-oriented data-driven services. However, the quick achievement of these benefits is closely related to the “value” of the OGD, i.e., how useful, and reusable...

    Full text to download in external service

  • Effective Air Quality Prediction Using Reinforced Swarm Optimization and Bi-Directional Gated Recurrent Unit

    Publication

    - Sustainability - Year 2023

    In the present scenario, air quality prediction (AQP) is a complex task due to high variability, volatility, and dynamic nature in space and time of particulates and pollutants. Recently, several nations have had poor air quality due to the high emission of particulate matter (PM2.5) that affects human health conditions, especially in urban areas. In this research, a new optimization-based regression model was implemented for effective...

    Full text available to download

  • Induction of the common-sense hierarchies in lexical data

    Publication

    Unsupervised organization of a set of lexical concepts that captures common-sense knowledge inducting meaningful partitioning of data is described. Projection of data on principal components allow for dentification of clusters with wide margins, and the procedure is recursively repeated within each cluster. Application of this idea to a simple dataset describing animals created hierarchical partitioning with each clusters related...

  • Thermal imaging in automatic rodent’s social behaviour analysis

    Publication

    - Year 2016

    Laboratory rodent social behaviour analysis is an extremely important task for biological, medical and pharmacological researches. In this work thermal images features that facilitate analysis are presented. Methods to distinguish objects on the basis of thermal distribution are tested. Actions of grooming or biting one rodent by another - important social behaviour incidents - are clearly visible...

    Full text to download in external service

  • High-Resolution Wind Wave Parameters in the Area of the Gulf of Gdańsk During 21 Extreme Storms

    Publication

    This dataset contains the results of wind-wave parameter modelling in the area of the Gulf of Gdańsk (Southern Baltic). For the simulations, a high resolution SWAN model was used. The dataset consists of the significant wave height, the direction of the wave approaching the shore and the wave period during 21 historical, extreme storms. The storms were selected by an automatic search over the 44-year-long significant wave height...

    Full text available to download

  • Mechanical Properties of Human Stomach Tissue

    Publication

    - Year 2022

    The dataset entitled Determination of mechanical properties of human stomach tissues subjected to uniaxial stretching contains: the length of the sample as a function of the corresponding load (tensile force) and the initial values of the average width and average thickness of the sample. All tests were conducted in a self-developed tensile test machine: PG TissueTester. The dataset allows the coefficients of various models of...

    Full text available to download

  • A Triplet-Learnt Coarse-to-Fine Reranking for Vehicle Re-identification

    Publication

    - Year 2020

    Vehicle re-identification refers to the task of matching the same query vehicle across non-overlapping cameras and diverse viewpoints. Research interest on the field emerged with intelligent transportation systems and the necessity for public security maintenance. Compared to person, vehicle re-identification is more intricate, facing the challenges of lower intra-class and higher inter-class similarities. Motivated by deep...

    Full text to download in external service

  • Methodology of Constructing and Analyzing the Hierarchical Contextually-Oriented Corpora

    Publication

    - Year 2018

    Methodology of Constructing and Analyzing the Hierarchical structure of the Contextually-Oriented Corpora was developed. The methodology contains the following steps: Contextual Component of the Corpora’s Structure Building; Text Analysis of the Contextually-Oriented Hierarchical Corpus. Main contribution of this study is the following: hierarchical structure of the Corpus provides advanced possibilities for identification of the...

    Full text available to download

  • Selection of Visual Descriptors for the Purpose of Multi-camera Object Re-identification

    A comparative analysis of various visual descriptors is presented in this chapter. The descriptors utilize many aspects of image data: colour, texture, gradient, and statistical moments. The descriptor list is supplemented with local features calculated in close vicinity of key points found automatically in the image. The goal of the analysis is to find descriptors that are best suited for particular task, i.e. re-identification...

    Full text to download in external service

  • Vehicle Detection and Speed Estimation Using Millimetre Wave Radar

    Publication

    - Year 2022

    The dataset titled Data from 76- to 81-GHz mmWave Sensor located at S7 road contains data recorded employing an IWR1642 mmWave sensor from Texas Instruments. The data comes from two sessions lasting 24h each. The dataset provides the possibility to perform analyses related to car traffic intensity on one of the carriageways of the motorway heading to the Gdańsk metropolitan area. Based on the gathered data, it is possible to calculate...

    Full text available to download

  • Reduction of measurement data before Digital Terrain Model generation vs. DTM generalisation

    Publication

    - SURVEY REVIEW - Year 2018

    Modern data acquisition technologies provide large datasets that are not always necessary in its entirety to properly accomplish the goal of the study. In addition, such datasets are often cumbersome for rational processing, and their processing is time and labour consuming. Therefore, methods that enable to reduce the size of the measurement dataset, such as the generalization of the Digital Terrain Model (DTM) or the reduction...

    Full text to download in external service

  • Balanced Spider Monkey Optimization with Bi-LSTM for Sustainable Air Quality Prediction

    Publication

    - Sustainability - Year 2023

    A reliable air quality prediction model is required for pollution control, human health monitoring, and sustainability. The existing air quality prediction models lack efficiency due to overfitting in prediction model and local optima trap in feature selection. This study proposes the Balanced Spider Monkey Optimization (BSMO) technique for effective feature selection to overcome the local optima trap and overfitting problems....

    Full text available to download

  • Using Convolutional Neural Networks for Corneal Arcus Detection Towards Familial Hypercholesterolemia Screening

    Publication

    Familial hypercholesterolemia (FH) is a highly undiagnosed disease. Among FH patients, the onset of premature coronary artery disease is 13 times higher than in the general population. Early diagnosis and treatment is essential to prevent cardiovascular diseases and their complications, and to prolong life. One of the clinical criteria of FH is the occurrence of a corneal arcus (CA) among patients, especially those under 45 years...

    Full text available to download

  • Real-Time Facial Features Detection from Low Resolution Thermal Images with Deep Classification Models

    Deep networks have already shown a spectacular success for object classification and detection for various applications from everyday use cases to advanced medical problems. The main advantage of the classification models over the detection models is less time and effort needed for dataset preparation, because classification networks do not require bounding box annotations, but labels at the image level only. Yet, after passing...

    Full text to download in external service

  • Personalized prediction of the secondary oocytes number after ovarian stimulation: A machine learning model based on clinical and genetic data

    Publication
    • K. Zieliński
    • S. Pukszta
    • M. Mickiewicz
    • M. Kotlarz
    • P. Wygocki
    • M. Zieleń
    • D. Drzewiecka
    • D. Drzyzga
    • A. Kloska
    • J. Jakóbkiewicz-Banecka

    - PLoS Computational Biology - Year 2023

    Controlled ovarian stimulation is tailored to the patient based on clinical parameters but estimating the number of retrieved metaphase II (MII) oocytes is a challenge. Here, we have developed a model that takes advantage of the patient’s genetic and clinical characteristics simultaneously for predicting the stimulation outcome. Sequence variants in reproduction-related genes identified by next-generation sequencing were matched...

    Full text available to download

  • The OptD-multi method in LiDAR processing

    Publication

    - MEASUREMENT SCIENCE & TECHNOLOGY - Year 2017

    New and constantly developing technology for acquiring spatial data, such as LiDAR (light detection and ranging), is a source for large volume of data. However, such amount of data is not always needed for developing the most popular LiDAR products: digital terrain model (DTM) or digital surface model. Therefore, in many cases, the number of contained points are reduced in the pre-processing stage. The degree of reduction is determined...

    Full text to download in external service