Search results for: DATASET
-
AITP - AI Thermal Pedestrians Dataset
Publication: Efficient pedestrian detection is an important task for ensuring road safety, especially after sunset. One way to achieve this goal is to use thermal imaging in conjunction with deep learning methods and an annotated dataset for model training. In this work, such a dataset has been created by capturing thermal images of pedestrians in different weather and traffic conditions. All images were manually annotated...
-
The Optimum Dataset method – examples of the application
Publication: Data reduction is a procedure that decreases the size of a dataset in order to make its analysis more effective and easier. Reduction of the dataset requires proper planning so that, after reduction, it meets all the user’s expectations. Evidently, it is better if the result is an optimal solution in terms of adopted criteria. Among the reduction methods that provide the optimal solution is the Optimum Dataset method (OptD)...
-
Non-Contact Temperature Measurements Dataset
Publication: The dataset titled The influence of the distance of the pyrometer from the surface of the radiating object on the accuracy of measurements contains temperature measurements using a selection of four commercially available pyrometers (CHY 314P, TM-F03B, TFA 31.1125 and AB-8855) as a function of the measuring distance. The dataset allows a comparison of the accuracy and measuring precision of the devices, which are very important...
-
The molecular entities in linked data dataset
Publication -
DevEmo—Software Developers’ Facial Expression Dataset
Publication: The COVID-19 pandemic has increased the relevance of remote activities and digital tools for education, work, and other aspects of daily life. This reality has highlighted the need for emotion recognition technology to better understand the emotions of computer users and provide support in remote environments. Emotion recognition can play a critical role in improving the remote experience and ensuring that individuals are able...
-
AC Motor Voltage and Audible Noise Dataset
Publication: The dataset titled AC motor voltage and audible noise waveforms in ship’s electrical drive systems with frequency converters contains the voltage and sound measurement results recorded in a marine frequency controlled AC drive system. The dataset is part of research focussing on the impact of the ship’s electrical drive systems with frequency converters on vibrations and the level of audible noise on ships. The dataset allows the...
-
Macrophytobenthos in the Puck Bay in 2010–2018 Dataset
Publication: The dataset titled Biomass of macrophytobenthos in the Puck Bay in 2010-2018 contains data on the qualitative composition and biomass of macrophytobenthos (flower plants and macroalgae) in samples collected in the Puck Bay area (Gulf of Gdańsk, southern Baltic Sea) at 20 stations between 2010–2018. The data was supplemented with additional information: values of measured parameters of water and sediment, e.g. temperature...
-
Medical Image Dataset Annotation Service (MIDAS)
Publication: MIDAS (Medical Image Dataset Annotation Service) is a custom-tailored tool for creating and managing datasets for deep learning, machine learning, or any form of statistical research. The aim of the project is to provide a one-size-fits-all platform for creating medical image datasets that could easily blend into a hospital's workflow. In our work, we focus on the importance of medical data anonymization, discussing the...
-
Constructing a Dataset of Speech Recordings with Lombard Effect
Publication: The purpose of the recordings was to create a speech corpus based on the ISLE dataset, extended with video and Lombard speech. Selected from a set of 165 sentences, 10, evaluated as having the highest possibility to occur in the context of the Lombard effect, were repeated in the presence of the so-called babble speech to obtain Lombard speech features. Altogether, 15 speakers were recorded, and speech parameters were...
-
Long-Term Measurement of Physiological Parameters – Child Dataset
Publication: The dataset titled “Long-term measurement of physiological parameters – child” is one dataset of the bigger series named Long-term measurement of physiological parameters. The dataset contains physiological parameter measurements such as skin temperature and resistance, blood pulse, as well as the stress detection marker, which can have a value of 0 when there is no stress detected or 1 when stress appeared. Additionally, the dataset...
-
Video of LEGO Bricks on Conveyor Belt Dataset Series
Publication: The dataset series titled Video of LEGO bricks on conveyor belt is composed of 14 datasets containing video recordings of a moving white conveyor belt. The recordings were created using a smartphone camera in Full HD resolution. The dataset allows for the preparation of data for neural network training, and building of a LEGO sorting machine that can help builders to organise their collections.
-
Application of the Optimum Dataset Method in Archeological Studies on Barrows
Publication: Light Detection and Ranging (LiDAR) has become one of the technologies used in archaeological research. It allows for relatively easy detection of archaeological sites that have their own field form, e.g. barrows, fortresses, tracts, ancient fields [1]. As a result of the scanning, the so-called point cloud is obtained, often consisting of millions of points. Such large measurement datasets are very time-consuming and labor-intensive...
-
The Central European GNSS Research Network (CEGRN) dataset
Publication: The Central European GNSS Research Network (CEGRN) has collected GNSS data since 1994 from contributors, which today include 42 institutions in 33 countries. CEGRN returns a dataset of coordinates and velocities computed according to international standards and the most recent processing procedures and recommendations. We provide a dataset of 1229 positions and velocities resulting from 3 or more repetitions of coordinate measurements...
-
Educational Dataset of Handheld Doppler Blood Flow Recordings
Publication: Vital signals registration plays a significant role in biomedical engineering and the education process. Well acquired data allow future engineers to observe certain physical phenomena as well as learn how to correctly process and interpret the data. This dataset was designed for students to learn about Doppler phenomena and to demonstrate correctly and incorrectly acquired signals as well as the basic methods of signal processing. This...
-
AGAR a Microbial Colony Dataset for Deep Learning Detection
Publication -
RDF dataset profiling - a survey of features, methods, vocabularies and applications
Publication: The Web of Data, and in particular Linked Data, has seen tremendous growth over the past years. However, reuse and take-up of these rich data sources is often limited and focused on a few well-known and established RDF datasets. This can be partially attributed to the lack of reliable and up-to-date information about the characteristics of available datasets. While RDF datasets vary heavily with respect to the features related...
-
Generation of microbial colonies dataset with deep learning style transfer
Publication -
Description of the Dataset Hanow – Praecepta de Arte Disputandi – Transcription and Photographs
Publication: This article briefly characterises the “Hanow – Praecepta de arte disputandi – transcription and photographs” research dataset. The dataset was created based on photographs and transcriptions of the manuscript of the Latin lectures on the rules of effective discussion (the title of the manuscript: Praecepta de arte disputandi) by Michael Christoph Hanow (1695–1773), professor of Gdańsk Academic Gymnasium. The original document...
-
Crack Mouth Opening Displacement for EH36 Shipbuilding Steel Measurements Dataset
Publication: The dataset titled EH36 steel for shipbuilding (plate thickness 50 mm) – CMOD – force record, a0/W=0.6 contains a CMOD (Crack Mouth Opening Displacement) – Force record which is the base for evaluation of the fracture toughness of structural steel. Bend specimens with a Bx2B section (B = 50 mm), and relative initial crack length a0/W=0.60 were used. The test was carried out at ambient temperature in accordance with the ISO 12135...
-
Process of Medical Dataset Construction for Machine Learning-Multifield Study and Guidelines
Publication: The acquisition of high-quality data and annotations is essential for the training of efficient machine learning algorithms, while being an expensive and time-consuming process. Although the process of data processing and training and testing of machine learning models is well studied and considered in the literature, the actual procedures of obtaining data and their annotations in collaboration with physicians are in most cases...
-
Down-Sampling of Large LiDAR Dataset in the Context of Off-Road Objects Extraction
Publication: Nowadays, LiDAR (Light Detection and Ranging) is used in many fields, such as transportation. Thanks to recent technological improvements, the current generation of LiDAR mapping instruments available on the market can acquire up to millions of three-dimensional (3D) points per second. On the one hand, such improvements allowed the development of LiDAR-based systems with increased productivity, enabling the quick acquisition...
-
News that Moves the Market: DSEX-News Dataset for Forecasting DSE Using BERT
Publication: The stock market is a complex and dynamic industry that has always presented challenges for stakeholders and investors due to its unpredictable nature. This unpredictability motivates the need for more accurate prediction models. Traditional prediction models have limitations in handling the dynamic nature of the stock market. Additionally, previous methods have used less relevant data, leading to suboptimal performance. This study...
-
Dataset Characteristics and Their Impact on Offline Policy Learning of Contextual Multi-Armed Bandits
Publication: The Contextual Multi-Armed Bandits (CMAB) framework is pivotal for learning to make decisions. However, due to challenges in deploying online algorithms, there is a shift towards offline policy learning, which relies on pre-existing datasets. This study examines the relationship between the quality of these datasets and the performance of offline policy learning algorithms, specifically, Neural Greedy and NeuraLCB. Our results...
-
A European Multi Lake Survey dataset of environmental variables, phytoplankton pigments and cyanotoxins
Publication -
Towards Gender Harmony Dataset: Gender Beliefs and Gender Stereotypes in 62 Countries
Publication -
Impedance Spectra of RC Model as a Result of Testing Pulse Excitation Measurement Method Dataset
Publication: The dataset titled Impedance spectra of RC model as a result of testing pulse excitation measurement method contains the impedance spectrum of an exemplary test RC model obtained using pulse excitation. The dataset allows presentation of the accuracy of the impedance spectroscopy measuring instrument, which uses the pulse excitation method to shorten the time of the whole spectrum acquisition.
-
Dataset Related Experimental Investigation of Chess Position Evaluation Using a Deep Neural Network
Publication: The idea of training Artificial Neural Networks to evaluate chess positions has been widely explored in the last ten years. In this paper we investigated dataset impact on chess position evaluation. We created two datasets with over 1.6 million unique chess positions each. In one of those we also included randomly generated positions resulting from consideration of potentially unpredictable chess moves. Each position was evaluated...
-
A European-wide dataset to uncover adaptive traits of Listeria monocytogenes to diverse ecological niches
Publication -
Segmentation Quality Refinement in Large-Scale Medical Image Dataset with Crowd-Sourced Annotations
Publication: Deployment of different techniques of deep learning including Convolutional Neural Networks (CNN) in image classification systems has accomplished outstanding results. However, the advantages and potential impact of such a system can be completely negated if it does not reach a target accuracy. To achieve high classification accuracy with low variance in a medical image classification system, there is a need for a large size of the...
-
Measurement of the Temporal and Spatial Temperature Distribution on the Surface of PVCP Tissue Phantom Illuminated by Laser Dataset
Publication: The dataset entitled Measurement of the temporal and spatial temperature distribution on the surface of PVCP tissue phantom illuminated by laser was obtained with a laboratory set-up for characterisation of the thermal properties of optical tissue phantoms during laser irradiation. The dataset contains a single image file representing the spatial temperature distribution on the surface of a PVCP tissue phantom. This thermal image...
-
Description of the Dataset Rhetoric at School – a Selection of the Syllabi from the Academic Gymnasium in Gdańsk – Transcription and Photographs
Publication: The research dataset described in the article was based on photographs and transcription of a textual record from Latin syllabi for classes at the Gdańsk Academic Gymnasium. The syllabi concern the years 1645/1648/1652/1653. The original document is held in the collection of the Gdańsk Library of the Polish Academy of Sciences [reference number: Ma 3920 8o]. The collected research material can be used for studying the practical...
-
Using Synchronously Registered Biosignals Dataset for Teaching Basics of Medical Data Analysis – Case Study
Publication: Medical data analysis and processing rely strongly on the data quality itself. The correct data registration allows many unnecessary steps in data processing to be avoided. Moreover, it takes a certain amount of experience to acquire data that can produce replicable results. Because consistency is crucial in the teaching process, students have access to pre-recorded real data without the necessity of using additional equipment...
-
Regeneration Project of Market Places GOSPOSTRATEG – “Polanki” Market in Gdańsk-Oliwa Pilot Project Monitoring Dataset
Publication: The dataset entitled Monitoring of activities carried out as part of prototyping and implementation of the pilot project in the area of the “Polanki” market and its direct neighbourhood, in the Gdańsk-Oliwa district, step1; stage from July 2020 year contains tabular monitoring lists (quantitative and qualitative documentation report in the form of tables) of activities carried out as part of the prototyping and implementation of...
-
Identification of High-Value Dataset determinants: is there a silver bullet for efficient sustainability-oriented data-driven development?
Publication: Open Government Data (OGD) are seen as one of the trends that has the potential to benefit the economy, improve the quality, efficiency, and transparency of public administration, and change the lives of citizens, and the society as a whole facilitating efficient sustainability-oriented data-driven services. However, the quick achievement of these benefits is closely related to the “value” of the OGD, i.e., how useful, and reusable...
-
G2DC-PL+: a gridded 2 km daily climate dataset for the union of the Polish territory and the Vistula and Odra basins
Publication: G2DC-PL+, a gridded 2 km daily climate dataset for the union of the Polish territory and the Vistula and Odra basins, is an update and extension of the CHASE-PL Forcing Data – Gridded Daily Precipitation and Temperature Dataset – 5 km (CPLFD-GDPT5). The latter was the first publicly available, high-resolution climate forcing dataset in Poland, used for a range of purposes including hydrological modelling and bias correction of...
-
Dataset Relating Collective Angst, Identifications, Essentialist Continuity and Collective Action for Progressive City Policy among Gdańsk Residents
Publication: This dataset contains the individual responses of 456 residents of Gdańsk who participated in the study. The study was conducted before the second term of the presidential election in Poland in 2020. Demographic variables as well as psychological measures of angst, place attachment, identification, in-group continuity and willingness to engage in collective action were collected. We also measured the perception of the risk of...
-
Chromium FTW dataset
Open Research Data: This dataset contains the results of chromium and nutrients (N and PO4-P) removal in a floating treatment wetland microcosm experiment with two cosmopolitan species of perennials: Phragmites australis and Iris pseudacorus.
-
Bricks images dataset
Open Research Data: The set contains 200 images of various wooden bricks of various shapes and colors placed on a background (blanket) with some heart shaped patterns. Each photo is available in 300x300 and 224x224 pixels size in PNG format. Photos are divided into 10 classes – 8 types of bricks photographed from various angles + 2 additional classes (multiple bricks at...
-
DATASET: GrounDwater sAlinizaTion and leaching AsseSsmEnt Tool: a holistic approach for coastal areas
Projects: Project realized in the Department of Geotechnical and Hydraulic Engineering according to the WATER4ALL/I/38/DATASET/2024 agreement from 2024-08-06
-
AITP - AI Thermal Pedestrians Dataset
Open Research Data: AITP is a pedestrian detection dataset consisting of 9178 annotated thermal images. The training set contains 7801 images on which 15448 pedestrians were labeled. The test set has 1377 images on which 2731 objects were marked. All images are in PNG file format (120x160) captured with a FLIR Lepton Thermal Camera on the streets of Gdańsk, Poland. All pedestrians...
-
Rain Gardens GC_MS analysis dataset
Open Research Data: This dataset contains the results of sample analysis (non-target analysis: scan mode) using gas chromatography coupled with mass spectrometry GC–MS (GC-2030 NEXIS MS, Shimadzu, Japan or Thermo Scientific, Waltham, USA).
-
Rain Gardens SW quality dataset
Open Research Data: This dataset contains the results of parameters of storm water runoff and storm water quality in rain garden units. Samples were collected from 4 different rain gardens in Gdansk, Poland.
-
OntoValidate: OntoNotes 5.0 NER validation dataset
Open Research Data: The OntoValidate dataset consists of 603 randomly chosen raw texts from the original OntoNotes 5.0 dataset (3637 raw texts in total).
-
Greencoin Project - open phase application dataset
Open Research Data: This dataset captures detailed transactional records of the Greencoin project, focusing on rewarding pro-environmental behavior in the Tricity region of Poland. It includes data on user interactions such as quiz completions, challenges, and other sustainable actions, with corresponding timestamps and wallet balances. This data supports research on gamification...
-
ArchBGal32cB 441Glu mutein gene analysis dataset
Open Research Data -
Rain Gardens LC_MS/MS analysis dataset
Open Research Data: This dataset contains the results of sample analysis (target analysis with certified reference materials) using ultra-high performance liquid chromatography tandem mass spectrometry (UHPLC-MS/MS, Shimadzu, Japan).
-
A study of the alignment of information sounds in public spaces - dataset
Open Research Data: Dataset used during work on a master's thesis. Contains R scripts, the recordings used (.wav), and CSV files with results of objective and subjective analysis.
-
Rain Gardens SW particle size analysis dataset
Open Research Data: This dataset contains the results of laser diffraction particle size analysis of storm water runoff and storm water quality in rain garden units. Samples were collected from 4 different rain gardens in Gdansk, Poland.
-
WikiPrefs: human preferences dataset build from text edits
Open Research Data: The WikiPrefs dataset is a human preferences dataset for Large Language Models alignment. It was built using the EditPrefs method from historical edits of Wikipedia featured articles.
-
SegSperm - a dataset of sperm images for blurry and small object segmentation
Open Research Data: Many deep learning applications require figure-ground segmentation. The performance of segmentation models varies across modalities and acquisition settings.