Wyniki wyszukiwania dla: DATASET FEATURES, DATASET PROFILING VOCABULARIES - MOST Wiedzy

Wyszukiwarka

Wyniki wyszukiwania dla: DATASET FEATURES, DATASET PROFILING VOCABULARIES

Wyniki wyszukiwania dla: DATASET FEATURES, DATASET PROFILING VOCABULARIES

  • RDF dataset profiling - a survey of features, methods, vocabularies and applications

    Publikacja
    • M. B. Ellefi
    • B. Zohra
    • J. G. Breslin
    • E. Demidova
    • S. Dietze
    • K. Todorov
    • J. Szymański

    - Semantic Web - Rok 2018

    The Web of Data, and in particular Linked Data, has seen tremendous growth over the past years. However, reuse and take-up of these rich data sources is often limited and focused on a few well-known and established RDF datasets. This can be partially attributed to the lack of reliable and up-to-date information about the characteristics of available datasets. While RDF datasets vary heavily with respect to the features related...

  • Noise profiling for speech enhancement employing machine learning models

    Publikacja

    - Journal of the Acoustical Society of America - Rok 2022

    This paper aims to propose a noise profiling method that can be performed in near real-time based on machine learning (ML). To address challenges related to noise profiling effectively, we start with a critical review of the literature background. Then, we outline the experiment performed consisting of two parts. The first part concerns the noise recognition model built upon several baseline classifiers and noise signal features...

    Pełny tekst do pobrania w portalu

  • Applying the Lombard Effect to Speech-in-Noise Communication

    Publikacja

    - Electronics - Rok 2023

    This study explored how the Lombard effect, a natural or artificial increase in speech loudness in noisy environments, can improve speech-in-noise communication. This study consisted of several experiments that measured the impact of different types of noise on synthesizing the Lombard effect. The main steps were as follows: first, a dataset of speech samples with and without the Lombard effect was collected in a controlled setting;...

    Pełny tekst do pobrania w portalu

  • Non-Contact Temperature Measurements Dataset

    Publikacja

    - Rok 2022

    The dataset titled The influence of the distance of the pyrometer from the surface of the radiating object on the accuracy of measurements contains temperature measurements using a selection of four commercially available pyrometers (CHY 314P, TM-F03B, TFA 31.1125 and AB-8855) as a function of the measuring distance. The dataset allows a comparison of the accuracy and measuring precision of the devices, which are very important...

    Pełny tekst do pobrania w portalu

  • AITP - AI Thermal Pedestrians Dataset

    Efficient pedestrian detection is a very important task in ensuring safety within road conditions, especially after sunset. One way to achieve this goal is to use thermal imaging in conjunction with deep learning methods and an annotated dataset for models training. In this work, such a dataset has been created by capturing thermal images of pedestrians in different weather and traffic conditions. All images were manually annotated...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • The Optimum Dataset method – examples of the application

    Publikacja

    - Rok 2018

    Data reduction is a procedure to decrease the dataset in order to make their analysis more effective and easier. Reduction of the dataset is an issue that requires proper planning, so after reduction it meets all the user’s expectations. Evidently, it is better if the result is an optimal solution in terms of adopted criteria. Within reduction methods, which provide the optimal solution there is the Optimum Dataset method (OptD)...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • AC Motor Voltage and Audible Noise Dataset

    Publikacja

    - Rok 2022

    The dataset titled AC motor voltage and audible noise waveforms in ship’s electrical drive systems with frequency converters contains the voltage and sound measurement results recorded in a marine frequency controlled AC drive system. The dataset is part of research focussing on the impact of the ship’s electrical drive systems with frequency converters on vibrations and the level of audible noise on ships. The dataset allows the...

    Pełny tekst do pobrania w portalu

  • DevEmo—Software Developers’ Facial Expression Dataset

    The COVID-19 pandemic has increased the relevance of remote activities and digital tools for education, work, and other aspects of daily life. This reality has highlighted the need for emotion recognition technology to better understand the emotions of computer users and provide support in remote environments. Emotion recognition can play a critical role in improving the remote experience and ensuring that individuals are able...

    Pełny tekst do pobrania w portalu

  • Long-Term Measurement of Physiological Parameters – Child Dataset

    Publikacja

    - Rok 2022

    The dataset titled “Long-term measurement of physiological parameters – child is one dataset” of the bigger series named Long-term measurement of physiological parameters. The dataset contains physiological parameter measurements such as skin temperature and resistance, blood pulse, as well as the stress detection marker, which can have a value of 0 when there is no stress detected or 1 when stress appeared. Additionally, the dataset...

    Pełny tekst do pobrania w portalu

  • Video of LEGO Bricks on Conveyor Belt Dataset Series

    Publikacja

    - Rok 2022

    The dataset series titled Video of LEGO bricks on conveyor belt is composed of 14 datasets containing video recordings of a moving white conveyor belt. The recordings were created using a smartphone camera in Full HD resolution. The dataset allows for the preparation of data for neural network training, and building of a LEGO sorting machine that can help builders to organise their collections.

    Pełny tekst do pobrania w portalu

  • Macrophytobenthos in the Puck Bay in 2010–2018 Dataset

    Publikacja

    - Rok 2022

    The dataset titled Biomass of macrophytobenthos in the Puck Bay in 2010-2018 con-tains data on the qualitative composition and biomass of macrophytobenthos (flow-er plants and macroalgae) in samples collected in the Puck Bay area (Gulf of Gdańsk, southern Baltic Sea) at 20 stations between 2010–2018. The data was supplemented with additional information: values of measured parameters of water and sediment, e.g. tem-perature...

    Pełny tekst do pobrania w portalu

  • Application of the Optimum Dataset Method in Archeological Studies on Barrows

    Publikacja

    - Rok 2018

    Light Detection and Ranging (LiDAR) became one of the technologies used in archaeological research. It allows for relatively easy detection of archaeological sites that have their own field form, e.g.: barrows, fortresses, tracts, ancient fields [1]. As a result of the scanning, the so-called point cloud is obtained, often consisting of millions of points. Such large measurement datasets are very time-consuming and labor-intensive...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • The Central European GNSS Research Network (CEGRN) dataset

    Publikacja
    • J. Zurutuza
    • A. Caporali
    • M. Bertocco
    • M. Ishchenko
    • O. Khoda
    • H. Steffen
    • M. Figurski
    • E. Parseliunas
    • S. Berk
    • G. Nykiel

    - Data in Brief - Rok 2019

    The Central European GNSS Research Network (CEGRN) collects GNSS data since 1994 from contributors which today include 42 Institutions in 33 Countries. CEGRN returns a dataset of coordinates and velocities computed according to international standards and the most recent processing procedures and recommendations. We provide a dataset of 1229 positions and velocities resulting from 3 or more repetitions of coordinate measurements...

    Pełny tekst do pobrania w portalu

  • Educational Dataset of Handheld Doppler Blood Flow Recordings

    Publikacja

    - Rok 2022

    Vital signals registration plays a significant role in biomedical engineering and education process. Well acquired data allow future engineers to observe certain physical phenomena as well learn how to correctly process and interpret the data. This dataset was designed for students to learn about Doppler phenomena and to demonstrate correctly and incorrectly acquired signals as well as the basic methods of signal processing. This...

    Pełny tekst do pobrania w portalu

  • Description of the Dataset Hanow – Praecepta de Arte Disputandi – Transcription and Photographs

    Publikacja

    - Rok 2022

    This article briefly characterises the “Hanow – Praecepta de arte disputandi – transcription and photographs” research dataset. The dataset was created based on photographs and transcriptions of the manuscript of the Latin lectures on the rules of effective discussion (the title of the manuscript: Praecepta de arte disputandi) by Michael Chris-toph Hanow (1695–1773), professor of Gdańsk Academic Gymnasium. The original document...

    Pełny tekst do pobrania w portalu

  • Medical Image Dataset Annotation Service (MIDAS)

    Publikacja

    - Rok 2020

    MIDAS (Medical Image Dataset Annotation Service) is a custom-tailored tool for creating and managing datasets either for deep learning, as well as machine learning or any form of statistical research. The aim of the project is to provide one-fit-all platform for creating medical image datasets that could easily blend in hospital's workflow. In our work, we focus on the importance of medical data anonimization, discussing the...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Crack Mouth Opening Displacement for EH36 Shipbuilding Steel Measurements Dataset

    Publikacja

    - Rok 2022

    The dataset titled EH36 steel for shipbuilding (plate thickness 50 mm) – CMOD – force record, a0/W=0.6 contains a CMOD (Crack Mouth Opening Displacement) – Force record which is the base for evaluation of the fracture toughness of structural steel. Bend specimens with a Bx2B section (B = 50 mm), and relative initial crack length a0/W=0.60 were used. The test was carried out at ambient temperature in accordance with the ISO 12135...

    Pełny tekst do pobrania w portalu

  • Impedance Spectra of RC Model as a Result of Testing Pulse Excitation Measurement Method Dataset

    Publikacja

    - Rok 2022

    The dataset titled Impedance spectra of RC model as a result of testing pulse excitation measurement method contains the impedance spectrum of an exemplary test RC model obtained using pulse excitation. The dataset allows presentation of the accuracy of the impedance spectroscopy measuring instrument, which uses the pulse excitation method to shorten the time of the whole spectrum acquisition.

    Pełny tekst do pobrania w portalu

  • Down-Sampling of Large LiDAR Dataset in the Context of Off-Road Objects Extraction

    Publikacja

    - Geosciences - Rok 2020

    Nowadays, LiDAR (Light Detection and Ranging) is used in many fields, such as transportation. Thanks to the recent technological improvements, the current generation of LiDAR mapping instruments available on the market allows to acquire up to millions of three-dimensional (3D) points per second. On the one hand, such improvements allowed the development of LiDAR-based systems with increased productivity, enabling the quick acquisition...

    Pełny tekst do pobrania w portalu

  • Measurement of the Temporal and Spatial Temperature Distribution on the Surface of PVCP Tissue Phantom Illuminated by Laser Dataset

    Publikacja

    The dataset entitled Measurement of the temporal and spatial temperature distribution on the surface of PVCP tissue phantom illuminated by laser was obtained with a laboratory set-up for characterisation of the thermal properties of optical tissue phantoms during laser irradiation. The dataset contains a single image file representing the spatial temperature distribution on the surface of a PVCP tissue phantom. This thermal image...

    Pełny tekst do pobrania w portalu

  • Dataset Related Experimental Investigation of Chess Position Evaluation Using a Deep Neural Network

    Publikacja

    The idea of training Articial Neural Networks to evaluate chess positions has been widely explored in the last ten years. In this paper we investigated dataset impact on chess position evaluation. We created two datasets with over 1.6 million unique chess positions each. In one of those we also included randomly generated positions resulting from consideration of potentially unpredictable chess moves. Each position was evaluated...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • The molecular entities in linked data dataset

    Publikacja

    - Data in Brief - Rok 2020

    Pełny tekst do pobrania w serwisie zewnętrznym

  • G2DC-PL+: a gridded 2 km daily climate dataset for the union of the Polish territory and the Vistula and Odra basins

    Publikacja

    - Earth System Science Data - Rok 2021

    G2DC-PL+, a gridded 2 km daily climate dataset for the union of the Polish territory and the Vistula and Odra basins, is an update and extension of the CHASE-PL Forcing Data – Gridded Daily Precipitation and Temperature Dataset – 5 km (CPLFD-GDPT5). The latter was the first publicly available, high-resolution climate forcing dataset in Poland, used for a range of purposes including hydrological modelling and bias correction of...

    Pełny tekst do pobrania w portalu

  • Dataset Characteristics and Their Impact on Offline Policy Learning of Contextual Multi-Armed Bandits

    Publikacja

    The Contextual Multi-Armed Bandits (CMAB) framework is pivotal for learning to make decisions. However, due to challenges in deploying online algorithms, there is a shift towards offline policy learning, which relies on pre-existing datasets. This study examines the relationship between the quality of these datasets and the performance of offline policy learning algorithms, specifically, Neural Greedy and NeuraLCB. Our results...

    Pełny tekst do pobrania w portalu

  • Description of the Dataset Rhetoric at School – a Selection of the Syllabi from the Academic Gymnasium in Gdańsk – Transcription and Photographs

    Publikacja

    - Rok 2022

    The research dataset described in the article was based on photographs and transcription of a textual record from Latin syllabi for classes at the Gdańsk Academic Gymnasium. The syllabi concern the years 1645/1648/1652/1653. The original document is held in the collection of the Gdańsk Library of the Polish Academy of Sciences [reference number: Ma 3920 8o]. The collected research material can be used for studying the practical...

    Pełny tekst do pobrania w portalu

  • Constructing a Dataset of Speech Recordingswith Lombard Effect

    Publikacja

    - Rok 2020

    Thepurpose of therecordings was to create a speech corpus based on the ISLEdataset, extended with video and Lombard speech. Selected from a set of 165sentences, 10, evaluatedas having thehighest possibility to occur in the context ofthe Lombard effect,were repeated in the presence of the so-called babble speech to obtain Lombard speech features. Altogether,15speakers were recorded, and speech parameterswere...

  • Using Synchronously Registered Biosignals Dataset for Teaching Basics of Medical Data Analysis – Case Study

    Publikacja

    - Rok 2022

    Medical data analysis and processing strongly relies on the data quality itself. The correct data registration allows many unnecessary steps in data processing to be avoided. Moreover, it takes a certain amount of experience to acquire data that can produce replicable results. Because consistency is crucial in the teaching process, students have access to pre-recorded real data without the necessity of using additional equipment...

    Pełny tekst do pobrania w portalu

  • AGAR a Microbial Colony Dataset for Deep Learning Detection

    Publikacja
    • S. Majchrowska
    • J. Pawlowski
    • G. Gula
    • T. Bonus
    • A. Hanas
    • A. Loch
    • A. Pawlak
    • J. Roszkowiak
    • T. Golan
    • Z. Drulis-Kawa

    - Rok 2021

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Regeneration Project of Market Places GOSPOSTRATEG – “Polanki” Market in Gdańsk-Oliwa Pilot Project Monitoring Dataset

    Publikacja

    - Rok 2022

    The dataset entitled Monitoring of activities carried out as part of prototyping and implementation of the pilot project in the area of the “Polanki” market and its direct neighbourhood, in the Gdańsk-Oliwa district, step1; stage from July 2020 year contains tabular monitoring lists (quantitative and qualitative documentation report in the form of tables) of activities carried out as part of the prototyping and implementation of...

    Pełny tekst do pobrania w portalu

  • Dataset Relating Collective Angst, Identifications, Essentialist Continuity and Collective Action for Progressive City Policy among Gdańsk Residents

    Publikacja

    - Rok 2022

    This dataset contains the individual responses of 456 residents of Gdańsk who participated in the study. The study was conducted before the second term of the presidential election in Poland in 2020. Demographic variables as well as psychological measures of angst, place attachment, identification in-group continuity and willingness to engage in collective action were collected. We also measured the perception of the risk of...

    Pełny tekst do pobrania w portalu

  • Generation of microbial colonies dataset with deep learning style transfer

    Publikacja

    - Scientific Reports - Rok 2022

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Process of Medical Dataset Construction for Machine Learning-Multifield Study and Guidelines

    Publikacja

    The acquisition of high-quality data and annotations is essential for the training of efficient machine learning algorithms, while being an expensive and time-consuming process. Although the process of data processing and training and testing of machine learning models is well studied and considered in the literature, the actual procedures of obtaining data and their annotations in collaboration with physicians are in most cases...

  • A European Multi Lake Survey dataset of environmental variables, phytoplankton pigments and cyanotoxins

    Publikacja
    • E. Mantzouki
    • J. Campbell
    • E. van
    • P. Visser
    • I. Konstantinou
    • M. Antoniou
    • G. Giuliani
    • D. Machado-Vieira
    • A. Gurjão
    • D. Maronić... i 196 innych

    - Scientific Data - Rok 2018

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Segmentation Quality Refinement in Large-Scale Medical Image Dataset with Crowd-Sourced Annotations

    Publikacja

    Deployment of different techniques of deep learning including Convolutional Neural Networks (CNN) in image classification systems has accomplished outstanding results. However, the advantages and potential impact of such a system can be completely negated if it does not reach a target accuracy. To achieve high classification accuracy with low variance in medical image classification system, there is needed the large size of the...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Identification of High-Value Dataset determinants: is there a silver bullet for efficient sustainability-oriented data-driven development?

    Publikacja

    - Rok 2023

    Open Government Data (OGD) are seen as one of the trends that has the potential to benefit the economy, improve the quality, efficiency, and transparency of public administration, and change the lives of citizens, and the society as a whole facilitating efficient sustainability-oriented data-driven services. However, the quick achievement of these benefits is closely related to the “value” of the OGD, i.e., how useful, and reusable...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Jacek Nikodem

    Osoby

    Dataset - tablice rejestracyjne Archiwa zabezpieczone hasłem - proszę o kontakt w celu przekazania klucza do plików.

  • Chromium FTW dataset

    Dane Badawcze

    This dataset contains the results of chromium and nutrients (N and PO4-P) removal in floating treatment wetland microcosm experiment with two cosmopolitan species of parennials: Phragmites australis and Iris pseudacorus.

  • AITP - AI Thermal Pedestrians Dataset

    Dane Badawcze
    open access
    • A. Górska
    • P. Guzal
    • I. Namiotko
    • A. Wędołowska
    • M. Włoszczyńska
    • J. Rumiński

    AITP is a pedestrian detection dataset consisting of 9178 annotated thermal images. The training set contains 7801 images on which15448 pedestrians were labeled.  The test set has 1377 images on which 2731 objects were marked. All images are in PNG file format (120x160) captured with FLIR Lepton Thermal Camera on the streets of Gdańsk, Poland. All pedestrians...

  • ArchBGal32cB 441Glu mutein gene analysis dataset

    Dane Badawcze
    open access

     

  • Rain Gardens GC_MS analysis dataset

    Dane Badawcze
    open access

    This dataset contains the results of samples analysis (no-target analysis: scan mode) using gas chromatography coupled with mass spectrometry GC–MS (GC-2030 NEXIS MS, Shimadzu, Japan or Thermo Scientific, Waltham, USA).

  • Rain Gardens SW quality dataset

    Dane Badawcze
    open access

    This dataset contains the results of parameters of storm water runoff and storm water quality in rain garden units. Samples were collected from 4 different rain gardens in Gdansk, Poland.

  • Rain Gardens LC_MS/MS analysis dataset

    Dane Badawcze
    open access

    This dataset contains the results of samples analysis (target analysis with certified reference materials) using ultra-high performance liquid chromatography tandem mass spectrometry (UHPLC-MS/MS, Shimadzu, Japan).

  • SESNED: Dataset for Event-Based Non-Intrusive Load Monitoring Research

    Dane Badawcze
    wersja 1.1 open access
    • B. Gawin
    • R. Małkowski
    • K. Główczewski
    • M. Olszewski
    • P. Tomasik

    Sescom NILM Energy Dataset (SESNED ) description

  • Bricks images dataset

    Dane Badawcze
    open access

    The set contains 200 images of various wooden bricks of various shapes and colors placed on a background (blanket) with some heart shaped patterns. Each photo is available in 300x300 and 224x224 pixels size in PNG format. Photos are divided in 10 classes – 8 types of bricks photographed form various angles + 2 additional classes (multiple bricks at...

  • SegSperm - a dataset of sperm images for blurry and small object segmentation

    Dane Badawcze

    Many deep learning applications require figure-ground segmentation. The performance of segmentation models varies across modalities and acquisition settings.

  • Rain Gardens SW particle size analysis dataset

    Dane Badawcze
    open access

    This dataset contains the results of laser diffraction particle size analysis of storm water runoff and storm water quality in rain garden units. Samples were collected from 4 different rain gardens in Gdansk, Poland.

  • Vident-lab: a dataset for multi-task video processing of phantom dental scenes

    We introduce a new, asymmetrically annotated dataset of natural teeth in phantom scenes for multi-task video processing: restoration, teeth segmentation, and inter-frame homography estimation. Pairs of frames were acquired with a beam splitter. The dataset constitutes a low-quality frame, its high-quality counterpart, a teeth segmentation mask, and...

  • Rust QA: question answering dataset for "The Rust Programming Language" in SQuAD 2.0 format

    Dane Badawcze
    open access
    • S. Olewniczak
    • M. Maciszka
    • K. Paluszewski
    • G. Pozorski
    • W. Rosenthal
    • Ł. Zaleski

    Rust QA is a dataset for training and evaluating QA systems. The dataset consists of 1068 questions to "The Rust Programming Language" book (https://doc.rust-lang.org/stable/book/) with the answers provided as text spans from the book. The dataset is released in SQuAD 2.0 format.

  • Piotr Krajewski dr

    Piotr Krajewski pracuje jako starszy bibliotekarz w Bibliotece Politechniki Gdańskiej. Jako pracownik Sekcji Informacji Naukowo-Technicznej skupia się przede wszystkim na zagadnieniach związanych z ruchem Open Access oraz rolą repozytoriów instytucjonalnych w jego rozwoju. Jest także autorem artykułów poruszających kwestie standaryzacji statystyk wykorzystania zasobów elektronicznych jak również problematykę „drapieżnych wydawców”....

  • Elgold: gold standard, multi-genre dataset for named entity recognition and linking

    Dane Badawcze

    The dataset contains 276 multi-genre texts with marked named entities, which are linked to corresponding Wikipedia articles if available. Each entity was manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.