Search results for: dataset quality - Bridge of Knowledge

Search

Search results for: dataset quality

Filters

total: 456
filtered: 273

clear all filters


Chosen catalog filters

  • Category

  • Year

  • Options

clear Chosen catalog filters disabled

Search results for: dataset quality

  • Reduction of measurement data before Digital Terrain Model generation vs. DTM generalisation

    Publication

    - SURVEY REVIEW - Year 2018

    Modern data acquisition technologies provide large datasets that are not always necessary in its entirety to properly accomplish the goal of the study. In addition, such datasets are often cumbersome for rational processing, and their processing is time and labour consuming. Therefore, methods that enable to reduce the size of the measurement dataset, such as the generalization of the Digital Terrain Model (DTM) or the reduction...

    Full text to download in external service

  • Using Convolutional Neural Networks for Corneal Arcus Detection Towards Familial Hypercholesterolemia Screening

    Publication

    Familial hypercholesterolemia (FH) is a highly undiagnosed disease. Among FH patients, the onset of premature coronary artery disease is 13 times higher than in the general population. Early diagnosis and treatment is essential to prevent cardiovascular diseases and their complications, and to prolong life. One of the clinical criteria of FH is the occurrence of a corneal arcus (CA) among patients, especially those under 45 years...

    Full text available to download

  • Description of the Dataset Hanow – Praecepta de Arte Disputandi – Transcription and Photographs

    Publication

    - Year 2022

    This article briefly characterises the “Hanow – Praecepta de arte disputandi – transcription and photographs” research dataset. The dataset was created based on photographs and transcriptions of the manuscript of the Latin lectures on the rules of effective discussion (the title of the manuscript: Praecepta de arte disputandi) by Michael Chris-toph Hanow (1695–1773), professor of Gdańsk Academic Gymnasium. The original document...

    Full text available to download

  • The OptD-multi method in LiDAR processing

    Publication

    - MEASUREMENT SCIENCE & TECHNOLOGY - Year 2017

    New and constantly developing technology for acquiring spatial data, such as LiDAR (light detection and ranging), is a source for large volume of data. However, such amount of data is not always needed for developing the most popular LiDAR products: digital terrain model (DTM) or digital surface model. Therefore, in many cases, the number of contained points are reduced in the pre-processing stage. The degree of reduction is determined...

    Full text to download in external service

  • Style Transfer for Detecting Vehicles with Thermal Camera

    Publication

    - Year 2019

    In this work we focus on nighttime vehicle detection for intelligent traffic monitoring from the thermal camera. To train a Convolutional Neural Network (CNN) detector we create a stylized version of COCO (Common Objects in Context) dataset using Style Transfer technique that imitates images obtained from thermal cameras. This new dataset is further used for fine-tuning of the model and as a result detection accuracy on images...

  • Crack Mouth Opening Displacement for EH36 Shipbuilding Steel Measurements

    Publication

    - Year 2022

    The dataset titled EH36 steel for shipbuilding (plate thicnkness 50mm) - CMOD - force record, a0/W = 0.6 contains CMOD (Crack Mouth Opening Displacement) - Force record which is the base for evaluation of fracture toughness of structural steel. Bend specimens witch Bx2B section (B= 50mm), and relative initial crack length a0 / W = 0.60 were used. The test was carried out at ambient temperature in accordance to ISO 12135 standard....

  • Exploratory analysis and ranking of analytical procedures for short-chain chlorinated paraffins determination in environmental solid samples

    Short-chain chlorinated paraffins are ones of the most recent chemical compounds that have been classified as persistent organic pollutants. They have various applications and are emitted to the environment. Despite the fact, that the content levels of these compounds in the environmental compartments should be monitored, there is still a lack of well-defined and validated analytical procedures, proposed or suggested by the national...

    Full text available to download

  • Crack Mouth Opening Displacement for EH36 Shipbuilding Steel Measurements Dataset

    Publication

    - Year 2022

    The dataset titled EH36 steel for shipbuilding (plate thickness 50 mm) – CMOD – force record, a0/W=0.6 contains a CMOD (Crack Mouth Opening Displacement) – Force record which is the base for evaluation of the fracture toughness of structural steel. Bend specimens with a Bx2B section (B = 50 mm), and relative initial crack length a0/W=0.60 were used. The test was carried out at ambient temperature in accordance with the ISO 12135...

    Full text available to download

  • Extending touch-less interaction with smart glasses by implementing EMG module

    In this paper we propose to use temporal muscle contraction to perform certain actions. Method: The set of muscle contractions corresponding to one of three actions including “single-click”, “double-click” “click-n-hold” and “non-action” were recorded. After recording certain amount of signals, the set of five parameters was calculated. These parameters served as an input matrix for the neural network. Two-layer feedforward neural...

    Full text to download in external service

  • Viewpoint independent shape-based object classification for video surveillance

    Publication

    A method for shape based object classification is presented.Unlike object dimension based methods it does not require any system calibration techniques. A number of 3D object models are utilized as a source of training dataset for a specified camera orientation. Usage of the 3D models allows to perform the dataset creation process semiautomatically. The background subtraction method is used for the purpose of detecting moving objects...

    Full text to download in external service

  • Long-Term GNSS Tropospheric Parameters for the Tropics (2001-2018) Derived from Selected IGS Stations

    Publication

    - Year 2022

    This paper describes dataset “Tropospheric parameters derived from selected IGS stations in the tropics for the years 2001-2018” contains GNSS-derived zenith tropospheric delay (ZTD), a posteriori corrected zenith wet delay (ZWD), and precipitable water vapour (PWV) time series. These troposphere-related data were estimated for the Jan 2001 – Dec 2018 period for 43 International GNSS Service (IGS) stations located across the global...

    Full text available to download

  • Outlier detection method by using deep neural networks

    Publication

    - Year 2017

    Detecting outliers in the data set is quite important for building effective predictive models. Consistent prediction can not be made through models created with data sets containing outliers, or robust models can not be created. In such cases, it may be possible to exclude observations that are determined to be outlier from the data set, or to assign less weight to these points of observation than to other points of observation....

    Full text to download in external service

  • Training of Deep Learning Models Using Synthetic Datasets

    Publication

    - Year 2022

    In order to solve increasingly complex problems, the complexity of Deep Neural Networks also needs to be constantly increased, and therefore training such networks requires more and more data. Unfortunately, obtaining such massive real world training data to optimize neural networks parameters is a challenging and time-consuming task. To solve this problem, we propose an easy-touse and general approach to training deep learning...

    Full text to download in external service

  • Methodology for Processing of 3D Multibeam Sonar Big Data for Comparative Navigation

    Publication

    - Remote Sensing - Year 2019

    Autonomous navigation is an important task for unmanned vehicles operating both on the surface and underwater. A sophisticated solution for autonomous non-global navigational satellite system navigation is comparative (terrain reference) navigation. We present a method for fast processing of 3D multibeam sonar data to make depth area comparable with depth areas from bathymetric electronic navigational charts as source maps during...

    Full text available to download

  • Dataset Related Experimental Investigation of Chess Position Evaluation Using a Deep Neural Network

    Publication

    The idea of training Articial Neural Networks to evaluate chess positions has been widely explored in the last ten years. In this paper we investigated dataset impact on chess position evaluation. We created two datasets with over 1.6 million unique chess positions each. In one of those we also included randomly generated positions resulting from consideration of potentially unpredictable chess moves. Each position was evaluated...

    Full text available to download

  • Searching for Solvents with an Increased Carbon Dioxide Solubility Using Multivariate Statistics

    Publication

    - MOLECULES - Year 2020

    Ionic liquids (ILs) are used in various fields of chemistry. One of them is CO2 capture, a process that is quite well described. The solubility of CO2 in ILs can be used as a model to investigate gas absorption processes. The aim is to find the relationships between the solubility of CO2 and other variables—physicochemical properties and parameters related to greenness. In this study, 12 variables are used to describe a dataset...

    Full text available to download

  • Application of the Optimum Dataset Method in Archeological Studies on Barrows

    Publication

    - Year 2018

    Light Detection and Ranging (LiDAR) became one of the technologies used in archaeological research. It allows for relatively easy detection of archaeological sites that have their own field form, e.g.: barrows, fortresses, tracts, ancient fields [1]. As a result of the scanning, the so-called point cloud is obtained, often consisting of millions of points. Such large measurement datasets are very time-consuming and labor-intensive...

    Full text to download in external service

  • High performance filtering for big datasets from Airborne Laser Scanning with CUDA technology

    Publication

    - SURVEY REVIEW - Year 2018

    There are many studies on the problems of processing big datasets provided by Airborne Laser Scanning (ALS). The processing of point clouds is often executed in stages or on the fragments of the measurement set. Therefore, solutions that enable the processing of the entire cloud at the same time in a simple, fast, efficient way are the subject of many researches. In this paper, authors propose to use General-Purpose computation...

    Full text to download in external service

  • MULTI-OBJECTIVE OPTIMIZATION PROBLEM IN THE OptD-MULTI METHOD

    Publication

    - Metrology and Measurement Systems - Year 2019

    New measurement technologies, e.g. Light Detection And Ranging (LiDAR), generate very large datasets. In many cases, it is reasonable to reduce the number of measuring points, but in such a way that the datasets after reduction satisfy specific optimization criteria. For this purpose the Optimum Dataset (OptD) method proposed in [1] and [2] can be applied. The OptD method with the use of several optimization criteria is called...

    Full text available to download

  • Standard deviation as the optimization criterion in the OptD method and its influence on the generated DTM

    Publication

    - E3S Web of Conferences - Year 2018

    Reduction of the measurement dataset is one of the current issues related to constantly developing technologies that provide large datasets, eg. laser scanning. It could seems that presence and evolution of processors computer, increase of hard drive capacity etc. is the solution for development of such large datasets. And in fact it is, however, the “lighter” datasets are easier to work with. Additionally, reduced datasets can...

    Full text available to download

  • Data on LEGO sets release dates and worldwide retail prices combined with aftermarket transaction prices in Poland between June 2018 and June 2023

    Publication

    - Data in Brief - Year 2024

    The dataset contains LEGO bricks sets item count and pricing history for AI-based set pricing prediction. The data spans the timeframe from June 2018 to June 2023. The data was obtained from three sources: Brickset.com (LEGO sets retail prices, release dates, and IDs), Lego.com official web page (ID number of each set that was released by Lego, its retail prices, the current status of the set) and promoklocki.pl web page (the retail...

    Full text available to download

  • Down-Sampling of Large LiDAR Dataset in the Context of Off-Road Objects Extraction

    Publication

    - Geosciences - Year 2020

    Nowadays, LiDAR (Light Detection and Ranging) is used in many fields, such as transportation. Thanks to the recent technological improvements, the current generation of LiDAR mapping instruments available on the market allows to acquire up to millions of three-dimensional (3D) points per second. On the one hand, such improvements allowed the development of LiDAR-based systems with increased productivity, enabling the quick acquisition...

    Full text available to download

  • Herbarium of Division of Marine Biology and Ecology as the Primary Basis for Conservation Status Assessments in the Gulf of Gdańsk

    Publication

    - Year 2022

    The dataset titled Herbarium of Division of Marine Biology and Ecology University of Gdańsk (DMBE) is a research herbarium encompassing specimens of vascular plants and algae hosted by the Laboratory of Marine Plant Ecology at the University of Gdańsk, Poland. The aim of Herbarium is to preserve marine plant and algae collections mostly from the Gulf of Gdańsk, but the herbarium also holds specimens from other parts of the world.

    Full text available to download

  • Using contextual conditional preferences for recommendation taska: a case study in the movie domain

    Publication

    - Studia Informatica Pomerania - Year 2016

    Recommendation engines aim to propose users items they are interested in by looking at the user interaction with a system. However, individual interests may be drastically influenced by the context in which decisions are taken. We present an attempt to model user interests via a set of contextual conditional preferences. We show that usage of proposed preferences gives reasonable values of the accuracy and the precision even when...

    Full text to download in external service

  • Induction of the common-sense hierarchies in lexical data

    Publication

    Unsupervised organization of a set of lexical concepts that captures common-sense knowledge inducting meaningful partitioning of data is described. Projection of data on principal components allow for dentification of clusters with wide margins, and the procedure is recursively repeated within each cluster. Application of this idea to a simple dataset describing animals created hierarchical partitioning with each clusters related...

  • Selection of Visual Descriptors for the Purpose of Multi-camera Object Re-identification

    A comparative analysis of various visual descriptors is presented in this chapter. The descriptors utilize many aspects of image data: colour, texture, gradient, and statistical moments. The descriptor list is supplemented with local features calculated in close vicinity of key points found automatically in the image. The goal of the analysis is to find descriptors that are best suited for particular task, i.e. re-identification...

    Full text to download in external service

  • Dataset Relating Collective Angst, Identifications, Essentialist Continuity and Collective Action for Progressive City Policy among Gdańsk Residents

    Publication

    - Year 2022

    This dataset contains the individual responses of 456 residents of Gdańsk who participated in the study. The study was conducted before the second term of the presidential election in Poland in 2020. Demographic variables as well as psychological measures of angst, place attachment, identification in-group continuity and willingness to engage in collective action were collected. We also measured the perception of the risk of...

    Full text available to download

  • On Bayesian Tracking and Prediction of Radar Cross Section

    We consider the problem of Bayesian tracking of radar cross section. The adopted observation model employs the gamma family, which covers all Swerling cases in a unified framework. State dynamics are modeled using a nonstationary autoregressive gamma process. The principal component of the proposed solution is a nontrivial gamma approximation, applied during the time update recursion. The superior performance of the proposed approach...

    Full text available to download

  • Minimal Sets of Lefschetz Periods for Morse-Smale Diffeomorphisms of a Connected Sum of g Real Projective Planes

    Publication

    - Year 2022

    The dataset titled Database of the minimal sets of Lefschetz periods for Morse-Smale diffeomorphisms of a connected sum of g real projective planes contains all of the values of the topological invariant called the minimal set of Lefschetz periods, computed for Morse-Smale diffeomorphisms of a non-orientable compact surface without boundary of genus g (i.e. a connected sum of g real projective planes), where g varies from 1 to...

    Full text available to download

  • Recognition of Emotions in Speech Using Convolutional Neural Networks on Different Datasets

    Artificial Neural Network (ANN) models, specifically Convolutional Neural Networks (CNN), were applied to extract emotions based on spectrograms and mel-spectrograms. This study uses spectrograms and mel-spectrograms to investigate which feature extraction method better represents emotions and how big the differences in efficiency are in this context. The conducted studies demonstrated that mel-spectrograms are a better-suited...

    Full text available to download

  • Description of the Dataset Rhetoric at School – a Selection of the Syllabi from the Academic Gymnasium in Gdańsk – Transcription and Photographs

    Publication

    - Year 2022

    The research dataset described in the article was based on photographs and transcription of a textual record from Latin syllabi for classes at the Gdańsk Academic Gymnasium. The syllabi concern the years 1645/1648/1652/1653. The original document is held in the collection of the Gdańsk Library of the Polish Academy of Sciences [reference number: Ma 3920 8o]. The collected research material can be used for studying the practical...

    Full text available to download

  • Surf Zone Currents in the Coastal Zone of the Southern Baltic Sea – a Modelling Approach

    Publication

    - Year 2022

    Nearshore currents in a multi-bar non-tidal coastal zone environment located in the Southern Baltic Sea are studied. Spatiotemporal seaward-directed jets – so-called rip currents – are an important part of the nearshore current system. In previous research, Dudkowska et al. (2020) performed an extended modelling experiment to determine the wave conditions that are conducive to the emergence of rip currents. In this paper, the...

    Full text available to download

  • On Computing Curlicues Generated by Circle Homeomorphisms

    Publication

    The dataset entitled Computing dynamical curlicues contains values of consecutive points on a curlicue generated, respectively, by rotation on the circle by different angles, the Arnold circle map (with various parameter values) and an exemplary sequence as well as corresponding diameters and Birkhoff averages of these curves. We additionally provide source codes of the Matlab programs which can be used to generate and plot the...

    Full text available to download

  • Data-driven, probabilistic model for attainable speed for ships approaching Gdańsk harbour

    Publication

    The growing demand for maritime transportation leads to increased traffic in ports. From this arises the need to observe the consequences of the specific speed ships reach when approaching seaports. However, usually the analyzed cases refer only to the statistical evaluation of the studied phenomenon or to the empirical modelling, ignoring the mutual influence of variables such as ship type, length or weather conditions. In this...

    Full text to download in external service

  • Personalized prediction of the secondary oocytes number after ovarian stimulation: A machine learning model based on clinical and genetic data

    Publication
    • K. Zieliński
    • S. Pukszta
    • M. Mickiewicz
    • M. Kotlarz
    • P. Wygocki
    • M. Zieleń
    • D. Drzewiecka
    • D. Drzyzga
    • A. Kloska
    • J. Jakóbkiewicz-Banecka

    - PLoS Computational Biology - Year 2023

    Controlled ovarian stimulation is tailored to the patient based on clinical parameters but estimating the number of retrieved metaphase II (MII) oocytes is a challenge. Here, we have developed a model that takes advantage of the patient’s genetic and clinical characteristics simultaneously for predicting the stimulation outcome. Sequence variants in reproduction-related genes identified by next-generation sequencing were matched...

    Full text available to download

  • Comprehensive Comparison of a Few Variants of Cluster Analysis as Data Mining Tool in Supporting Environmental Management

    Publication
    • A. Astel
    • K. Astel
    • S. Tsakovski
    • M. Biziuk
    • K. Obolewski
    • K. Glińska-Lewczuk
    • K. Bigus
    • I. Craciun
    • C. M. Timofte

    - Environmental Engineering and Management Journal - Year 2016

    A few variants of hierarchical cluster analysis (CA) as tool of assessment of multidimensional similarity in environmental dataset are compared. The dataset consisted of analytical results of determination of metals (Na, K, Ca, Sc, Fe, Co, Zn, As, Br, Rb, Mo, Sb, Cs, Ba, La, Ce, Sm, Hf and Th) in ambient air dried and kept alive, by the means of hydroponics, moss baskets collected in 12 locations on the area of Tricity (Poland)....

    Full text to download in external service

  • Deep neural networks for human pose estimation from a very low resolution depth image

    Publication

    The work presented in the paper is dedicated to determining and evaluating the most efficient neural network architecture applied as a multiple regression network localizing human body joints in 3D space based on a single low resolution depth image. The main challenge was to deal with a noisy and coarse representation of the human body, as observed by a depth sensor from a large distance, and to achieve high localization precision....

    Full text available to download

  • The Central European GNSS Research Network (CEGRN) dataset

    Publication
    • J. Zurutuza
    • A. Caporali
    • M. Bertocco
    • M. Ishchenko
    • O. Khoda
    • H. Steffen
    • M. Figurski
    • E. Parseliunas
    • S. Berk
    • G. Nykiel

    - Data in Brief - Year 2019

    The Central European GNSS Research Network (CEGRN) collects GNSS data since 1994 from contributors which today include 42 Institutions in 33 Countries. CEGRN returns a dataset of coordinates and velocities computed according to international standards and the most recent processing procedures and recommendations. We provide a dataset of 1229 positions and velocities resulting from 3 or more repetitions of coordinate measurements...

    Full text available to download

  • Multiple Group Membership and Collective Action Intention

    Publication

    - Year 2022

    Datasets from two studies conducted in Poland on the relation between identity fusion, group identification, multiple group membership, perceived injustice, and collective action intention. The presented studies, in the context of protests against attempts to restrict abortion law, were conducted to examine the link between belonging to multiple groups, group efficacy & identification, perceived injustice and collective...

    Full text available to download

  • High Resolution Sea Ice Floe Size and Shape Data from Knox Coast, East Antarctica

    Publication

    - Year 2022

    This dataset contains floe size distribution data from a very high resolution (pixel size: 0.3 m) optical satellite image of sea ice, acquired on 16 Feb. 2019 off the Knox Coast (East Antarctica). The image shows relatively small ice floes produced by wave-induced breakup of landfast ice between Mill Island and Bowman Island. The ice floes are characterised by a narrow size distribution and angular, polygonal shapes, typical...

    Full text available to download

  • Thermal imaging in automatic rodent’s social behaviour analysis

    Publication

    - Year 2016

    Laboratory rodent social behaviour analysis is an extremely important task for biological, medical and pharmacological researches. In this work thermal images features that facilitate analysis are presented. Methods to distinguish objects on the basis of thermal distribution are tested. Actions of grooming or biting one rodent by another - important social behaviour incidents - are clearly visible...

    Full text to download in external service

  • Detection of circulating tumor cells by means of machine learning using Smart-Seq2 sequencing

    Circulating tumor cells (CTCs) are tumor cells that separate from the solid tumor and enter the bloodstream, which can cause metastasis. Detection and enumeration of CTCs show promising potential as a predictor for prognosis in cancer patients. Furthermore, single-cells sequencing is a technique that provides genetic information from individual cells and allows to classify them precisely and reliably. Sequencing data typically...

    Full text available to download

  • Assessing the attractiveness of human face based on machine learning

    Publication

    The attractiveness of the face plays an important role in everyday life, especially in the modern world where social media and the Internet surround us. In this study, an attempt to assess the attractiveness of a face by machine learning is shown. Attractiveness is determined by three deep models whose sum of predictions is the final score. Two annotated datasets available in the literature are employed for training and testing...

    Full text available to download

  • Testing the Effect of Bathymetric Data Reduction on the Shape of the Digital Bottom Model

    Publication

    - SENSORS - Year 2023

    Depth data and the digital bottom model created from it are very important in the inland and coastal water zones studies and research. The paper undertakes the subject of bathymetric data processing using reduction methods and examines the impact of data reduction according to the resulting representations of the bottom surface in the form of numerical bottom models. Data reduction is an approach that is meant to reduce the size...

    Full text available to download

  • Deep CNN based decision support system for detection and assessing the stage of diabetic retinopathy

    Publication

    - Year 2018

    The diabetic retinopathy is a disease caused by long-standing diabetes. Lack of effective treatment can lead to vision impairment and even irreversible blindness. The disease can be diagnosed by examining digital color fundus photographs of retina. In this paper we propose deep learning approach to automated diabetic retinopathy screening. Deep convolutional neural networks (CNN) - the most popular kind of deep learning algorithms...

    Full text to download in external service

  • Pedestrian detection in low-resolution thermal images

    Over one million people die in car accidents worldwide each year. A solution that will be able to reduce situations in which pedestrian safety is at risk has been sought for a long time. One of the techniques for detecting pedestrians on the road is the use of artificial intelligence in connection with thermal imaging. The purpose of this work was to design a system to assist the safety of people and car intelligence with the use...

    Full text to download in external service

  • Convolutional Neural Networks for C. Elegans Muscle Age Classification Using Only Self-Learned Features

    Nematodes Caenorhabditis elegans (C. elegans) have been used as model organisms in a wide variety of biological studies, especially those intended to obtain a better understanding of aging and age-associated diseases. This paper focuses on automating the analysis of C. elegans imagery to classify the muscle age of nematodes based on the known and well established IICBU dataset. Unlike many modern classification methods, the proposed...

    Full text available to download

  • KEMR-Net: A Knowledge-Enhanced Mask Refinement Network for Chromosome Instance Segmentation

    Publication

    - CYBERNETICS AND SYSTEMS - Year 2024

    This article proposes a mask refinement method for chromosome instance segmentation. The proposed method exploits the knowledge representation capability of Neural Knowledge DNA (NK-DNA) to capture the semantics of the chromosome’s shape, texture, and key points, and then it uses the captured knowledge to improve the accuracy and smoothness of the masks. We validate the method’s effectiveness on our latest high-resolution chromosome...

    Full text available to download

  • Macrophytobenthos in the Puck Bay in 2010–2018 Dataset

    Publication

    - Year 2022

    The dataset titled Biomass of macrophytobenthos in the Puck Bay in 2010-2018 con-tains data on the qualitative composition and biomass of macrophytobenthos (flow-er plants and macroalgae) in samples collected in the Puck Bay area (Gulf of Gdańsk, southern Baltic Sea) at 20 stations between 2010–2018. The data was supplemented with additional information: values of measured parameters of water and sediment, e.g. tem-perature...

    Full text available to download

  • Educational Dataset of Handheld Doppler Blood Flow Recordings

    Publication

    - Year 2022

    Vital signals registration plays a significant role in biomedical engineering and education process. Well acquired data allow future engineers to observe certain physical phenomena as well learn how to correctly process and interpret the data. This dataset was designed for students to learn about Doppler phenomena and to demonstrate correctly and incorrectly acquired signals as well as the basic methods of signal processing. This...

    Full text available to download