displaying 1000 best results Help
Search results for: dataset
-
A Survey on the Datasets and Algorithms for Satellite Data Applications
PublicationRecent advances in the area of the Internet of Things shows that devices are usually resource-constrained. To enable advanced applications on these devices, it is necessary to enhance their performance by leveraging external computing resources available in the network. This work presents a study of computational platforms to increase the performance of these devices based on the Mobile Cloud Computing (MCC) paradigm. The main...
-
Training of Deep Learning Models Using Synthetic Datasets
PublicationIn order to solve increasingly complex problems, the complexity of Deep Neural Networks also needs to be constantly increased, and therefore training such networks requires more and more data. Unfortunately, obtaining such massive real world training data to optimize neural networks parameters is a challenging and time-consuming task. To solve this problem, we propose an easy-touse and general approach to training deep learning...
-
Potential Energy Curves of Diatomic Alkali Molecules Datasets
PublicationThe datasets described in this article contain potential energy curves for several diatomic systems. The data was obtained via high-performance computing using MOLPRO, a system of ab initio programs for advanced molecular electronic structure calculations. The datasets allow to model bond lengths, energy levels, spectra and time-evolution of molecular dimers for which the data are presented.
-
Neural Graph Collaborative Filtering: Analysis of Possibilities on Diverse Datasets
Publication -
Neural Graph Collaborative Filtering: Analysis of Possibilities on Diverse Datasets
PublicationThis paper continues the work by Wang et al. [17]. Its goal is to verify the robustness of the NGCF (Neural Graph Collaborative Filtering) technique by assessing its ability to generalize across different datasets. To achieve this, we first replicated the experiments conducted by Wang et al. [17] to ensure that their replication package is functional. We received sligthly better results for ndcg@20 and somewhat poorer results for...
-
Recognition of Emotions in Speech Using Convolutional Neural Networks on Different Datasets
PublicationArtificial Neural Network (ANN) models, specifically Convolutional Neural Networks (CNN), were applied to extract emotions based on spectrograms and mel-spectrograms. This study uses spectrograms and mel-spectrograms to investigate which feature extraction method better represents emotions and how big the differences in efficiency are in this context. The conducted studies demonstrated that mel-spectrograms are a better-suited...
-
A Reduction Method for Bathymetric Datasets that Preserves True Coastal Water Geodata
PublicationWater areas occupy over 70 percent of the Earth’s surface and are constantly subject to research and analysis. Often, hydrographic remote sensors are used for such research, which allow for the collection of information on the shape of the water area bottom and the objects located on it. Information about the quality and reliability of the depth data is important, especially during coastal modelling. In-shore areas are liable...
-
High resolution optical and acoustic remote sensing datasets of the Puck Lagoon
PublicationThe very shallow marine basin of Puck Lagoon in the southern Baltic Sea, on the Northern coast of Poland, hosts valuable benthic habitats and cultural heritage sites. These include, among others, protected Zostera marina meadows, one of the Baltic’s major medieval harbours, a ship graveyard, and likely other submerged features that are yet to be discovered. Prior to this project, no comprehensive high-resolution remote sensing...
-
Intelligent Audio Signal Processing − Do We Still Need Annotated Datasets?
PublicationIn this paper, intelligent audio signal processing examples are shortly described. The focus is, however, on the machine learning approach and datasets needed, especially for deep learning models. Years of intense research produced many important results in this area; however, the goal of fully intelligent signal processing, characterized by its autonomous acting, is not yet achieved. Therefore, a review of state-of-the-art concerning...
-
Application of Regression Line to Obtain Specified Number of Points in Reduced Large Datasets
PublicationModern measurement techniques like scanning technology or sonar measurements, provide large datasets, which are a reliable source of information about measured object, however such datasets are sometimes difficult to develop. Therefore, the algorithms for reducing the number of such sets are incorporated into their processing. In the reduction algorithms based on the...
-
Testing the Diagnostic Utility of Recombinant Toxoplasma Gondii Chimeric Antigens – Generated Datasets
PublicationThe datasets titled Toxoplasma gondii recombinant chimeric antigens – IgM and IgG ELISAs – mouse serum samples and Toxoplasma gondii recombinant chimeric antigens – IgG and IgM ELISAs – human serum samples contain absorbance measurements obtained during serological tests using mouse and human sera in enzyme-linked immunosorbent assay (ELISA) tests based on recombinant chimeric antigens. The datasets allows a comparison of absorbance...
-
Influence of datasets decreased by applying reduction and generation methods on Digital Terrain Models
PublicationThe number of point clouds provided by LiDAR technology can be sometimes seen as a problem in development and further processing for given purposes (e.g. Digital Terrain Model (DTM) generation). Therefore, there is still a need to reduce the obtained big datasets. Reducing can be done, inter alia, by reducing the size of the set or by generating the set. This paper presents two variants of the reduction of point clouds in order...
-
High performance filtering for big datasets from Airborne Laser Scanning with CUDA technology
PublicationThere are many studies on the problems of processing big datasets provided by Airborne Laser Scanning (ALS). The processing of point clouds is often executed in stages or on the fragments of the measurement set. Therefore, solutions that enable the processing of the entire cloud at the same time in a simple, fast, efficient way are the subject of many researches. In this paper, authors propose to use General-Purpose computation...
-
Towards High-Value Datasets Determination for Data-Driven Development: A Systematic Literature Review
PublicationOpen government data (OGD) is seen as a political and socio-economic phenomenon that promises to promote civic engagement and stimulate public sector innovations in various areas of public life. To bring the expected benefits, data must be reused and transformed into value-added products or services. This, in turn, sets another precondition for data that are expected to not only be available and comply with open data principles,...
-
Study of Multi-Class Classification Algorithms’ Performance on Highly Imbalanced Network Intrusion Datasets
PublicationThis paper is devoted to the problem of class imbalance in machine learning, focusing on the intrusion detection of rare classes in computer networks. The problem of class imbalance occurs when one class heavily outnumbers examples from the other classes. In this paper, we are particularly interested in classifiers, as pattern recognition and anomaly detection could be solved as a classification problem. As still a major part of...
-
Entropy Measures of heart rate variability for short ECG datasets in patients with congestive heart failure
PublicationWe investigated the usefulness of entropy measures calculated for short ECG series in distinguishing healthy subjects from patients with congestive heart failure (CHF). Four entropy measures were tested: Approximate Entropy (ApEn), Sample Entropy (SampEn), Fuzzy Entropy (Fuzzy En) and Permutation Entropy (PE), each computed for ECG series of 1000, 500, 250 and 100 RR intervals. We found that with a reduction of the data set length...
-
Entropy measures of heart rate variability for short ECG datasets in patients with congestive heart failure
PublicationWe investigated the usefulness of entropy measures calculated for short ECG series in distinguishing healthy subjects from patients with congestive heart failure (CHF). Four entropy measures were tested: Approximate Entropy (ApEn), Sample Entropy (SampEn), Fuzzy Entropy (FuzzyEn) and Permutation Entropy (PE), each computed for ECG series of 1000, 500, 250 and 100 RR intervals. We found that with a reduction of the data set length...
-
A Text as a Set of Research Data. A Number of Aspects of Data Acquisition and Creation of Datasets in Neo-Latin Studies
PublicationIn this paper, the authors, who specialise in part in neo-Latin studies and the his-tory of early modern education, share their experiences of collecting sources for Open Research Data sets under the Bridge of Data project. On the basis of inscription texts from St. Mary’s Church in Gdańsk, they created 29 Open Research Data sets. In turn, the text of the lectures of the Gdańsk scholar Michael Christoph Hanow, Praecepta de arte...
-
An Approach to Data Reduction for Learning from Big Datasets: Integrating Stacking, Rotation, and Agent Population Learning Techniques
Publication -
Integrating Statistical and Machine‐Learning Approach for Meta‐Analysis of Bisphenol A‐Exposure Datasets Reveals Effects on Mouse Gene Expression within Pathways of Apoptosis and Cell Survival
PublicationBisphenols are important environmental pollutants that are extensively studied due to different detrimental effects, while the molecular mechanisms behind these effects are less well understood. Like other environmental pollutants, bisphenols are being tested in various experimental models, creating large expression datasets found in open access storage. The meta‐analysis of such datasets is, however, very complicated for various...
-
Bias mitigation benchmark that includes two datasets
Open Research DataISIC-2020 is the largest skin lesion dataset divided into two classes -- benign and malignant. It contains 33126 dermoscopic images from over 2000 patients. The diagnoses were confirmed either by histopathology, expert agreement or longitudinal follow-up. The dataset was gathered by The International Skin Imaging Collaboration (ISIC) from several medical...
-
Wind speed, wind direction and solar radiation datasets; wind and solar energy resources analysis
Open Research DataDataset contain the results of wind speed, wind direction and solar radiation for wind and solar energy resources analysis performed in years 2008 and 2009. Application for efficiency and profitability of solar and wind power plants anaylsis and for energy generation forecasting algorithms design and anaysis. Datasets used in doctoral dissertations,...
-
Bibliographic data on datasets affiliated to Most Wiedzy and indexed in Data Citation Index (retrieved by Web of Science service in December 2021)
Open Research DataThe file contains the number of datasetes published by the reserchers affiliated to Most Wiedzy and indexed in Data Citation Index provided by Web of Science. The Search was performed using the name of institution in the 'address' filed or 'group author' filed . Data retrieved and published during the '5th Open Science Conference (1-3.12.2021).
-
Bibliographic data on datasets affiliated to Gdansk University of Technology and indexed in Data Citation Index (retrieved by Web of Science service in December 21)
Open Research DataThe file contains the number of datasets published by the researchers affiliated to Gdansk University of Technology and indexed in Data Citation Index provided by Web of Science. The Search was performed on the 1st of December 2021 using the name of institution in the 'address' and 'group author' field. Data retrieved and published during the 5th Open...
-
Bibliographic data on datasets affiliated to Maria Curie-Skłodowska University and indexed in Data Citation Index (retrieved by Web of Science service in February2022)
Open Research DataThe file contains the number of datasets published by the researchers affiliated to Maria Curie-Skłodowska University and indexed in Data Citation Index provided by Web of Science. The Search was performer using the name of institution in the address field or group author field. Data retrieved and published during the 5th Open Science Conference (1-3.12.2021)
-
Bibliographic data on datasets (from 2020) affiliated to Most Wiedzy and indexed in Data Citation Index (retrieved by Web of Science service in December 2021)
Open Research DataThe file contains the number of datasets published by the researchers affiliated to Most Wiedzy and indexed in Data Citation Index by Web of Science. The Search was perfprmed using the name of institution in the 'assress' filed or 'group author' field. Data retrieved and published during the 5th Open Science Conference (1-3.12.2021).
-
Bibliographic data on datasets affiliated to University of Technology and Humanities in Radom and indexed in Data Citation Inex (retrievd by Web of Science service in December 2021)
Open Research DataThe file contains the number of datasets published by the researchers affiliated to University of Technology and Humanities in Radom and indexed in Data Citation Index provided by Web of Science. The Search was performed using the name of institution in the address field or group author field. Data retrieved and published during the 5th Open Science...
-
Jacek Nikodem
PeopleDataset - tablice rejestracyjne Archiwa zabezpieczone hasłem - proszę o kontakt w celu przekazania klucza do plików.
-
Piotr Krajewski dr
PeoplePiotr Krajewski is a librarian at the Library of Gdańsk University of Technology (GUT) and a PhD student at the Medical University of Gdańsk. His research interests focus on the standardization of the e-resources usage data and Open Access publishing, especially the role of institutional repositories in the development of the OA initiative and the phenomenon of “predatory publishers”. He works at Scientific and Technical Information...
-
Photos and rendered images of LEGO bricks
PublicationThe paper describes a collection of datasets containing both LEGO brick renders and real photos. The datasets contain around 155,000 photos and nearly 1,500,000 renders. The renders aim to simulate real-life photos of LEGO bricks allowing faster creation of extensive datasets. The datasets are publicly available via the Gdansk University of Technology “Most Wiedzy” institutional repository. The source files of all tools used during...
-
Application Of Generative Adversarial Network for Data Augmentation and Multiplication to Automated Cell Segmentation of the Corneal Endothelium
PublicationConsidering the automatic segmentation of the endothelial layer, the available data of the corneal endothelium is still limited to a few datasets, typically containing an average of only about 30 images. To fill this gap, this paper introduces the use of Generative Adversarial Networks (GANs) to augment and multiply data. By using the ``Alizarine'' dataset, we train a model to generate a new synthetic dataset with over 513k images....
-
High-Resolution Wind Wave Parameters in the Area of the Gulf of Gdańsk During 21 Extreme Storms
PublicationThis dataset contains the results of wind-wave parameter modelling in the area of the Gulf of Gdańsk (Southern Baltic). For the simulations, a high resolution SWAN model was used. The dataset consists of the significant wave height, the direction of the wave approaching the shore and the wave period during 21 historical, extreme storms. The storms were selected by an automatic search over the 44-year-long significant wave height...
-
Mechanical Properties of Human Stomach Tissue
PublicationThe dataset entitled Determination of mechanical properties of human stomach tissues subjected to uniaxial stretching contains: the length of the sample as a function of the corresponding load (tensile force) and the initial values of the average width and average thickness of the sample. All tests were conducted in a self-developed tensile test machine: PG TissueTester. The dataset allows the coefficients of various models of...
-
Identyfikacja instrumentu muzycznego z nagrania fonicznego za pomocą sztucznych sieci neuronowych
PublicationCelem rozprawy jest zbadanie algorytmów do identyfikacji instrumentów występujących w sygnale polifonicznym z wykorzystaniem sztucznych sieci neuronowych. W części teoretycznej przywołano podstawy przetwarzania sygnałów fonicznych w kontekście ekstrakcji parametrów sygnałów wykorzystywanych w treningu sieci neuronowych. Dodatkowo dokonano analizy rozwoju metod uczenia maszynowego z uwzględnieniem podziału na sieci neuronowe pierwszej,...
-
Vehicle Detection and Speed Estimation Using Millimetre Wave Radar
PublicationThe dataset titled Data from 76- to 81-GHz mmWave Sensor located at S7 road contains data recorded employing an IWR1642 mmWave sensor from Texas Instruments. The data comes from two sessions lasting 24h each. The dataset provides the possibility to perform analyses related to car traffic intensity on one of the carriageways of the motorway heading to the Gdańsk metropolitan area. Based on the gathered data, it is possible to calculate...
-
Reduction of measurement data before Digital Terrain Model generation vs. DTM generalisation
PublicationModern data acquisition technologies provide large datasets that are not always necessary in its entirety to properly accomplish the goal of the study. In addition, such datasets are often cumbersome for rational processing, and their processing is time and labour consuming. Therefore, methods that enable to reduce the size of the measurement dataset, such as the generalization of the Digital Terrain Model (DTM) or the reduction...
-
Using Convolutional Neural Networks for Corneal Arcus Detection Towards Familial Hypercholesterolemia Screening
PublicationFamilial hypercholesterolemia (FH) is a highly undiagnosed disease. Among FH patients, the onset of premature coronary artery disease is 13 times higher than in the general population. Early diagnosis and treatment is essential to prevent cardiovascular diseases and their complications, and to prolong life. One of the clinical criteria of FH is the occurrence of a corneal arcus (CA) among patients, especially those under 45 years...
-
Effective Air Quality Prediction Using Reinforced Swarm Optimization and Bi-Directional Gated Recurrent Unit
PublicationIn the present scenario, air quality prediction (AQP) is a complex task due to high variability, volatility, and dynamic nature in space and time of particulates and pollutants. Recently, several nations have had poor air quality due to the high emission of particulate matter (PM2.5) that affects human health conditions, especially in urban areas. In this research, a new optimization-based regression model was implemented for effective...
-
The OptD-multi method in LiDAR processing
PublicationNew and constantly developing technology for acquiring spatial data, such as LiDAR (light detection and ranging), is a source for large volume of data. However, such amount of data is not always needed for developing the most popular LiDAR products: digital terrain model (DTM) or digital surface model. Therefore, in many cases, the number of contained points are reduced in the pre-processing stage. The degree of reduction is determined...
-
Style Transfer for Detecting Vehicles with Thermal Camera
PublicationIn this work we focus on nighttime vehicle detection for intelligent traffic monitoring from the thermal camera. To train a Convolutional Neural Network (CNN) detector we create a stylized version of COCO (Common Objects in Context) dataset using Style Transfer technique that imitates images obtained from thermal cameras. This new dataset is further used for fine-tuning of the model and as a result detection accuracy on images...
-
Crack Mouth Opening Displacement for EH36 Shipbuilding Steel Measurements
PublicationThe dataset titled EH36 steel for shipbuilding (plate thicnkness 50mm) - CMOD - force record, a0/W = 0.6 contains CMOD (Crack Mouth Opening Displacement) - Force record which is the base for evaluation of fracture toughness of structural steel. Bend specimens witch Bx2B section (B= 50mm), and relative initial crack length a0 / W = 0.60 were used. The test was carried out at ambient temperature in accordance to ISO 12135 standard....
-
Exploratory analysis and ranking of analytical procedures for short-chain chlorinated paraffins determination in environmental solid samples
PublicationShort-chain chlorinated paraffins are ones of the most recent chemical compounds that have been classified as persistent organic pollutants. They have various applications and are emitted to the environment. Despite the fact, that the content levels of these compounds in the environmental compartments should be monitored, there is still a lack of well-defined and validated analytical procedures, proposed or suggested by the national...
-
Extending touch-less interaction with smart glasses by implementing EMG module
PublicationIn this paper we propose to use temporal muscle contraction to perform certain actions. Method: The set of muscle contractions corresponding to one of three actions including “single-click”, “double-click” “click-n-hold” and “non-action” were recorded. After recording certain amount of signals, the set of five parameters was calculated. These parameters served as an input matrix for the neural network. Two-layer feedforward neural...
-
Using Isolation Forest and Alternative Data Products to Overcome Ground Truth Data Scarcity for Improved Deep Learning-based Agricultural Land Use Classification Models
PublicationHigh-quality labelled datasets represent a cornerstone in the development of deep learning models for land use classification. The high cost of data collection, the inherent errors introduced during data mapping efforts, the lack of local knowledge, and the spatial variability of the data hinder the development of accurate and spatially-transferable deep learning models in the context of agriculture. In this paper, we investigate...
-
Viewpoint independent shape-based object classification for video surveillance
PublicationA method for shape based object classification is presented.Unlike object dimension based methods it does not require any system calibration techniques. A number of 3D object models are utilized as a source of training dataset for a specified camera orientation. Usage of the 3D models allows to perform the dataset creation process semiautomatically. The background subtraction method is used for the purpose of detecting moving objects...
-
Outlier detection method by using deep neural networks
PublicationDetecting outliers in the data set is quite important for building effective predictive models. Consistent prediction can not be made through models created with data sets containing outliers, or robust models can not be created. In such cases, it may be possible to exclude observations that are determined to be outlier from the data set, or to assign less weight to these points of observation than to other points of observation....
-
Long-Term GNSS Tropospheric Parameters for the Tropics (2001-2018) Derived from Selected IGS Stations
PublicationThis paper describes dataset “Tropospheric parameters derived from selected IGS stations in the tropics for the years 2001-2018” contains GNSS-derived zenith tropospheric delay (ZTD), a posteriori corrected zenith wet delay (ZWD), and precipitable water vapour (PWV) time series. These troposphere-related data were estimated for the Jan 2001 – Dec 2018 period for 43 International GNSS Service (IGS) stations located across the global...
-
Local variability in snow concentrations of chlorinated persistent organic pollutants as a source of large uncertainty in interpreting spatial patterns at all scales
PublicationSingle point sampling, a widespread practice in snow studies in remote areas, due to logistical constraints, can present an unquantified error to the final study results. The low concentrations of studied chemicals, such as chlorinated persistent organic pollutants, contribute to the uncertainty. We conducted a field experiment in the Arctic to estimate the error stemming from differences in the composition of snow at short distances...
-
Methodology for Processing of 3D Multibeam Sonar Big Data for Comparative Navigation
PublicationAutonomous navigation is an important task for unmanned vehicles operating both on the surface and underwater. A sophisticated solution for autonomous non-global navigational satellite system navigation is comparative (terrain reference) navigation. We present a method for fast processing of 3D multibeam sonar data to make depth area comparable with depth areas from bathymetric electronic navigational charts as source maps during...
-
Searching for Solvents with an Increased Carbon Dioxide Solubility Using Multivariate Statistics
PublicationIonic liquids (ILs) are used in various fields of chemistry. One of them is CO2 capture, a process that is quite well described. The solubility of CO2 in ILs can be used as a model to investigate gas absorption processes. The aim is to find the relationships between the solubility of CO2 and other variables—physicochemical properties and parameters related to greenness. In this study, 12 variables are used to describe a dataset...