displaying 1000 best results Help
Search results for: DATASET
-
Recognition of Emotions in Speech Using Convolutional Neural Networks on Different Datasets
PublicationArtificial Neural Network (ANN) models, specifically Convolutional Neural Networks (CNN), were applied to extract emotions based on spectrograms and mel-spectrograms. This study uses spectrograms and mel-spectrograms to investigate which feature extraction method better represents emotions and how big the differences in efficiency are in this context. The conducted studies demonstrated that mel-spectrograms are a better-suited...
-
A Reduction Method for Bathymetric Datasets that Preserves True Coastal Water Geodata
PublicationWater areas occupy over 70 percent of the Earth’s surface and are constantly subject to research and analysis. Often, hydrographic remote sensors are used for such research, which allow for the collection of information on the shape of the water area bottom and the objects located on it. Information about the quality and reliability of the depth data is important, especially during coastal modelling. In-shore areas are liable...
-
High resolution optical and acoustic remote sensing datasets of the Puck Lagoon
PublicationThe very shallow marine basin of Puck Lagoon in the southern Baltic Sea, on the Northern coast of Poland, hosts valuable benthic habitats and cultural heritage sites. These include, among others, protected Zostera marina meadows, one of the Baltic’s major medieval harbours, a ship graveyard, and likely other submerged features that are yet to be discovered. Prior to this project, no comprehensive high-resolution remote sensing...
-
Intelligent Audio Signal Processing − Do We Still Need Annotated Datasets?
PublicationIn this paper, intelligent audio signal processing examples are shortly described. The focus is, however, on the machine learning approach and datasets needed, especially for deep learning models. Years of intense research produced many important results in this area; however, the goal of fully intelligent signal processing, characterized by its autonomous acting, is not yet achieved. Therefore, a review of state-of-the-art concerning...
-
Application of Regression Line to Obtain Specified Number of Points in Reduced Large Datasets
PublicationModern measurement techniques like scanning technology or sonar measurements, provide large datasets, which are a reliable source of information about measured object, however such datasets are sometimes difficult to develop. Therefore, the algorithms for reducing the number of such sets are incorporated into their processing. In the reduction algorithms based on the...
-
Influence of datasets decreased by applying reduction and generation methods on Digital Terrain Models
PublicationThe number of point clouds provided by LiDAR technology can be sometimes seen as a problem in development and further processing for given purposes (e.g. Digital Terrain Model (DTM) generation). Therefore, there is still a need to reduce the obtained big datasets. Reducing can be done, inter alia, by reducing the size of the set or by generating the set. This paper presents two variants of the reduction of point clouds in order...
-
Testing the Diagnostic Utility of Recombinant Toxoplasma Gondii Chimeric Antigens – Generated Datasets
PublicationThe datasets titled Toxoplasma gondii recombinant chimeric antigens – IgM and IgG ELISAs – mouse serum samples and Toxoplasma gondii recombinant chimeric antigens – IgG and IgM ELISAs – human serum samples contain absorbance measurements obtained during serological tests using mouse and human sera in enzyme-linked immunosorbent assay (ELISA) tests based on recombinant chimeric antigens. The datasets allows a comparison of absorbance...
-
High performance filtering for big datasets from Airborne Laser Scanning with CUDA technology
PublicationThere are many studies on the problems of processing big datasets provided by Airborne Laser Scanning (ALS). The processing of point clouds is often executed in stages or on the fragments of the measurement set. Therefore, solutions that enable the processing of the entire cloud at the same time in a simple, fast, efficient way are the subject of many researches. In this paper, authors propose to use General-Purpose computation...
-
Towards High-Value Datasets Determination for Data-Driven Development: A Systematic Literature Review
PublicationOpen government data (OGD) is seen as a political and socio-economic phenomenon that promises to promote civic engagement and stimulate public sector innovations in various areas of public life. To bring the expected benefits, data must be reused and transformed into value-added products or services. This, in turn, sets another precondition for data that are expected to not only be available and comply with open data principles,...
-
Study of Multi-Class Classification Algorithms’ Performance on Highly Imbalanced Network Intrusion Datasets
PublicationThis paper is devoted to the problem of class imbalance in machine learning, focusing on the intrusion detection of rare classes in computer networks. The problem of class imbalance occurs when one class heavily outnumbers examples from the other classes. In this paper, we are particularly interested in classifiers, as pattern recognition and anomaly detection could be solved as a classification problem. As still a major part of...
-
Entropy measures of heart rate variability for short ECG datasets in patients with congestive heart failure
PublicationWe investigated the usefulness of entropy measures calculated for short ECG series in distinguishing healthy subjects from patients with congestive heart failure (CHF). Four entropy measures were tested: Approximate Entropy (ApEn), Sample Entropy (SampEn), Fuzzy Entropy (FuzzyEn) and Permutation Entropy (PE), each computed for ECG series of 1000, 500, 250 and 100 RR intervals. We found that with a reduction of the data set length...
-
Entropy Measures of heart rate variability for short ECG datasets in patients with congestive heart failure
PublicationWe investigated the usefulness of entropy measures calculated for short ECG series in distinguishing healthy subjects from patients with congestive heart failure (CHF). Four entropy measures were tested: Approximate Entropy (ApEn), Sample Entropy (SampEn), Fuzzy Entropy (Fuzzy En) and Permutation Entropy (PE), each computed for ECG series of 1000, 500, 250 and 100 RR intervals. We found that with a reduction of the data set length...
-
A Text as a Set of Research Data. A Number of Aspects of Data Acquisition and Creation of Datasets in Neo-Latin Studies
PublicationIn this paper, the authors, who specialise in part in neo-Latin studies and the his-tory of early modern education, share their experiences of collecting sources for Open Research Data sets under the Bridge of Data project. On the basis of inscription texts from St. Mary’s Church in Gdańsk, they created 29 Open Research Data sets. In turn, the text of the lectures of the Gdańsk scholar Michael Christoph Hanow, Praecepta de arte...
-
An Approach to Data Reduction for Learning from Big Datasets: Integrating Stacking, Rotation, and Agent Population Learning Techniques
Publication -
Integrating Statistical and Machine‐Learning Approach for Meta‐Analysis of Bisphenol A‐Exposure Datasets Reveals Effects on Mouse Gene Expression within Pathways of Apoptosis and Cell Survival
PublicationBisphenols are important environmental pollutants that are extensively studied due to different detrimental effects, while the molecular mechanisms behind these effects are less well understood. Like other environmental pollutants, bisphenols are being tested in various experimental models, creating large expression datasets found in open access storage. The meta‐analysis of such datasets is, however, very complicated for various...
-
Wind speed, wind direction and solar radiation datasets; wind and solar energy resources analysis
Open Research DataDataset contain the results of wind speed, wind direction and solar radiation for wind and solar energy resources analysis performed in years 2008 and 2009. Application for efficiency and profitability of solar and wind power plants anaylsis and for energy generation forecasting algorithms design and anaysis. Datasets used in doctoral dissertations,...
-
Bibliographic data on datasets affiliated to Most Wiedzy and indexed in Data Citation Index (retrieved by Web of Science service in December 2021)
Open Research DataThe file contains the number of datasetes published by the reserchers affiliated to Most Wiedzy and indexed in Data Citation Index provided by Web of Science. The Search was performed using the name of institution in the 'address' filed or 'group author' filed . Data retrieved and published during the '5th Open Science Conference (1-3.12.2021).
-
Bibliographic data on datasets affiliated to Gdansk University of Technology and indexed in Data Citation Index (retrieved by Web of Science service in December 21)
Open Research DataThe file contains the number of datasets published by the researchers affiliated to Gdansk University of Technology and indexed in Data Citation Index provided by Web of Science. The Search was performed on the 1st of December 2021 using the name of institution in the 'address' and 'group author' field. Data retrieved and published during the 5th Open...
-
Bibliographic data on datasets affiliated to Maria Curie-Skłodowska University and indexed in Data Citation Index (retrieved by Web of Science service in February2022)
Open Research DataThe file contains the number of datasets published by the researchers affiliated to Maria Curie-Skłodowska University and indexed in Data Citation Index provided by Web of Science. The Search was performer using the name of institution in the address field or group author field. Data retrieved and published during the 5th Open Science Conference (1-3.12.2021)
-
Bibliographic data on datasets (from 2020) affiliated to Most Wiedzy and indexed in Data Citation Index (retrieved by Web of Science service in December 2021)
Open Research DataThe file contains the number of datasets published by the researchers affiliated to Most Wiedzy and indexed in Data Citation Index by Web of Science. The Search was perfprmed using the name of institution in the 'assress' filed or 'group author' field. Data retrieved and published during the 5th Open Science Conference (1-3.12.2021).
-
Bibliographic data on datasets affiliated to University of Technology and Humanities in Radom and indexed in Data Citation Inex (retrievd by Web of Science service in December 2021)
Open Research DataThe file contains the number of datasets published by the researchers affiliated to University of Technology and Humanities in Radom and indexed in Data Citation Index provided by Web of Science. The Search was performed using the name of institution in the address field or group author field. Data retrieved and published during the 5th Open Science...
-
Jacek Nikodem
PeopleDataset - tablice rejestracyjne Archiwa zabezpieczone hasłem - proszę o kontakt w celu przekazania klucza do plików.
-
Piotr Krajewski dr
PeoplePiotr Krajewski is a librarian at the Library of Gdańsk University of Technology (GUT) and a PhD student at the Medical University of Gdańsk. His research interests focus on the standardization of the e-resources usage data and Open Access publishing, especially the role of institutional repositories in the development of the OA initiative and the phenomenon of “predatory publishers”. He works at Scientific and Technical Information...
-
Photos and rendered images of LEGO bricks
PublicationThe paper describes a collection of datasets containing both LEGO brick renders and real photos. The datasets contain around 155,000 photos and nearly 1,500,000 renders. The renders aim to simulate real-life photos of LEGO bricks allowing faster creation of extensive datasets. The datasets are publicly available via the Gdansk University of Technology “Most Wiedzy” institutional repository. The source files of all tools used during...
-
High-Resolution Wind Wave Parameters in the Area of the Gulf of Gdańsk During 21 Extreme Storms
PublicationThis dataset contains the results of wind-wave parameter modelling in the area of the Gulf of Gdańsk (Southern Baltic). For the simulations, a high resolution SWAN model was used. The dataset consists of the significant wave height, the direction of the wave approaching the shore and the wave period during 21 historical, extreme storms. The storms were selected by an automatic search over the 44-year-long significant wave height...
-
Mechanical Properties of Human Stomach Tissue
PublicationThe dataset entitled Determination of mechanical properties of human stomach tissues subjected to uniaxial stretching contains: the length of the sample as a function of the corresponding load (tensile force) and the initial values of the average width and average thickness of the sample. All tests were conducted in a self-developed tensile test machine: PG TissueTester. The dataset allows the coefficients of various models of...
-
Vehicle Detection and Speed Estimation Using Millimetre Wave Radar
PublicationThe dataset titled Data from 76- to 81-GHz mmWave Sensor located at S7 road contains data recorded employing an IWR1642 mmWave sensor from Texas Instruments. The data comes from two sessions lasting 24h each. The dataset provides the possibility to perform analyses related to car traffic intensity on one of the carriageways of the motorway heading to the Gdańsk metropolitan area. Based on the gathered data, it is possible to calculate...
-
Reduction of measurement data before Digital Terrain Model generation vs. DTM generalisation
PublicationModern data acquisition technologies provide large datasets that are not always necessary in its entirety to properly accomplish the goal of the study. In addition, such datasets are often cumbersome for rational processing, and their processing is time and labour consuming. Therefore, methods that enable to reduce the size of the measurement dataset, such as the generalization of the Digital Terrain Model (DTM) or the reduction...
-
Using Convolutional Neural Networks for Corneal Arcus Detection Towards Familial Hypercholesterolemia Screening
PublicationFamilial hypercholesterolemia (FH) is a highly undiagnosed disease. Among FH patients, the onset of premature coronary artery disease is 13 times higher than in the general population. Early diagnosis and treatment is essential to prevent cardiovascular diseases and their complications, and to prolong life. One of the clinical criteria of FH is the occurrence of a corneal arcus (CA) among patients, especially those under 45 years...
-
Effective Air Quality Prediction Using Reinforced Swarm Optimization and Bi-Directional Gated Recurrent Unit
PublicationIn the present scenario, air quality prediction (AQP) is a complex task due to high variability, volatility, and dynamic nature in space and time of particulates and pollutants. Recently, several nations have had poor air quality due to the high emission of particulate matter (PM2.5) that affects human health conditions, especially in urban areas. In this research, a new optimization-based regression model was implemented for effective...
-
The OptD-multi method in LiDAR processing
PublicationNew and constantly developing technology for acquiring spatial data, such as LiDAR (light detection and ranging), is a source for large volume of data. However, such amount of data is not always needed for developing the most popular LiDAR products: digital terrain model (DTM) or digital surface model. Therefore, in many cases, the number of contained points are reduced in the pre-processing stage. The degree of reduction is determined...
-
Style Transfer for Detecting Vehicles with Thermal Camera
PublicationIn this work we focus on nighttime vehicle detection for intelligent traffic monitoring from the thermal camera. To train a Convolutional Neural Network (CNN) detector we create a stylized version of COCO (Common Objects in Context) dataset using Style Transfer technique that imitates images obtained from thermal cameras. This new dataset is further used for fine-tuning of the model and as a result detection accuracy on images...
-
Crack Mouth Opening Displacement for EH36 Shipbuilding Steel Measurements
PublicationThe dataset titled EH36 steel for shipbuilding (plate thicnkness 50mm) - CMOD - force record, a0/W = 0.6 contains CMOD (Crack Mouth Opening Displacement) - Force record which is the base for evaluation of fracture toughness of structural steel. Bend specimens witch Bx2B section (B= 50mm), and relative initial crack length a0 / W = 0.60 were used. The test was carried out at ambient temperature in accordance to ISO 12135 standard....
-
Exploratory analysis and ranking of analytical procedures for short-chain chlorinated paraffins determination in environmental solid samples
PublicationShort-chain chlorinated paraffins are ones of the most recent chemical compounds that have been classified as persistent organic pollutants. They have various applications and are emitted to the environment. Despite the fact, that the content levels of these compounds in the environmental compartments should be monitored, there is still a lack of well-defined and validated analytical procedures, proposed or suggested by the national...
-
Extending touch-less interaction with smart glasses by implementing EMG module
PublicationIn this paper we propose to use temporal muscle contraction to perform certain actions. Method: The set of muscle contractions corresponding to one of three actions including “single-click”, “double-click” “click-n-hold” and “non-action” were recorded. After recording certain amount of signals, the set of five parameters was calculated. These parameters served as an input matrix for the neural network. Two-layer feedforward neural...
-
Using Isolation Forest and Alternative Data Products to Overcome Ground Truth Data Scarcity for Improved Deep Learning-based Agricultural Land Use Classification Models
PublicationHigh-quality labelled datasets represent a cornerstone in the development of deep learning models for land use classification. The high cost of data collection, the inherent errors introduced during data mapping efforts, the lack of local knowledge, and the spatial variability of the data hinder the development of accurate and spatially-transferable deep learning models in the context of agriculture. In this paper, we investigate...
-
Viewpoint independent shape-based object classification for video surveillance
PublicationA method for shape based object classification is presented.Unlike object dimension based methods it does not require any system calibration techniques. A number of 3D object models are utilized as a source of training dataset for a specified camera orientation. Usage of the 3D models allows to perform the dataset creation process semiautomatically. The background subtraction method is used for the purpose of detecting moving objects...
-
Long-Term GNSS Tropospheric Parameters for the Tropics (2001-2018) Derived from Selected IGS Stations
PublicationThis paper describes dataset “Tropospheric parameters derived from selected IGS stations in the tropics for the years 2001-2018” contains GNSS-derived zenith tropospheric delay (ZTD), a posteriori corrected zenith wet delay (ZWD), and precipitable water vapour (PWV) time series. These troposphere-related data were estimated for the Jan 2001 – Dec 2018 period for 43 International GNSS Service (IGS) stations located across the global...
-
Outlier detection method by using deep neural networks
PublicationDetecting outliers in the data set is quite important for building effective predictive models. Consistent prediction can not be made through models created with data sets containing outliers, or robust models can not be created. In such cases, it may be possible to exclude observations that are determined to be outlier from the data set, or to assign less weight to these points of observation than to other points of observation....
-
Local variability in snow concentrations of chlorinated persistent organic pollutants as a source of large uncertainty in interpreting spatial patterns at all scales
PublicationSingle point sampling, a widespread practice in snow studies in remote areas, due to logistical constraints, can present an unquantified error to the final study results. The low concentrations of studied chemicals, such as chlorinated persistent organic pollutants, contribute to the uncertainty. We conducted a field experiment in the Arctic to estimate the error stemming from differences in the composition of snow at short distances...
-
Methodology for Processing of 3D Multibeam Sonar Big Data for Comparative Navigation
PublicationAutonomous navigation is an important task for unmanned vehicles operating both on the surface and underwater. A sophisticated solution for autonomous non-global navigational satellite system navigation is comparative (terrain reference) navigation. We present a method for fast processing of 3D multibeam sonar data to make depth area comparable with depth areas from bathymetric electronic navigational charts as source maps during...
-
Searching for Solvents with an Increased Carbon Dioxide Solubility Using Multivariate Statistics
PublicationIonic liquids (ILs) are used in various fields of chemistry. One of them is CO2 capture, a process that is quite well described. The solubility of CO2 in ILs can be used as a model to investigate gas absorption processes. The aim is to find the relationships between the solubility of CO2 and other variables—physicochemical properties and parameters related to greenness. In this study, 12 variables are used to describe a dataset...
-
Standard deviation as the optimization criterion in the OptD method and its influence on the generated DTM
PublicationReduction of the measurement dataset is one of the current issues related to constantly developing technologies that provide large datasets, eg. laser scanning. It could seems that presence and evolution of processors computer, increase of hard drive capacity etc. is the solution for development of such large datasets. And in fact it is, however, the “lighter” datasets are easier to work with. Additionally, reduced datasets can...
-
MULTI-OBJECTIVE OPTIMIZATION PROBLEM IN THE OptD-MULTI METHOD
PublicationNew measurement technologies, e.g. Light Detection And Ranging (LiDAR), generate very large datasets. In many cases, it is reasonable to reduce the number of measuring points, but in such a way that the datasets after reduction satisfy specific optimization criteria. For this purpose the Optimum Dataset (OptD) method proposed in [1] and [2] can be applied. The OptD method with the use of several optimization criteria is called...
-
Data on LEGO sets release dates and worldwide retail prices combined with aftermarket transaction prices in Poland between June 2018 and June 2023
PublicationThe dataset contains LEGO bricks sets item count and pricing history for AI-based set pricing prediction. The data spans the timeframe from June 2018 to June 2023. The data was obtained from three sources: Brickset.com (LEGO sets retail prices, release dates, and IDs), Lego.com official web page (ID number of each set that was released by Lego, its retail prices, the current status of the set) and promoklocki.pl web page (the retail...
-
Herbarium of Division of Marine Biology and Ecology as the Primary Basis for Conservation Status Assessments in the Gulf of Gdańsk
PublicationThe dataset titled Herbarium of Division of Marine Biology and Ecology University of Gdańsk (DMBE) is a research herbarium encompassing specimens of vascular plants and algae hosted by the Laboratory of Marine Plant Ecology at the University of Gdańsk, Poland. The aim of Herbarium is to preserve marine plant and algae collections mostly from the Gulf of Gdańsk, but the herbarium also holds specimens from other parts of the world.
-
Induction of the common-sense hierarchies in lexical data
PublicationUnsupervised organization of a set of lexical concepts that captures common-sense knowledge inducting meaningful partitioning of data is described. Projection of data on principal components allow for dentification of clusters with wide margins, and the procedure is recursively repeated within each cluster. Application of this idea to a simple dataset describing animals created hierarchical partitioning with each clusters related...
-
Using contextual conditional preferences for recommendation taska: a case study in the movie domain
PublicationRecommendation engines aim to propose users items they are interested in by looking at the user interaction with a system. However, individual interests may be drastically influenced by the context in which decisions are taken. We present an attempt to model user interests via a set of contextual conditional preferences. We show that usage of proposed preferences gives reasonable values of the accuracy and the precision even when...
-
Selection of Visual Descriptors for the Purpose of Multi-camera Object Re-identification
PublicationA comparative analysis of various visual descriptors is presented in this chapter. The descriptors utilize many aspects of image data: colour, texture, gradient, and statistical moments. The descriptor list is supplemented with local features calculated in close vicinity of key points found automatically in the image. The goal of the analysis is to find descriptors that are best suited for particular task, i.e. re-identification...
-
On Bayesian Tracking and Prediction of Radar Cross Section
PublicationWe consider the problem of Bayesian tracking of radar cross section. The adopted observation model employs the gamma family, which covers all Swerling cases in a unified framework. State dynamics are modeled using a nonstationary autoregressive gamma process. The principal component of the proposed solution is a nontrivial gamma approximation, applied during the time update recursion. The superior performance of the proposed approach...