Filtry
wszystkich: 8567
wybranych: 6168
-
Katalog
- Publikacje 6168 wyników po odfiltrowaniu
- Czasopisma 56 wyników po odfiltrowaniu
- Konferencje 24 wyników po odfiltrowaniu
- Osoby 91 wyników po odfiltrowaniu
- Projekty 7 wyników po odfiltrowaniu
- Kursy Online 48 wyników po odfiltrowaniu
- Wydarzenia 3 wyników po odfiltrowaniu
- Dane Badawcze 2170 wyników po odfiltrowaniu
Filtry wybranego katalogu
wyświetlamy 1000 najlepszych wyników Pomoc
Wyniki wyszukiwania dla: HIGH-VALUE DATASETS
-
Towards High-Value Datasets Determination for Data-Driven Development: A Systematic Literature Review
PublikacjaOpen government data (OGD) is seen as a political and socio-economic phenomenon that promises to promote civic engagement and stimulate public sector innovations in various areas of public life. To bring the expected benefits, data must be reused and transformed into value-added products or services. This, in turn, sets another precondition for data that are expected to not only be available and comply with open data principles,...
-
High resolution optical and acoustic remote sensing datasets of the Puck Lagoon
PublikacjaThe very shallow marine basin of Puck Lagoon in the southern Baltic Sea, on the Northern coast of Poland, hosts valuable benthic habitats and cultural heritage sites. These include, among others, protected Zostera marina meadows, one of the Baltic’s major medieval harbours, a ship graveyard, and likely other submerged features that are yet to be discovered. Prior to this project, no comprehensive high-resolution remote sensing...
-
High performance filtering for big datasets from Airborne Laser Scanning with CUDA technology
PublikacjaThere are many studies on the problems of processing big datasets provided by Airborne Laser Scanning (ALS). The processing of point clouds is often executed in stages or on the fragments of the measurement set. Therefore, solutions that enable the processing of the entire cloud at the same time in a simple, fast, efficient way are the subject of many researches. In this paper, authors propose to use General-Purpose computation...
-
Silybum marianum glycerol extraction for the preparation of high-value anti-ageing extracts
Publikacja -
Effect of high added-value components of acid whey on the nutritional and physiological indices of rats
Publikacja -
Identification of High-Value Dataset determinants: is there a silver bullet for efficient sustainability-oriented data-driven development?
PublikacjaOpen Government Data (OGD) are seen as one of the trends that has the potential to benefit the economy, improve the quality, efficiency, and transparency of public administration, and change the lives of citizens, and the society as a whole facilitating efficient sustainability-oriented data-driven services. However, the quick achievement of these benefits is closely related to the “value” of the OGD, i.e., how useful, and reusable...
-
Ammonium Enhances Food Waste Fermentation to High-Value Optically Active l-Lactic acid
Publikacja -
New insights into the role of lattice oxygen in the catalytic carbonization of polypropylene into high value-added carbon nanomaterials
Publikacja -
Activated Carbon Modification towards Efficient Catalyst for High Value-Added Products Synthesis from Alpha-Pinene
Publikacja -
Correction: New insights into the role of lattice oxygen in the catalytic carbonization of polypropylene into high value-added carbon nanomaterials
Publikacja -
Potential Energy Curves of Diatomic Alkali Molecules Datasets
PublikacjaThe datasets described in this article contain potential energy curves for several diatomic systems. The data was obtained via high-performance computing using MOLPRO, a system of ab initio programs for advanced molecular electronic structure calculations. The datasets allow to model bond lengths, energy levels, spectra and time-evolution of molecular dimers for which the data are presented.
-
Non-invasive investigation of a submerged medieval harbour, a case study from Puck Lagoon
PublikacjaThis study presents an innovative approach to underwater archaeological prospection using non-invasive methods of seabed exploration. The research focuses on the Puck medieval harbour, a cultural heritage site, and utilises acoustic and optical underwater remote-sensing technology. The primary objectives include optimising the use of Airborne Laser Bathymetry in underwater archaeology, enhancing the filtration process for mapping...
-
Distribution and extent of benthic habitats in Puck Bay (Gulf of Gdańsk, southern Baltic Sea)
PublikacjaThe majority of the southern Baltic Sea seabed encompasses homogenous soft-bottom sediments of limited productivity and low biological diversity, but shallow productive areas in the coastal zone such as wetlands, vegetated lagoons and sheltered bays show a high variety of benthic habitat types offering favourable biotopic conditions for benthic fauna. Within Polish marine areas, semi-enclosed Puck Bay (the western part of the...
-
Neural Graph Collaborative Filtering: Analysis of Possibilities on Diverse Datasets
PublikacjaThis paper continues the work by Wang et al. [17]. Its goal is to verify the robustness of the NGCF (Neural Graph Collaborative Filtering) technique by assessing its ability to generalize across different datasets. To achieve this, we first replicated the experiments conducted by Wang et al. [17] to ensure that their replication package is functional. We received sligthly better results for ndcg@20 and somewhat poorer results for...
-
Application of Regression Line to Obtain Specified Number of Points in Reduced Large Datasets
PublikacjaModern measurement techniques like scanning technology or sonar measurements, provide large datasets, which are a reliable source of information about measured object, however such datasets are sometimes difficult to develop. Therefore, the algorithms for reducing the number of such sets are incorporated into their processing. In the reduction algorithms based on the...
-
Intelligent Audio Signal Processing − Do We Still Need Annotated Datasets?
PublikacjaIn this paper, intelligent audio signal processing examples are shortly described. The focus is, however, on the machine learning approach and datasets needed, especially for deep learning models. Years of intense research produced many important results in this area; however, the goal of fully intelligent signal processing, characterized by its autonomous acting, is not yet achieved. Therefore, a review of state-of-the-art concerning...
-
Integrating Statistical and Machine‐Learning Approach for Meta‐Analysis of Bisphenol A‐Exposure Datasets Reveals Effects on Mouse Gene Expression within Pathways of Apoptosis and Cell Survival
PublikacjaBisphenols are important environmental pollutants that are extensively studied due to different detrimental effects, while the molecular mechanisms behind these effects are less well understood. Like other environmental pollutants, bisphenols are being tested in various experimental models, creating large expression datasets found in open access storage. The meta‐analysis of such datasets is, however, very complicated for various...
-
Testing the Diagnostic Utility of Recombinant Toxoplasma Gondii Chimeric Antigens – Generated Datasets
PublikacjaThe datasets titled Toxoplasma gondii recombinant chimeric antigens – IgM and IgG ELISAs – mouse serum samples and Toxoplasma gondii recombinant chimeric antigens – IgG and IgM ELISAs – human serum samples contain absorbance measurements obtained during serological tests using mouse and human sera in enzyme-linked immunosorbent assay (ELISA) tests based on recombinant chimeric antigens. The datasets allows a comparison of absorbance...
-
Training of Deep Learning Models Using Synthetic Datasets
PublikacjaIn order to solve increasingly complex problems, the complexity of Deep Neural Networks also needs to be constantly increased, and therefore training such networks requires more and more data. Unfortunately, obtaining such massive real world training data to optimize neural networks parameters is a challenging and time-consuming task. To solve this problem, we propose an easy-touse and general approach to training deep learning...
-
Recognition of Emotions in Speech Using Convolutional Neural Networks on Different Datasets
PublikacjaArtificial Neural Network (ANN) models, specifically Convolutional Neural Networks (CNN), were applied to extract emotions based on spectrograms and mel-spectrograms. This study uses spectrograms and mel-spectrograms to investigate which feature extraction method better represents emotions and how big the differences in efficiency are in this context. The conducted studies demonstrated that mel-spectrograms are a better-suited...
-
Mobilenet-V2 Enhanced Parkinson's Disease Prediction with Hybrid Data Integration
PublikacjaThis study investigates the role of deep learning models, particularly MobileNet-v2, in Parkinson's Disease (PD) detection through handwriting spiral analysis. Handwriting difficulties often signal early signs of PD, necessitating early detection tools due to potential impacts on patients' work capacities. The study utilizes a three-fold approach, including data augmentation, algorithm development for simulated PD image datasets,...
-
Selecting Features with SVM
PublikacjaA common problem with feature selection is to establish how many features should be retained at least so that important information is not lost. We describe a method for choosing this number that makes use of Support Vector Machines. The method is based on controlling an angle by which the decision hyperplane is tilt due to feature selection. Experiments were performed on three text datasets generated from a Wikipedia dump. Amount...
-
Data Acquisition and Processing for GeoAI Models to Support Sustainable Agricultural Practices
PublikacjaThere are growing opportunities to leverage new technologies and data sources to address global problems related to sustainability, climate change, and biodiversity loss. The emerging discipline of GeoAI resulting from the convergence of AI and Geospatial science (Geo-AI) is enabling the possibility to harness the increasingly available open Earth Observation data collected from different constellations of satellites and sensors...
-
Influence of datasets decreased by applying reduction and generation methods on Digital Terrain Models
PublikacjaThe number of point clouds provided by LiDAR technology can be sometimes seen as a problem in development and further processing for given purposes (e.g. Digital Terrain Model (DTM) generation). Therefore, there is still a need to reduce the obtained big datasets. Reducing can be done, inter alia, by reducing the size of the set or by generating the set. This paper presents two variants of the reduction of point clouds in order...
-
Study of Multi-Class Classification Algorithms’ Performance on Highly Imbalanced Network Intrusion Datasets
PublikacjaThis paper is devoted to the problem of class imbalance in machine learning, focusing on the intrusion detection of rare classes in computer networks. The problem of class imbalance occurs when one class heavily outnumbers examples from the other classes. In this paper, we are particularly interested in classifiers, as pattern recognition and anomaly detection could be solved as a classification problem. As still a major part of...
-
Deep learning-based waste detection in natural and urban environments
PublikacjaWaste pollution is one of the most significant environmental issues in the modern world. The importance of recycling is well known, both for economic and ecological reasons, and the industry demands high efficiency. Current studies towards automatic waste detection are hardly comparable due to the lack of benchmarks and widely accepted standards regarding the used metrics and data. Those problems are addressed in this article by...
-
Bi-GRU-APSO: Bi-Directional Gated Recurrent Unit with Adaptive Particle Swarm Optimization Algorithm for Sales Forecasting in Multi-Channel Retail
PublikacjaIn the present scenario, retail sales forecasting has a great significance in E-commerce companies. The precise retail sales forecasting enhances the business decision making, storage management, and product sales. Inaccurate retail sales forecasting can decrease customer satisfaction, inventory shortages, product backlog, and unsatisfied customer demands. In order to obtain a better retail sales forecasting, deep learning models...
-
Photos and rendered images of LEGO bricks
PublikacjaThe paper describes a collection of datasets containing both LEGO brick renders and real photos. The datasets contain around 155,000 photos and nearly 1,500,000 renders. The renders aim to simulate real-life photos of LEGO bricks allowing faster creation of extensive datasets. The datasets are publicly available via the Gdansk University of Technology “Most Wiedzy” institutional repository. The source files of all tools used during...
-
Simulations of the Derecho Event in Poland of 11th August 2017 Using WRF Model
PublikacjaThis series contains datasets related to the forecasting of a severe weather event, a derecho, in Poland on 11 August 2017. The simulations were conducted using the Weather Research and Forecasting (WRF) model version 4.2.1 with different initial and boundary conditions of the pressure and model levels derived from 5 global models: Global Forecast System (GFS), Global Data Assimilation System (GDAS), European Centre for Medium-Range...
-
Application of Web-GIS and Cloud Computing to Automatic Satellite Image Correction
PublikacjaRadiometric calibration of satellite imagery requires coupling of atmospheric and topographic parameters, which constitutes serious computational problems in particular in complex geographical terrain. Successful application of topographic normalization algorithms for calibration purposes requires integration of several types of high-resolution geographic datasets and their processing in a common context. This paper presents the...
-
Standard deviation as the optimization criterion in the OptD method and its influence on the generated DTM
PublikacjaReduction of the measurement dataset is one of the current issues related to constantly developing technologies that provide large datasets, eg. laser scanning. It could seems that presence and evolution of processors computer, increase of hard drive capacity etc. is the solution for development of such large datasets. And in fact it is, however, the “lighter” datasets are easier to work with. Additionally, reduced datasets can...
-
Machine Learning and Deep Learning Methods for Fast and Accurate Assessment of Transthoracic Echocardiogram Image Quality
PublikacjaHigh-quality echocardiogram images are the cornerstone of accurate and reliable measurements of the heart. Therefore, this study aimed to develop, validate and compare machine learning and deep learning algorithms for accurate and automated assessment of transthoracic echocardiogram image quality. In total, 4090 single-frame two-dimensional transthoracic echocardiogram...
-
Dataset Characteristics and Their Impact on Offline Policy Learning of Contextual Multi-Armed Bandits
PublikacjaThe Contextual Multi-Armed Bandits (CMAB) framework is pivotal for learning to make decisions. However, due to challenges in deploying online algorithms, there is a shift towards offline policy learning, which relies on pre-existing datasets. This study examines the relationship between the quality of these datasets and the performance of offline policy learning algorithms, specifically, Neural Greedy and NeuraLCB. Our results...
-
Ontological Model for Contextual Data Defining Time Series for Emotion Recognition and Analysis
PublikacjaOne of the major challenges facing the field of Affective Computing is the reusability of datasets. Existing affective-related datasets are not consistent with each other, they store a variety of information in different forms, different formats, and the terms used to describe them are not unified. This paper proposes a new ontology, ROAD, as a solution to this problem, by formally describing the datasets and unifying the terms...
-
A new multi-process collaborative architecture for time series classification
PublikacjaTime series classification (TSC) is the problem of categorizing time series data by using machine learning techniques. Its applications vary from cybersecurity and health care to remote sensing and human activity recognition. In this paper, we propose a novel multi-process collaborative architecture for TSC. The propositioned method amalgamates multi-head convolutional neural networks and capsule mechanism. In addition to the discovery...
-
X-ray Photoelectron Spectroscopy of Carboxylic Acids as Corrosion Inhibitors of Aluminium Alloys
PublikacjaThe datasets, titled X-ray Photoelectron Spectroscopy studies of citric acid adsorption on aluminium alloy 5754 in alkaline media and X-ray Photoelectron Spectroscopy studies of various carboxylic acids adsorption on aluminium alloys in alkaline media, contain XPS studies of the corrosion inhibitory action of selected dicarboxylic acids towards commercially available aluminium alloy 5754 in alkaline media at pH=11. These datasets...
-
Sampling-based novel heterogeneous multi-layer stacking ensemble method for telecom customer churn prediction
PublikacjaIn recent times, customer churn has become one of the most significant issues in business-oriented sectors with telecommunication being no exception. Maintaining current customers is particularly valuable due to the high degree of rivalry among telecommunication companies and the costs of acquiring new ones. The early prediction of churned customers may help telecommunication companies to identify the causes of churn and design...
-
OOA-modified Bi-LSTM network: An effective intrusion detection framework for IoT systems
PublikacjaCurrently, the Internet of Things (IoT) generates a huge amount of traffic data in communication and information technology. The diversification and integration of IoT applications and terminals make IoT vulnerable to intrusion attacks. Therefore, it is necessary to develop an efficient Intrusion Detection System (IDS) that guarantees the reliability, integrity, and security of IoT systems. The detection of intrusion is considered...
-
Rating mathematical models for first-pass of tracer in pCT lung studies
PublikacjaThis paper presents a comparison of model based on the Gauss function and the most commonly used Gamma-variate model in perfusion computed tomography (pCT) lung studies. It also verifies whether used model affects value of blood volume parameter. Three mean concentration-time curves were created from actual pCT measurements: arterial input function, blood vessels in lungs and lung parenchyma. On the basis of these mean curves we...
-
A review of explainable fashion compatibility modeling methods
PublikacjaThe paper reviews methods used in the fashion compatibility recommendation domain. We select methods based on reproducibility, explainability, and novelty aspects and then organize them chronologically and thematically. We presented general characteristics of publicly available datasets that are related to the fashion compatibility recommendation task. Finally, we analyzed the representation bias of datasets, fashion-based algorithms’...
-
Parallel Computations of Text Similarities for Categorization Task
PublikacjaIn this chapter we describe the approach to parallel implementation of similarities in high dimensional spaces. The similarities computation have been used for textual data categorization. A test datasets we create from Wikipedia articles that with their hyper references formed a graph used in our experiments. The similarities based on Euclidean distance and Cosine measure have been used to process the data using k-means algorithm....
-
Using Isolation Forest and Alternative Data Products to Overcome Ground Truth Data Scarcity for Improved Deep Learning-based Agricultural Land Use Classification Models
PublikacjaHigh-quality labelled datasets represent a cornerstone in the development of deep learning models for land use classification. The high cost of data collection, the inherent errors introduced during data mapping efforts, the lack of local knowledge, and the spatial variability of the data hinder the development of accurate and spatially-transferable deep learning models in the context of agriculture. In this paper, we investigate...
-
Satellite Image Classification Using a Hierarchical Ensemble Learning and Correlation Coefficient-Based Gravitational Search Algorithm
PublikacjaSatellite image classification is widely used in various real-time applications, such as the military, geospatial surveys, surveillance and environmental monitoring. Therefore, the effective classification of satellite images is required to improve classification accuracy. In this paper, the combination of Hierarchical Framework and Ensemble Learning (HFEL) and optimal feature selection is proposed for the precise identification...
-
Application of the Optimum Dataset Method in Archeological Studies on Barrows
PublikacjaLight Detection and Ranging (LiDAR) became one of the technologies used in archaeological research. It allows for relatively easy detection of archaeological sites that have their own field form, e.g.: barrows, fortresses, tracts, ancient fields [1]. As a result of the scanning, the so-called point cloud is obtained, often consisting of millions of points. Such large measurement datasets are very time-consuming and labor-intensive...
-
Reduction of measurement data before Digital Terrain Model generation vs. DTM generalisation
PublikacjaModern data acquisition technologies provide large datasets that are not always necessary in its entirety to properly accomplish the goal of the study. In addition, such datasets are often cumbersome for rational processing, and their processing is time and labour consuming. Therefore, methods that enable to reduce the size of the measurement dataset, such as the generalization of the Digital Terrain Model (DTM) or the reduction...
-
Towards Effective Processing of Large Text Collections
PublikacjaIn the article we describe the approach to parallelimplementation of elementary operations for textual data categorization.In the experiments we evaluate parallel computations ofsimilarity matrices and k-means algorithm. The test datasets havebeen prepared as graphs created from Wikipedia articles relatedwith links. When we create the clustering data packages, wecompute pairs of eigenvectors and eigenvalues for visualizationsof...
-
Smart Approach for Glioma Segmentation in Magnetic Resonance Imaging using Modified Convolutional Network Architecture (U-NET)
PublikacjaSegmentation of a brain tumor from magnetic resonance multimodal images is a challenging task in the field of medical imaging. The vast diversity in potential target regions, appearance and multifarious intensity threshold levels of various tumor types are few of the major factors that affect segmentation results. An accurate diagnosis and its treatment demand strict delineation of the tumor affected tissues. Herein, we focus on...
-
From Scores to Predictions in Multi-Label Classification: Neural Thresholding Strategies
PublikacjaIn this paper, we propose a novel approach for obtaining predictions from per-class scores to improve the accuracy of multi-label classification systems. In a multi-label classification task, the expected output is a set of predicted labels per each testing sample. Typically, these predictions are calculated by implicit or explicit thresholding of per-class real-valued scores: classes with scores exceeding a given threshold value...
-
Machine-learning methods for estimating compressive strength of high-performance alkali-activated concrete
PublikacjaHigh-performance alkali-activated concrete (HP-AAC) is acknowledged as a cementless and environmentally friendly material. It has recently received a substantial amount of interest not only due to the potential it has for being used instead of ordinary concrete but also owing to the concerns associated with climate change, sustainability, reduction of CO2 emissions, and energy consumption. The characteristics and amounts of the...
-
Empirical Analysis of Forest Penalizing Attribute and Its Enhanced Variations for Android Malware Detection
PublikacjaAs a result of the rapid advancement of mobile and internet technology, a plethora of new mobile security risks has recently emerged. Many techniques have been developed to address the risks associated with Android malware. The most extensively used method for identifying Android malware is signature-based detection. The drawback of this method, however, is that it is unable to detect unknown malware. As a consequence of this problem,...