Total: 378
Search results for: DATASET CONSTRUCTION
-
Thermal imaging in automatic rodent’s social behaviour analysis
Publication: Laboratory rodent social behaviour analysis is an extremely important task for biological, medical and pharmacological research. In this work, thermal image features that facilitate the analysis are presented. Methods to distinguish objects on the basis of thermal distribution are tested. Actions of grooming or biting one rodent by another - important social behaviour incidents - are clearly visible...
-
Deep CNN based decision support system for detection and assessing the stage of diabetic retinopathy
Publication: Diabetic retinopathy is a disease caused by long-standing diabetes. Lack of effective treatment can lead to vision impairment and even irreversible blindness. The disease can be diagnosed by examining digital color fundus photographs of the retina. In this paper we propose a deep learning approach to automated diabetic retinopathy screening. Deep convolutional neural networks (CNN) - the most popular kind of deep learning algorithms...
-
News that Moves the Market: DSEX-News Dataset for Forecasting DSE Using BERT
Publication: The stock market is a complex and dynamic industry that has always presented challenges for stakeholders and investors due to its unpredictable nature. This unpredictability motivates the need for more accurate prediction models. Traditional prediction models have limitations in handling the dynamic nature of the stock market. Additionally, previous methods have used less relevant data, leading to suboptimal performance. This study...
-
From Data to Decision: Interpretable Machine Learning for Predicting Flood Susceptibility in Gdańsk, Poland
Publication: Flood susceptibility prediction is complex due to the multifaceted interactions among hydrological, meteorological, and urbanisation factors, further exacerbated by climate change. This study addresses these complexities by investigating flood susceptibility in rapidly urbanising regions prone to extreme weather events, focusing on Gdańsk, Poland. Three popular ML techniques, Support Vector Machine (SVM), Random Forest (RF), and...
-
Testing the Effect of Bathymetric Data Reduction on the Shape of the Digital Bottom Model
Publication: Depth data and the digital bottom model created from it are very important in studies and research of inland and coastal water zones. The paper addresses bathymetric data processing using reduction methods and examines the impact of data reduction on the resulting representations of the bottom surface in the form of numerical bottom models. Data reduction is an approach that is meant to reduce the size...
-
Assessing the attractiveness of human face based on machine learning
Publication: The attractiveness of the face plays an important role in everyday life, especially in the modern world where social media and the Internet surround us. In this study, an attempt to assess the attractiveness of a face by machine learning is shown. Attractiveness is determined by three deep models whose sum of predictions is the final score. Two annotated datasets available in the literature are employed for training and testing...
-
Detection of circulating tumor cells by means of machine learning using Smart-Seq2 sequencing
Publication: Circulating tumor cells (CTCs) are tumor cells that separate from the solid tumor and enter the bloodstream, which can cause metastasis. Detection and enumeration of CTCs show promising potential as a predictor for prognosis in cancer patients. Furthermore, single-cell sequencing is a technique that provides genetic information from individual cells and allows them to be classified precisely and reliably. Sequencing data typically...
-
Convolutional Neural Networks for C. Elegans Muscle Age Classification Using Only Self-Learned Features
Publication: Nematodes Caenorhabditis elegans (C. elegans) have been used as model organisms in a wide variety of biological studies, especially those intended to obtain a better understanding of aging and age-associated diseases. This paper focuses on automating the analysis of C. elegans imagery to classify the muscle age of nematodes based on the known and well-established IICBU dataset. Unlike many modern classification methods, the proposed...
-
Pedestrian detection in low-resolution thermal images
Publication: Over one million people die in car accidents worldwide each year. A solution that will be able to reduce situations in which pedestrian safety is at risk has been sought for a long time. One of the techniques for detecting pedestrians on the road is the use of artificial intelligence in connection with thermal imaging. The purpose of this work was to design a system to assist the safety of people and car intelligence with the use...
-
KEMR-Net: A Knowledge-Enhanced Mask Refinement Network for Chromosome Instance Segmentation
Publication: This article proposes a mask refinement method for chromosome instance segmentation. The proposed method exploits the knowledge representation capability of Neural Knowledge DNA (NK-DNA) to capture the semantics of the chromosome’s shape, texture, and key points, and then it uses the captured knowledge to improve the accuracy and smoothness of the masks. We validate the method’s effectiveness on our latest high-resolution chromosome...
-
Comparison of image pre-processing methods in liver segmentation task
Publication: Automatic liver segmentation of Computed Tomography (CT) images is becoming increasingly important. Although there are many publications in this field, there is little explanation why certain pre-processing methods were utilised. This paper presents a comparison of the commonly used approach of Hounsfield Units (HU) windowing, histogram equalisation, and a combination of these methods to try to ascertain what are the differences...
-
Cyanobacterial and Algal Strains in the Culture Collection of Baltic Algae (CCBA)
Publication: The dataset titled Microalgal strains from “Culture Collection of Baltic Algae (CCBA)” is a representation of cyanobacterial and microalgal cultures isolated from the Baltic Sea. It is a unique catalogue of strains of the dominant and rare species found in the Baltic phytoplankton and microphytobenthos assemblages. The main purpose of the collection is to extend the knowledge on the Baltic microbial communities by providing...
-
Macrophytobenthos in the Puck Bay in 2010–2018 Dataset
Publication: The dataset titled Biomass of macrophytobenthos in the Puck Bay in 2010-2018 contains data on the qualitative composition and biomass of macrophytobenthos (flower plants and macroalgae) in samples collected in the Puck Bay area (Gulf of Gdańsk, southern Baltic Sea) at 20 stations between 2010–2018. The data was supplemented with additional information: values of measured parameters of water and sediment, e.g. temperature...
-
Simplified AutoDock force field for hydrated binding sites
Publication: ... has been extracted from the Protein Data Bank and used to test and recalibrate the AutoDock force field. Since for some binding sites water molecules are crucial for bridging the receptor-ligand interactions, they have to be included in the analysis. To simplify the process of incorporating water molecules into the binding sites and make it less ambiguous, a new simple water model was created. After recalibration of the force field on...
-
Educational Dataset of Handheld Doppler Blood Flow Recordings
Publication: Vital signals registration plays a significant role in biomedical engineering and the education process. Well-acquired data allow future engineers to observe certain physical phenomena as well as learn how to correctly process and interpret the data. This dataset was designed for students to learn about Doppler phenomena and to demonstrate correctly and incorrectly acquired signals as well as the basic methods of signal processing. This...
-
Spatial development concept for the Outer Port in Gdańsk - work number 1/22/23
Research Data: The research series presents, in the form of a design chart, possible variants of the spatial layout of the water and land areas of the Outer Port of Gdańsk using different combinations of terminals with different annual turnover volumes. Individual works in the series present different solutions for anchorages, channels, port entrances, turntables and...
-
AFM and SSRM investigation of carbon nanowalls properties
Research Data: Structures with limited dimensionality are of great interest in modern nanotechnology. The properties of these objects are used, among others, for the construction of modern displays or as a base for quantum computers. Carbon nanowalls, which are the subject of the imaging results contained in this collection, are also considered interesting building...
-
Global Value Chains and Wages: Multi-Country Evidence from Linked Worker-Industry Data
Publication: This paper uses a multi-country microeconomic setting to contribute to the literature on the nexus between production fragmentation and wages. Exploiting a rich dataset on over 110,000 workers from nine Eastern and Western European countries and the United States, we study the relationship between individual workers’ wages and industry ties into global value chains (GVCs). We find an inverse (but weak) relationship between the...
-
Analysing By-Products Interaction as an Industry Resource of Circular Economy in Ukraine and the World
Publication: The paper analyses existing and current scientific developments and literature sources, which show the advantages and disadvantages of many different influences of waste in Ukraine and other countries of Europe and the world. As a research result, stable connections have been established between the factors and criteria in assessing the by-product interaction as an industry resource. In our research, we used programs R.Studio and...
-
Legislation and Practice of Selected State Aid Issues, According to EU and Polish Law
Publication: The dataset encompasses several tables, each consisting of three elements: legislation, jurisprudence and scientific articles on numerous subjects and economic activities receiving public financial support in the form of state aid instruments. The set includes a subjective list of the most commonly used and/or disputable examples of granting aid, such as for (local) airports and airlines, steel production, shipyards, and coalmines....
-
Long-term Hindcast Simulation of Currents, Sea Level, Water Temperature and Salinity in the Baltic Sea
Publication: This dataset contains the results of numerical modelling of currents, sea level, water temperature and salinity over a period of 50 years (1958–2007) in the Baltic Sea. A long-term hindcast simulation was performed using a three-dimensional hydrodynamic model (PM3D) based on the Princeton Ocean Model (POM). The spatial resolution was 3 nautical miles, i.e. about 5.5 km. Currents, water temperature, and salinity were recorded...
-
Operational Enhancement of Numerical Weather Prediction with Data from Real-time Satellite Images
Publication: Numerical weather prediction (NWP) is a rapidly expanding field of science, which is related to meteorology, remote sensing and computer science. The authors present methods of enhancing the WRF EMS (Weather Research and Forecast Environmental Modeling System) weather prediction system using data from satellites equipped with the AMSU sensor (Advanced Microwave Sounding Unit). The data is acquired with Department of Geoinformatics’ ground...
-
Towards semantic-rich word embeddings
Publication: In recent years, word embeddings have been shown to improve the performance in NLP tasks such as syntactic parsing or sentiment analysis. While useful, they are problematic in representing ambiguous words with multiple meanings, since they keep a single representation for each word in the vocabulary. Constructing separate embeddings for meanings of ambiguous words could be useful for solving the Word Sense Disambiguation (WSD)...
-
Toward Intelligent Recommendations Using the Neural Knowledge DNA
Publication: In this paper we propose a novel recommendation approach using past news click data and the Neural Knowledge DNA (NK-DNA). The Neural Knowledge DNA is a novel knowledge representation method designed to support discovering, storing, reusing, improving, and sharing knowledge among machines and computing systems. We examine our approach for news recommendation tasks on the MIND benchmark dataset. By taking advantage of NK-DNA, deep...
-
Simultaneous grouping and ranking with combination of SOM and TOPSIS for selection of preferable analytical procedure for furan determination in food
Publication: A novel methodology for grouping and ranking with application of self-organizing maps and multicriteria decision analysis is presented. The dataset consists of 22 objects that are analytical procedures applied to furan determination in food samples. They are described by 10 variables, referring to their analytical performance, environmental and economic aspects. Multivariate statistics analysis makes it possible to limit the amount of input...
-
Exploring the Usability and User Experience of Social Media Apps through a Text Mining Approach
Publication: This study aims to evaluate the applicability of a text mining approach for extracting UUX-related issues from a dataset of user comments and not to evaluate the Instagram (IG) app. This study analyses textual data mined from reviews in English written by IG mobile application users. The article’s authors used text mining (based on the LDA algorithm) to identify the main UUX-related topics. Next, they mapped the identified topics...
-
Ontological Modeling for Contextual Data Describing Signals Obtained from Electrodermal Activity for Emotion Recognition and Analysis
Publication: Most of the research in the field of emotion recognition is based on datasets that contain data obtained during affective computing experiments. However, each dataset is described by different metadata, stored in various structures and formats. This research can be counted among those whose aim is to provide a structural and semantic pattern for affective computing datasets, which is an important step to solve the problem of data...
-
Bi-GRU-APSO: Bi-Directional Gated Recurrent Unit with Adaptive Particle Swarm Optimization Algorithm for Sales Forecasting in Multi-Channel Retail
Publication: In the present scenario, retail sales forecasting has great significance for E-commerce companies. Precise retail sales forecasting enhances business decision making, storage management, and product sales. Inaccurate retail sales forecasting can decrease customer satisfaction and lead to inventory shortages, product backlog, and unsatisfied customer demands. In order to obtain a better retail sales forecasting, deep learning models...
-
Balanced Spider Monkey Optimization with Bi-LSTM for Sustainable Air Quality Prediction
Publication: A reliable air quality prediction model is required for pollution control, human health monitoring, and sustainability. The existing air quality prediction models lack efficiency due to overfitting in the prediction model and the local optima trap in feature selection. This study proposes the Balanced Spider Monkey Optimization (BSMO) technique for effective feature selection to overcome the local optima trap and overfitting problems....
-
Automatic music genre classification based on musical instrument track separation / Automatyczna klasyfikacja gatunku muzycznego wykorzystująca algorytm separacji dźwięku instrumentó muzycznych
Publication: The aim of this article is to investigate whether separating music tracks at the pre-processing phase and extending the feature vector by parameters related to the specific musical instruments that are characteristic of the given musical genre allow for efficient automatic musical genre classification in the case of a database containing thousands of music excerpts and a dozen genres. Results of extensive experiments show that the approach...
-
Data from the Survey on Gdańsk University of Technology Graduates’ Professional Careers
Publication: The dataset titled Data from the survey on Gdańsk University of Technology graduates’ professional careers includes data from a survey of Gdańsk University of Technology (Gdańsk Tech) graduates’ professional careers. The survey was conducted in 2017, two years after the respondents obtained graduate status. The research sample included 2553 respondents. The study concerned, i.a. the percentage of people working among graduates...
-
Bus bays inventory using a terrestrial laser scanning system
Publication: This article presents the use of laser scanning technology for the assessment of bus bay geo-location. Ground laser scanning is an effective tool for collecting three-dimensional data. Moreover, the analysis of a point cloud dataset can be a source of a lot of information. The authors have outlined an innovative use of data collection and analysis using the TLS regarding information on the flatness of bus bays. The results were...
-
Spatial development concept for the Outer Port in Gdańsk - work number 1/23/24
Research Data: The research series presents, in the form of a design chart, possible variants of the spatial layout of the water and land areas of the Outer Port of Gdańsk using various combinations of terminals with different annual turnover volumes. Individual works in the series present different solutions for anchorages, channels, port entrances, turntables and basins,...
-
Spatial development concept for the Outer Port in Gdańsk - work number 3/23/24
Research Data: The research series presents, in the form of a design chart, possible variants of the spatial layout of the water and land areas of the Outer Port of Gdańsk using different combinations of terminals with various annual turnover volumes. Individual works in the series present different solutions for anchorages, channels, port entrances, turntables and...
-
Dataset Characteristics and Their Impact on Offline Policy Learning of Contextual Multi-Armed Bandits
Publication: The Contextual Multi-Armed Bandits (CMAB) framework is pivotal for learning to make decisions. However, due to challenges in deploying online algorithms, there is a shift towards offline policy learning, which relies on pre-existing datasets. This study examines the relationship between the quality of these datasets and the performance of offline policy learning algorithms, specifically, Neural Greedy and NeuraLCB. Our results...
-
Global Value Chains and Wages: International Evidence from Linked Worker-Industry Data
Publication: Using a rich dataset on over 110,000 workers from nine European countries and the USA, we study the wage response to industry dependence on foreign value added. We estimate a Mincerian wage model augmented with an input-output interindustry linkages measure accounting for task heterogeneity across workers. Low- and medium-educated workers and those performing routine tasks experience (little) wage decline due to major dependency of...
-
Neural Network Subgraphs Correlation with Trained Model Accuracy
Publication: Neural Architecture Search (NAS) is a computationally demanding process of finding optimal neural network architecture for a given task. Conceptually, NAS comprises applying a search strategy on a predefined search space accompanied by a performance evaluation method. The design of search space alone is expected to substantially impact NAS efficiency. We consider neural networks as graphs and find a correlation between the presence...
-
TOWARDS EXPLAINABLE CLASSIFIERS USING THE COUNTERFACTUAL APPROACH - GLOBAL EXPLANATIONS FOR DISCOVERING BIAS IN DATA
Publication: The paper proposes summarized attribution-based post-hoc explanations for the detection and identification of bias in data. A global explanation is proposed, and a step-by-step framework on how to detect and test bias is introduced. Since removing unwanted bias is often a complicated and tremendous task, it is automatically inserted, instead. Then, the bias is evaluated with the proposed counterfactual approach. The obtained results...
-
Data from the Survey on Entrepreneurs’ Opinions on Factors Determining the Employment of the Gdańsk University of Technology Graduates
Publication: The dataset includes data from a survey on factors determining the employment of the Gdańsk University of Technology (Gdańsk Tech) graduates in the opinion of entrepreneurs. The survey was conducted in 2017. The research sample included 102 respondents representing various firms from the Pomeranian Voivodeship, Poland. The study concerned i.a. factors determining the decision to hire a candidate, methods of recruiting employees,...
-
A Data-Driven Comparative Analysis of Machine-Learning Models for Familial Hypercholesterolemia Detection
Publication: This study presents an assessment of familial hypercholesterolemia (FH) probability using different algorithms (CatBoost, XGBoost, Random Forest, SVM) and their ensembles, leveraging electronic health record data. The primary objective is to explore an enhanced method for estimating FH probability, surpassing the currently recommended Dutch Lipid Clinic Network (DLCN) Score. The models were trained using the largest Polish cohort...
-
Evaluating the risk of endometriosis based on patients’ self-assessment questionnaires
Publication: Background: Endometriosis is a condition that significantly affects the quality of life of about 10 % of reproductive-aged women. It is characterized by the presence of tissue similar to the uterine lining (endometrium) outside the uterus, which can lead to scarring, adhesions, pain, and fertility issues. While numerous factors associated with endometriosis are documented, a wide range of symptoms may still be undiscovered. Methods: In...
-
Cascade Object Detection and Remote Sensing Object Detection Method Based on Trainable Activation Function
Publication: Object detection is an important process in surveillance systems to locate objects and it is considered a major application in computer vision. Convolutional Neural Network (CNN) based models have been developed by many researchers for object detection to achieve higher performance. However, existing models have some limitations such as the overfitting problem and lower efficiency in small object detection. Object detection in remote...
-
Ensembling noisy segmentation masks of blurred sperm images
Publication: Background: Sperm tail morphology and motility have been demonstrated to be important factors in determining sperm quality for in vitro fertilization. However, many existing computer-aided sperm analysis systems leave the sperm tail out of the analysis, as detecting a few tail pixels is challenging. Moreover, some publicly available datasets for classifying morphological defects contain images limited only to the sperm head. This...
-
Spatial development concept for the Outer Port in Gdańsk - work number 4/23/24
Research Data: The research series presents, in the form of a design chart, possible variants of the spatial layout of the water and land areas of the Outer Port of Gdańsk using different combinations of terminals with different annual turnover volumes. Individual works in the series present different solutions for anchorages, channels, port entrances, turntables and...
-
Spatial development concept for the Outer Port in Gdańsk - work number 2/23/24
Research Data: The research series presents, in the form of a design chart, possible variants of the spatial layout of the water and land areas of the Outer Port of Gdańsk using different combinations of terminals with different annual turnover volumes. Individual works in the series present different solutions for anchorages, channels, port entrances, turntables and...
-
Mobilenet-V2 Enhanced Parkinson's Disease Prediction with Hybrid Data Integration
Publication: This study investigates the role of deep learning models, particularly MobileNet-v2, in Parkinson's Disease (PD) detection through handwriting spiral analysis. Handwriting difficulties often signal early signs of PD, necessitating early detection tools due to potential impacts on patients' work capacities. The study utilizes a three-fold approach, including data augmentation, algorithm development for simulated PD image datasets,...
-
Tweet you right back: Follower anxiety predicts leader anxiety in social media interactions during the SARS-CoV-2 pandemic
Publication: Recent research has shown that organizational leaders’ tweets can influence employee anxiety. In this study, we turn the table and examine whether the same can be said about followers’ tweets. Based on emotional contagion and a dataset of 108 leaders and 178 followers across 50 organizations, we infer and track state- and trait-anxiety scores of participants over 316 days, including pre- and post the onset of the SARS-CoV-2 pandemic...
-
Tribological Properties of Thermoplastic Materials Formed by 3D Printing by FDM Process
Publication: The dataset entitled 3D printed ABS thermoplastic vs. steel. Dry sliding wear test in constant load & velocity ring on flat configuration. Test parameters: print layer thickness and orientation. Test symbol: 019_h_4 contains: the time base (expressed in seconds and minutes), the friction torque for sliding friction, rotational velocity of the counter – specimen (velocity of sliding), friction coefficient, load in the friction contact...
-
Acquisition and indexing of RGB-D recordings for facial expressions and emotion recognition
Publication: In this paper, the comprehensive KinectRecorder tool is described, which provides convenient and fast acquisition, indexing and storing of RGB-D video streams from the Microsoft Kinect sensor. The application is especially useful as a supporting tool for the creation of fully indexed databases of facial expressions and emotions that can be further used for learning and testing of emotion recognition algorithms for affect-aware applications....
-
Exploring music listening patterns: an online survey
Publication: An online survey was carried out to explore how respondents listen to music recordings. It was anticipated that the listener’s preferences would be influenced by various factors, such as age, music genre, the contexts in which they listen, and their favored methods of music consumption. Consequently, the data were collected to analyze these relationships. The survey, structured as a web application, encompassed 23 questions,...