wyświetlamy 1000 najlepszych wyników Pomoc
Wyniki wyszukiwania dla: RDF DATASET PROFILING
-
Rapid Characterization of the Human Breast Milk Lipidome Using a Solid-Phase Microextraction and Liquid Chromatography-Mass Spectrometry-Based Approach.
PublikacjaHuman breast milk (HBM) is a biofluid consisting of various biomolecules such as proteins, lipids, carbohydrates, minerals and bioactive substances. Due to its unique and complex composition, HBM provides not only nutritional components required for the growth of the infant, but also additional protection against infections. Global insight into the composition of HBM is crucial to understanding the health benefits infants receive...
-
Chiral analysis of chloro intermediates of methylamphetamine by one-dimensional and multidementional NMR and GC/MS
PublikacjaImpurity profiling and classification of abused drugs using chiral analytical techniques is of particular interest and importance because of the additional information obtained fromthis approach. When these methods are applied to the synthesis of illicitly used substances, they can supply valuable information about the conditions/chemicals used in the synthesis. We have applied GC and NMR methods to the study of intermediates found...
-
In Silico Safety Assessment of Bacillus Isolated from Polish Bee Pollen and Bee Bread as Novel Probiotic Candidates
PublikacjaBacillus species isolated from Polish bee pollen (BP) and bee bread (BB) were characterized for in silico probiotic and safety attributes. A probiogenomics approach was used, and in-depth genomic analysis was performed using a wide array of bioinformatics tools to investigate the presence of virulence and antibiotic resistance properties, mobile genetic elements, and secondary metabolites. Functional annotation and Carbohydrate-Active...
-
Epigenetic Basis of Regeneration: Analysis of Genomic DNA Methylation Profiles in the MRL/MpJ Mouse
PublikacjaEpigenetic regulation plays essential role in cell differentiation and dedifferentiation, which are the intrinsic processes involved in regeneration. To investigate the epigenetic basis of regeneration capacity, we choose DNA methylation as one of the most important epigenetic mechanisms and the MRL/MpJ mouse as a model of mammalian regeneration known to exhibit enhanced regeneration response in different organs. We report the...
-
Global Value Chains and Wages: International Evidence from Linked Worker-Industry Data
PublikacjaUsing a rich dataset on over 110,000 workers from nine European countries and the USA we study the wage response to industry dependence on foreign value added. We estimate a Mincerian wage model augmented with an input-output interindustry linkages measure accounting for task heterogeneity across workers. Low and mediumeducated workers and those performing routine tasks experience (little) wage decline due to major dependency of...
-
TOWARDS EXPLAINABLE CLASSIFIERS USING THE COUNTERFACTUAL APPROACH - GLOBAL EXPLANATIONS FOR DISCOVERING BIAS IN DATA
PublikacjaThe paper proposes summarized attribution-based post-hoc explanations for the detection and identification of bias in data. A global explanation is proposed, and a step-by-step framework on how to detect and test bias is introduced. Since removing unwanted bias is often a complicated and tremendous task, it is automatically inserted, instead. Then, the bias is evaluated with the proposed counterfactual approach. The obtained results...
-
Neural Network Subgraphs Correlation with Trained Model Accuracy
PublikacjaNeural Architecture Search (NAS) is a computationally demanding process of finding optimal neural network architecture for a given task. Conceptually, NAS comprises applying a search strategy on a predefined search space accompanied by a performance evaluation method. The design of search space alone is expected to substantially impact NAS efficiency. We consider neural networks as graphs and find a correlation between the presence...
-
Data from the Survey on Entrepreneurs’ Opinions on Factors Determining the Employment of the Gdańsk University of Technology Graduates
PublikacjaThe dataset includes data from a survey on factors determining the employment of the Gdańsk University of Technology (Gdańsk Tech) graduates’ in the opinion of entrepreneurs. The survey was conducted in 2017. The research sample included 102 respondents representing various firms from the Pomeranian Voivodeship, Poland. The study concerned i.a. factors determining the decision to hire a candidate, methods of recruiting employees,...
-
Evaluating the risk of endometriosis based on patients’ self-assessment questionnaires
PublikacjaBackground Endometriosis is a condition that significantly affects the quality of life of about 10 % of reproductive-aged women. It is characterized by the presence of tissue similar to the uterine lining (endometrium) outside the uterus, which can lead lead scarring, adhesions, pain, and fertility issues. While numerous factors associated with endometriosis are documented, a wide range of symptoms may still be undiscovered. Methods In...
-
Cascade Object Detection and Remote Sensing Object Detection Method Based on Trainable Activation Function
PublikacjaObject detection is an important process in surveillance system to locate objects and it is considered as major application in computer vision. The Convolution Neural Network (CNN) based models have been developed by many researchers for object detection to achieve higher performance. However, existing models have some limitations such as overfitting problem and lower efficiency in small object detection. Object detection in remote...
-
Ensembling noisy segmentation masks of blurred sperm images
PublikacjaBackground: Sperm tail morphology and motility have been demonstrated to be important factors in determining sperm quality for in vitro fertilization. However, many existing computer-aided sperm analysis systems leave the sperm tail out of the analysis, as detecting a few tail pixels is challenging. Moreover, some publicly available datasets for classifying morphological defects contain images limited only to the sperm head. This...
-
Tweet you right back: Follower anxiety predicts leader anxiety in social media interactions during the SARS-CoV-2 pandemic
PublikacjaRecent research has shown that organizational leaders’ tweets can influence employee anxiety. In this study, we turn the table and examine whether the same can be said about followers’ tweets. Based on emotional contagion and a dataset of 108 leaders and 178 followers across 50 organizations, we infer and track state- and trait-anxiety scores of participants over 316 days, including pre- and post the onset of the SARS-CoV-2 pandemic...
-
Tribological Properties of Thermoplastic Materials Formed by 3D Printing by FDM Process
PublikacjaThe dataset entitled 3D printed ABS thermoplastic vs. steel. Dry sliding wear test in constant load & velocity ring on flat configuration. Test parameters: print layer thickness and orientation. Test symbol: 019_h_4 contains: the time base (expressed in seconds and minutes), the friction torque for sliding friction, rotational velocity of the counter – specimen (velocity of sliding), friction coefficient, load in the friction contact...
-
Acquisition and indexing of RGB-D recordings for facial expressions and emotion recognition
PublikacjaIn this paper KinectRecorder comprehensive tool is described which provides for convenient and fast acquisition, indexing and storing of RGB-D video streams from Microsoft Kinect sensor. The application is especially useful as a supporting tool for creation of fully indexed databases of facial expressions and emotions that can be further used for learning and testing of emotion recognition algorithms for affect-aware applications....
-
Capillary gas chromatography using a -cyclodextrin forenantiomeric separation of methylamphetamine, its precursors andchloro intermediates after optimization of the derivatization reaction
PublikacjatThe enantiomeric ratio of methylamphetamine (MAMP) is closely related to the optical activity of precur-sors and reagents used for the synthesis and this knowledge can provide useful information concerningthe origins and synthetic methods used for illicit manufacture. The information can be utilized for reg-ulation of the precursors and investigation of the manufacturing sources but this requires analyticalprocedures to determine...
-
Novel ABTS-dot-blot method for the assessment of antioxidant properties of food packaging
PublikacjaThe new ABTS-dot-blot method for the direct determination of antioxidant activity of active packaging that is in contact with foodstuffs has been developed. The usefulness of the new method was verified with the use of agarose, pork gelatin, bacterial cellulose and cellulose-chitosan films with incorporated standard antioxidant – Trolox or plant phytochemicals derived from three types of berry juices (chokeberry, blue-berried honeysuckle,...
-
Two Stage SVM and kNN Text Documents Classifier
PublikacjaThe paper presents an approach to the large scale text documents classification problem in parallel environments. A two stage classifier is proposed, based on a combination of k-nearest neighbors and support vector machines classification methods. The details of the classifier and the parallelisation of classification, learning and prediction phases are described. The classifier makes use of our method named one-vs-near. It is...
-
Exploring music listening patterns: an online survey
PublikacjaAn online survey was carried out to explore how respondents listen to music recordings. It was anticipated that the listener’s preferences would be influenced by various factors, such as age, music genre, the contexts in which they listen, and their favored methods of music consumption. Consequently, the data were collected to analyze these relationships. The survey, structured as a web application, encompassed 23 questions,...
-
Integrating Statistical and Machine‐Learning Approach for Meta‐Analysis of Bisphenol A‐Exposure Datasets Reveals Effects on Mouse Gene Expression within Pathways of Apoptosis and Cell Survival
PublikacjaBisphenols are important environmental pollutants that are extensively studied due to different detrimental effects, while the molecular mechanisms behind these effects are less well understood. Like other environmental pollutants, bisphenols are being tested in various experimental models, creating large expression datasets found in open access storage. The meta‐analysis of such datasets is, however, very complicated for various...
-
CPLFD-GDPT5: High-resolution gridded daily precipitation and temperature data set for two largest Polish river basins
PublikacjaThe CHASE-PL (Climate change impact assessment for selected sectors in Poland) Forcing Data–Gridded Daily Precipitation & Temperature Dataset–5 km (CPLFD-GDPT5) consists of 1951–2013 daily minimum and maximum air temperatures and precipitation totals interpolated onto a 5 km grid based on daily meteorological observations from the Institute of Meteorology and Water Management (IMGW-PIB; Polish stations), Deutscher Wetterdienst...
-
Real-Time Facial Features Detection from Low Resolution Thermal Images with Deep Classification Models
PublikacjaDeep networks have already shown a spectacular success for object classification and detection for various applications from everyday use cases to advanced medical problems. The main advantage of the classification models over the detection models is less time and effort needed for dataset preparation, because classification networks do not require bounding box annotations, but labels at the image level only. Yet, after passing...
-
How Specific Can We Be with k-NN Classifier?
PublikacjaThis paper discusses the possibility of designing a two stage classifier for large-scale hierarchical and multilabel text classification task, that will be a compromise between two common approaches to this task. First of it is called big-bang, where there is only one classifier that aims to do all the job at once. Top-down approach is the second popular option, in which at each node of categories’ hierarchy, there is a flat classifier...
-
Residual MobileNets
PublikacjaAs modern convolutional neural networks become increasingly deeper, they also become slower and require high computational resources beyond the capabilities of many mobile and embedded platforms. To address this challenge, much of the recent research has focused on reducing the model size and computational complexity. In this paper, we propose a novel residual depth-separable convolution block, which is an improvement of the basic...
-
Expectation-Maximization Model for Substitution of Missing Values Characterizing Greenness of Organic Solvents
PublikacjaOrganic solvents are ubiquitous in chemical laboratories and the Green Chemistry trend forces their detailed assessments in terms of greenness. Unfortunately, some of them are not fully characterized, especially in terms of toxicological endpoints that are time consuming and expensive to be determined. Missing values in the datasets are serious obstacles, as they prevent the full greenness characterization of chemicals. A featured...
-
Occurrence of Cyanobacteria in the Gulf of Gdańsk (2008–2009)
PublikacjaBlooms of cyanobacteria develop each summer in the Baltic Sea. Collecting complete data on this phenomenon is helpful in understanding the changes taking place in the Baltic Sea and forecasting the occurrence of these phenomena in the future. This dataset includes unpublished information about the occurrence of cyanobacteria in the Gulf of Gdańsk (Southern Baltic) in 2008 and 2009. The presented data combines basic physic-ochemical...
-
The Belt and Road Initiative and export variety: 1996–2019
PublikacjaThis study examines the association between the Belt and Road Initiative (BRI) and export variety (EV). We propose three hypotheses on how BRI may foster export markets (destinations) or export product lines. The estimates are based on a dataset constructed specifically for this analysis, covering 183 countries and linked with trade data from 1996 to 2019. We apply the instrumental variable (IV) approach in regressions for covering the...
-
Photos and rendered images of LEGO bricks
PublikacjaThe paper describes a collection of datasets containing both LEGO brick renders and real photos. The datasets contain around 155,000 photos and nearly 1,500,000 renders. The renders aim to simulate real-life photos of LEGO bricks allowing faster creation of extensive datasets. The datasets are publicly available via the Gdansk University of Technology “Most Wiedzy” institutional repository. The source files of all tools used during...
-
Preeclampsia Risk Prediction Using Machine Learning Methods Trained on Synthetic Data
PublikacjaThis paper describes a research study that investigates the use of machine learning algorithms on synthetic data to classify the risk of developing preeclampsia by pregnant women. Synthetic datasets were generated based on parameter distributions from three real patient studies. Four models were compared: XGBoost, Support Vector Machine (SVM), Random Forest, and Explainable Boosting Machines (EBM). The study found that the XGBoost...
-
LSA Is not Dead: Improving Results of Domain-Specific Information Retrieval System Using Stack Overflow Questions Tags
PublikacjaThe paper presents the approach to using tags from Stack Overflow questions as a data source in the process of building domain-specific unsupervised term embeddings. Using a huge dataset of Stack Overflow posts, our solution employs the LSA algorithm to learn latent representations of information technology terms. The paper also presents the Teamy.ai system, currently developed by Scalac company, which serves as a platform that...
-
Instance segmentation of stack composed of unknown objects
PublikacjaThe article reviews neural network architectures designed for the segmentation task. It focuses mainly on instance segmentation of stacked objects. The main assumption is that segmentation is based on a color image with an additional depth layer. The paper also introduces the Stacked Bricks Dataset based on three cameras: RealSense L515, ZED2, and a synthetic one. Selected architectures: DeepLab, Mask RCNN, DEtection TRansformer,...
-
Analysis of results of large-scale multimodal biometric identity verification experiment
PublikacjaAn analysis of a large set of biometric data obtained during the enrolment and the verification phase in an experimental biometric system installed in bank branches is presented. Subjective opinions of bank clients and of bank tellers were also surveyed concerning the studied biometric methods in order to discover and to explore relations emerging from the obtained multimodal dataset. First, data acquisition and identity verification...
-
A Bayesian regularization-backpropagation neural network model for peeling computations
PublikacjaA Bayesian regularization-backpropagation neural network (BRBPNN) model is employed to predict some aspects of the gecko spatula peeling, viz. the variation of the maximum normal and tangential pull-off forces and the resultant force angle at detachment with the peeling angle. K-fold cross validation is used to improve the effectiveness of the model. The input data is taken from finite element (FE) peeling results. The neural network...
-
Hasse diagram as a green analytical metrics tool: ranking of methods for benzo[a]pyrene determination in sediments
PublikacjaThis study presents an application of the Hasse diagram technique (HDT) as the assessment tool to select the most appropriate analytical procedures according to their greenness or the best analytical performance. The dataset consists of analytical procedures for benzo[a]pyrene determination in sediment samples, which were described by 11 variables concerning their greenness and analytical performance. Two analyses with the HDT...
-
Vehicle detector training with minimal supervision
PublikacjaRecently many efficient object detectors based on convolutional neural networks (CNN) have been developed and they achieved impressive performance on many computer vision tasks. However, in order to achieve practical results, CNNs require really large annotated datasets for training. While many such databases are available, many of them can only be used for research purposes. Also some problems exist where such datasets are not...
-
Automatic Threat Detection for Historic Buildings in Dark Places Based on the Modified OptD Method
PublikacjaHistoric buildings, due to their architectural, cultural, and historical value, are the subject of preservation and conservatory works. Such operations are preceded by an inventory of the object. One of the tools that can be applied for such purposes is Light Detection and Ranging (LiDAR). This technology provides information about the position, reflection, and intensity values of individual points; thus, it allows for the creation...
-
Forecasting risks and challenges of digital innovations
PublikacjaForecasting and assessment of societal risks related to digital innovation systems and services is an urgent problem, because these solutions usually contain artificial intelligence algorithms which learn using data from the environment and modify their behaviour much beyond human control. Digital innovation solutions are increasingly deployed in transport, business and administrative domains, and therefore, if abused by a malicious...
-
Development of cluster analysis methodology for identification of model rainfall hyetographs and its application at an urban precipitation field scale
PublikacjaDespite growing access to precipitation time series records at a high temporal scale, in hydrology, and particularly urban hydrology, engineers still design and model drainage systems using scenarios of rainfall temporal distributions predefined by means of model hyetographs. This creates the need for the availability of credible statistical methods for the development and verification of already locally applied model hyetographs....
-
Material for Automatic Phonetic Transcription of Speech Recorded in Various Conditions
PublikacjaAutomatic speech recognition (ASR) is under constant development, especially in cases when speech is casually produced or it is acquired in various environment conditions, or in the presence of background noise. Phonetic transcription is an important step in the process of full speech recognition and is discussed in the presented work as the main focus in this process. ASR is widely implemented in mobile devices technology, but...
-
Processing of LiDAR and Multibeam Sonar Point Cloud Data for 3D Surface and Object Shape Reconstruction
PublikacjaUnorganised point cloud dataset, as a transitional data model in several applications, usually contains a considerable amount of undesirable irregularities, such as strong variability of local point density, missing data, overlapping points and noise caused by scattering characteristics of the environment. For these reasons, further processing of such data, e.g. for construction of higher order geometric models of the topography...
-
INFLUENCE OF DATA NORMALIZATION ON THE EFFECTIVENESS OF NEURAL NETWORKS APPLIED TO CLASSIFICATION OF PAVEMENT CONDITIONS – CASE STUDY
PublikacjaIn recent years automatic classification employing machine learning seems to be in high demand for tele-informatic-based solutions. An example of such solutions are intelligent transportation systems (ITS), in which various factors are taken into account. The subject of the study presented is the impact of data pre-processing and normalization on the accuracy and training effectiveness of artificial neural networks in the case...
-
CNN Architectures for Human Pose Estimation from a Very Low Resolution Depth Image
PublikacjaThe paper is dedicated to proposing and evaluating a number of convolutional neural network architectures for calculating a multiple regression on 3D coordinates of human body joints tracked in a single low resolution depth image. The main challenge was to obtain a high precision in case of a noisy and coarse scan of the body, as observed by a depth sensor from a large distance. The regression network was expected to reason about...
-
Analysis of the Capability of Deep Learning Algorithms for EEG-based Brain-Computer Interface Implementation
PublikacjaMachine learning models have received significant attention for their exceptional performance in classifying electroencephalography (EEG) data. They have proven to be highly effective in extracting intricate patterns and features from the raw signal data, thereby contributing to their success in EEG classification tasks. In this study, we explore the possibilities of utilizing contemporary machine learning algorithms in decoding...
-
Driver’s Condition Detection System Using Multimodal Imaging and Machine Learning Algorithms
PublikacjaTo this day, driver fatigue remains one of the most significant causes of road accidents. In this paper, a novel way of detecting and monitoring a driver’s physical state has been proposed. The goal of the system was to make use of multimodal imaging from RGB and thermal cameras working simultaneously to monitor the driver’s current condition. A custom dataset was created consisting of thermal and RGB video samples. Acquired data...
-
A novel approach exploiting properties of convolutional neural networks for vessel movement anomaly detection and classification
PublikacjaThe article concerns the automation of vessel movement anomaly detection for maritime and coastal traffic safety services. Deep Learning techniques, specifically Convolutional Neural Networks (CNNs), were used to solve this problem. Three variants of the datasets, containing samples of vessel traffic routes in relation to the prohibited area in the form of a grayscale image, were generated. 1458 convolutional neural networks with...
-
Methodology of Constructing and Analyzing the Hierarchical Contextually-Oriented Corpora
PublikacjaMethodology of Constructing and Analyzing the Hierarchical structure of the Contextually-Oriented Corpora was developed. The methodology contains the following steps: Contextual Component of the Corpora’s Structure Building; Text Analysis of the Contextually-Oriented Hierarchical Corpus. Main contribution of this study is the following: hierarchical structure of the Corpus provides advanced possibilities for identification of the...
-
Visual Content Learning in a Cognitive Vision Platform for Hazard Control (CVP-HC)
PublikacjaThis work is part of an effort for the development of a Cognitive Vision Platform for Hazard Control (CVP-HC) for applications in industrial workplaces, adaptable to a wide range of environments. The paper focuses on hazards resulted from the nonuse of personal protective equipment (PPE). Given the results of previous analysis of supervised techniques for the problem of classification of a few PPE (boots, hard hats, and gloves...
-
Crowdsourcing-Based Evaluation of Automatic References Between WordNet and Wikipedia
PublikacjaThe paper presents an approach to build references (also called mappings) between WordNet and Wikipedia. We propose four algorithms used for automatic construction of the references. Then, based on an aggregation algorithm, we produce an initial set of mappings that has been evaluated in a cooperative way. For that purpose, we implement a system for the distribution of evaluation tasks, that have been solved by the user community....
-
Focus on Misinformation: Improving Medical Experts’ Efficiency of Misinformation Detection
PublikacjaFighting medical disinformation in the era of the global pandemic is an increasingly important problem. As of today, automatic systems for assessing the credibility of medical information do not offer sufficient precision to be used without human supervision, and the involvement of medical expert annotators is required. Thus, our work aims to optimize the utilization of medical experts’ time. We use the dataset of sentences taken...
-
Global value chains and wages under different wage setting mechanisms
PublikacjaThis study examines whether, and how, differences in wage bargaining schemes shape the relationship between global value chains (GVCs) and the wages of workers while considering both GVC participation and position in GVC. Our dataset is derived from the European Structure of Earnings Survey (SES), containing employee–employer data from 18 European countries, merged with sectoral data from the World Input-Output Database (WIOD)....
-
Intelligent Audio Signal Processing − Do We Still Need Annotated Datasets?
PublikacjaIn this paper, intelligent audio signal processing examples are shortly described. The focus is, however, on the machine learning approach and datasets needed, especially for deep learning models. Years of intense research produced many important results in this area; however, the goal of fully intelligent signal processing, characterized by its autonomous acting, is not yet achieved. Therefore, a review of state-of-the-art concerning...