Filters
total: 48
Search results for: PREPROCESSING
-
DEVELOPMENT OF THE ALGORITHM OF POLISH LANGUAGE FILM REVIEWS PREPROCESSING
PublicationThe algorithm and the software for conducting the procedure of Preprocessing of the reviews of films in the Polish language were developed. This algorithm contains the following steps: Text Adaptation Procedure; Procedure of Tokenization; Procedure of Transforming Words into the Byte Format; Part-of-Speech Tagging; Stemming / Lemmatization Procedure; Presentation of Documents in the Vector Form (Vector Space Model) Procedure; Forming...
-
Influence of preprocessing techniques on pulse pressure velocity determination
PublicationPulse Wave Velocity (PWV) is measured and utilized in many clinical applications. Recently, a wide research has been led to develop a cuff-less and continuous blood pressure method basing on PWV. However, in this application a decision on choosing an appropriate fiducial point of pulse wave (PW) waveform is necessary and substantial. It would allow to measure time parameters necessary to determine PWV. An influence of sampling...
-
New supervised alignment method as a preprocessing tool for chromatographic data in metabolomic studies
Publication -
Preprocessing of Document Images Based on the GGD and GMM for Binarization of Degraded Ancient Papyri Images
PublicationThresholding of document images is one of the most relevant operations that influence the final results of their further analysis. Although many image binarization methods have been proposed during recent several years, starting from global thresholding, through local and adaptive methods, to more sophisticated multi-stage algorithms and the use of deep convolutional neural networks, proper thresholding of degraded historical...
-
Methods of Natural Image Preprocessing Supporting the Automatic Text Recognition Using the OCR Algorithms
Publication -
Transcriptomics in Toxicogenomics, Part II: Preprocessing and Differential Expression Analysis for High Quality Data
Publication -
Continuous blood pressure monitoring by photoplethysmography - signal preprocessing requirements based on blood flow modelling
PublicationObjective. The aim of the study is to investigate the effect of the signal sampling frequency and low-pass filtering on the accuracy of the localisation of the fiducial points of the photoplethysmographic signal (PPG), and thus on the estimation of the blood pressure (i.e. the accuracy of the estimation). Approach. Statistical analysis was performed on 3,799 data samples taken from a publicly available database. Four PPGfiducial...
-
Increasing conclusiveness of metabonomic studies by cheminformatic preprocessing of capillary electrophoretic data on urinary nucleoside profiles
Publication -
Analysis of Image Preprocessing and Binarization Methods for OCR-Based Detection and Classification of Electronic Integrated Circuit Labeling
PublicationAutomatic recognition and classification of electronic integrated circuits based on optical character recognition combined with the analysis of the shape of their housings are essential to machine vision methods supporting the production of electronic parts, especially small-volume ones in the through-hole technology, characteristic of printed circuit boards. Since such methods utilize binary images, applying appropriate image...
-
Improvement of Image Binarization Methods Using Image Preprocessing with Local Entropy Filtering for Alphanumerical Character Recognition Purposes
PublicationAutomatic text recognition from the natural images acquired in uncontrolled lighting conditions is a challenging task due to the presence of shadows hindering the shape analysis and classification of individual characters. Since the optical character recognition methods require prior image binarization, the application of classical global thresholding methods in such case makes it impossible to preserve the visibility of all...
-
Gas Detection Using Resistive Gas Sensors And Radial Basis Function Neural Networks
PublicationWe present a use of Radial Basis Function (RBF) neural networks and Fluctuation Enhanced Sensing (FES) method in gas detection system utilizing a prototype resistive WO3 gas sensing layer with gold nanoparticles. We investigated accuracy of gas detection for three different preprocessing methods: no preprocessing, Principal Component Analysis (PCA) and wavelet transformation. Low frequency noise voltage observed in resistive gas...
-
Improving css-KNN Classification Performance by Shifts in Training Data
PublicationThis paper presents a new approach to improve the performance of a css-k-NN classifier for categorization of text documents. The css-k-NN classifier (i.e., a threshold-based variation of a standard k-NN classifier we proposed in [1]) is a lazy-learning instance-based classifier. It does not have parameters associated with features and/or classes of objects, that would be optimized during off-line learning. In this paper we propose...
-
Age Prediction from Low Resolution, Dual-Energy X-ray Images Using Convolutional Neural Networks
PublicationAge prediction from X-rays is an interesting research topic important for clinical applications such as biological maturity assessment. It is also useful in many other practical applications, including sports or forensic investigations for age verification purposes. Research on these issues is usually carried out using high-resolution X-ray scans of parts of the body, such as images of the hands or images of the chest. In this...
-
Hybrid approach to ontology specification and development
PublicationIn this chapter a first draft of a hybrid ontology development approach is presented. The context of this method encompasses the multi-agent system equipped with knowledge bases and inferring engine. The process of ontology engineering appears as a complex issue, as it addresses different point-views of both, a client and a modeler with a requirements/system analyst. Regarding this, an approach based on the knowledge preprocessing...
-
Advances in macromodeling technique
PublicationThe paper discuses recent advances in the finite differencetime domain method employing macromodels. New techniquesfor creating irregularly shaped macromodels, grouping ofmacromodels and advanced macromodel cloning are introduced.The last technique is particularly important for efficient analysisof the structures based on Photonic Crystals (PhC). The methodallows one to shorten considerably the preprocessing time, theRAM usage...
-
Harmony Search for Data Mining with Big Data
PublicationIn this paper, some harmony search algorithms have been proposed for data mining with big data. Three areas of big data processing have been studied to apply new metaheuristics. The first problem is related to MapReduce architecture that can be supported by a team of harmony search agents in grid infrastructure. The second dilemma involves development of harmony search in preprocessing of data series before data mining. Moreover,...
-
A Mammography Data Management Application for Federated Learning
PublicationThis study aimed to develop and assess an application designed to enhance the management of a local client database consisting of mammographic images with a focus on ensuring that images are suitably and uniformly prepared for federated learning applications. The application supports a comprehensive approach, starting with a versatile image-loading function that supports DICOM files from various medical imaging devices and settings....
-
Application of dynamic time warping and cepstrograms to text-dependent speaker verification
PublicationThis work provides a description of an automatic speaker verification (ASV) system. In particular, it documents the evolution of all individual stages of the proposed ASV system design from the phase of preprocessing to an operational decision making system. The aim of this research was to achieve the system of the best safety and ease of use in view of users. The objective estimation of this target has been accomplished by assessing...
-
ANN for human pose estimation in low resolution depth images
PublicationThe paper presents an approach to localize human body joints in 3D coordinates based on a single low resolution depth image. First a framework to generate a database of 80k realistic depth images from a 3D body model is described. Then data preprocessing and normalization procedure, and DNN and MLP artificial neural networks architectures and training are presented. The robustness against camera distance and image noise is analysed....
-
MACHINE LEARNING APPLICATIONS IN RECOGNIZING HUMAN EMOTIONS BASED ON THE EEG
PublicationThis study examined the machine learning-based approach allowing the recognition of human emotional states with the use of EEG signals. After a short introduction to the fundamentals of electroencephalography and neural oscillations, the two-dimensional valence-arousal Russell’s model of emotion was described. Next, we present the assumptions of the performed EEG experiment. Detail aspects of the data sanitization including preprocessing,...
-
AUTOMATYCZNA KLASYFIKACJA MOWY PATOLOGICZNEJ
PublicationAplikacja przedstawiona w niniejszym rozdziale służy do automatycznego wykrywania mowy patologicznej na podstawie bazy nagrań. W pierwszej kolejności przedstawiono założenia leżące u podstaw przeprowadzonych badan wraz z wyborem bazy mowy patologicznej. Zaprezentowano również zastosowane algorytmy oraz cechy sygnału mowy, które pozwalają odróżnić mowę niezaburzoną od mowy patologicznej. Wytrenowane sieci neuronowe zostały następnie...
-
Face Profile View Retrieval Using Time of Flight Camera Image Analysis
PublicationMethod for profile view retrieving of the human face is presented. The depth data from the 3D camera is taken as an input. The preprocessing is, besides of standard filtration, extended by the process of filling of the holes which are present in depth data. The keypoints, defined as the nose tip and the chin are detected in user’s face and tracked. The Kalman filtering is applied to smooth the coordinates of those points which...
-
Processing of LiDAR and Multibeam Sonar Point Cloud Data for 3D Surface and Object Shape Reconstruction
PublicationUnorganised point cloud dataset, as a transitional data model in several applications, usually contains a considerable amount of undesirable irregularities, such as strong variability of local point density, missing data, overlapping points and noise caused by scattering characteristics of the environment. For these reasons, further processing of such data, e.g. for construction of higher order geometric models of the topography...
-
AN ALGORITHM FOR PORTAL HYPERTENSIVE GASTROPATHY RECOGNITION ON THE ENDOSCOPIC RECORDINGS
PublicationSymptoms recognition of portal hypertensive gastropathy (PHG) can be done by analysing endoscopic recordings, but manual analysis done by physician may take a long time. This increases probability of missing some symptoms and automated methods may be applied to prevent that. In this paper a novel hybrid algorithm for recognition of early stage of portal hypertensive gastropathy is proposed. First image preprocessing is described....
-
Analyzing the Effectiveness of the Brain–Computer Interface for Task Discerning Based on Machine Learning
PublicationThe aim of the study is to compare electroencephalographic (EEG) signal feature extraction methods in the context of the effectiveness of the classification of brain activities. For classification, electroencephalographic signals were obtained using an EEG device from 17 subjects in three mental states (relaxation, excitation, and solving logical task). Blind source separation employing independent component analysis (ICA) was...
-
Federated Learning in Healthcare Industry: Mammography Case Study
PublicationThe paper focuses on the role of federated learning in a healthcare environment. The experimental setup involved different healthcare providers, each with their datasets. A comparison was made between training a deep learning model using traditional methods, where all the data is stored in one place, and using federated learning, where the data is distributed among the workers. The experiment aimed to identify possible challenges...
-
Absorption spectroscopy setup for determination of whole human blood and blood–derived materials spectral characteristics
PublicationA dedicated absorption spectroscopy system was set up using tungsten-halogen broadband source, optical fibers, sample holder, and a commercial spectrometer with CCD array. Analysis of noise present in the setup was carried out. Data processing was applied to the absorption spectra to reduce spectral noise, and improve the quality of the spectra and to remove the baseline level. The absorption spectra were measured for whole blood...
-
Image Segmentation of MRI image for Brain Tumor Detection
Publicationthis research work presents a new technique for brain tumor detection by the combination of Watershed algorithm with Fuzzy K-means and Fuzzy C-means (KIFCM) clustering. The MATLAB based proposed simulation model is used to improve the computational simplicity, noise sensitivities, and accuracy rate of segmentation, detection and extraction from MR...
-
Comparison of Selected Neural Network Models Used for Automatic Liver Tumor Segmentation
PublicationAutomatic and accurate segmentation of liver tumors is crucial for the diagnosis and treatment of hepatocellular carcinoma or metastases. However, the task remains challenging due to imprecise boundaries and significant variations in the shape, size, and location of tumors. The present study focuses on tumor segmentation as a more critical aspect from a medical perspective, compared to liver parenchyma segmentation, which is the...
-
Predictions of cervical cancer identification by photonic method combined with machine learning
PublicationCervical cancer is one of the most commonly appearing cancers, which early diagnosis is of greatest importance. Unfortunately, many diagnoses are based on subjective opinions of doctors—to date, there is no general measurement method with a calibrated standard. The problem can be solved with the measurement system being a fusion of an optoelectronic sensor and machine learning algorithm to provide reliable assistance for doctors...
-
A Task-Scheduling Approach for Efficient Sparse Symmetric Matrix-Vector Multiplication on a GPU
PublicationIn this paper, a task-scheduling approach to efficiently calculating sparse symmetric matrix-vector products and designed to run on Graphics Processing Units (GPUs) is presented. The main premise is that, for many sparse symmetric matrices occurring in common applications, it is possible to obtain significant reductions in memory usage and improvements in performance when the matrix is prepared in certain ways prior to computation....
-
How personality traits, sports anxiety, and general imagery could influence the physiological response measured by SCL to imagined situations in sports?
Open Research DataThe data were collected to understand how individual differences in personality (e.g. neuroticism), general imagery, and situational sports anxiety are linked to arousal measuring with skin conductance level (SCL) in situational imagery (as scripted for sport-related scenes). Thirty persons participated in the study, aged between 14 and 42 years, with...
-
Face with Mask Detection in Thermal Images Using Deep Neural Networks
PublicationAs the interest in facial detection grows, especially during a pandemic, solutions are sought that will be effective and bring more benefits. This is the case with the use of thermal imaging, which is resistant to environmental factors and makes it possible, for example, to determine the temperature based on the detected face, which brings new perspectives and opportunities to use such an approach for health control purposes. The...
-
A CNN based coronavirus disease prediction system for chest X-rays
PublicationCoronavirus disease (COVID-19) proliferated globally in early 2020, causing existential dread in the whole world. Radiography is crucial in the clinical staging and diagnosis of COVID-19 and offers high potential to improve healthcare plans for tackling the pandemic. However high variations in infection characteristics and low contrast between normal and infected regions pose great challenges in preparing radiological reports....
-
Optimal selection of input features and an acompanying neural network structure for the classification purposes - skin lesions case study
PublicationMalignant melanomas are the most deadly type of skin cancers however detected early enough give a high chances for successful treatment. The last years saw the dynamic growth of interest of automatic computer-aided skin cancer diagnosis. Every month brings new research results on new approaches to this problem, new methods of preprocessing, new classifiers, new ideas to follow etc. In particular, the rapid development of dermatoscopy,...
-
Numerical Method for Stability Testing of Fractional Exponential Delay Systems
PublicationA numerical method for stability testing of fractional exponential systems including delays is presented in this contribution. We propose the numerical test of stability for a very general class of systems with a transfer function, which includes polynomials and exponentials of fractional powers of the Laplace variable s combined with delay terms. Such a system is unstable if any root of its characteristic equation, which usually...
-
Comparative Analysis of the Coffee and Cocoa Industry By-Products on the Performance of Polyethylene-Based Composites
PublicationThe application of plant-based by-products from the food industry as minimally processed functional fillers for polymeric composites is an increasingly popular trend among researchers and manufacturers. While minimizing the preprocessing of lignocellulosic fillers leads to an increase in the sustainability of the overall composite and a decrease of the carbon footprint, filler modification is usually indispensable to obtaining...
-
Comparative Analysis of the Coffee and Cocoa Industry By‑Products on the Performance of Polyethylene‑Based Composites
PublicationThe application of plant-based by-products from the food industry as minimally processed functional fillers for polymeric composites is an increasingly popular trend among researchers and manufacturers. While minimizing the preprocessing of lignocellulosic fillers leads to an increase in the sustainability of the overall composite and a decrease of the carbon footprint, filler modification is usually indispensable to obtaining...
-
Improving the Accuracy of Automatic Reconstruction of 3D Complex Buildings Models from Airborne Lidar Point Clouds
PublicationDue to high requirements of variety of 3D spatial data applications with respect to data amount and quality, automatized, effcient and reliable data acquisition and preprocessing methods are needed. The use of photogrammetry techniques—as well as the light detection and ranging (LiDAR) automatic scanners—are among attractive solutions. However, measurement data are in the form of unorganized point clouds, usually requiring transformation...
-
Deep Learning Optimization for Edge Devices: Analysis of Training Quantization Parameters
PublicationThis paper focuses on convolution neural network quantization problem. The quantization has a distinct stage of data conversion from floating-point into integer-point numbers. In general, the process of quantization is associated with the reduction of the matrix dimension via limited precision of the numbers. However, the training and inference stages of deep learning neural network are limited by the space of the memory and a...
-
A Survey on the Datasets and Algorithms for Satellite Data Applications
PublicationThis survey compiles insights and describes datasets and algorithms for applications based on remote sensing. The goal of this review is twofold: datasets review for particular groups of tasks and high-level steps of data flow between satellite instruments and end applications from an implementation and development perspective. The article outlines the generalized data processing pipelines, taking into account the variations in...
-
DIAGNOSIS OF MALIGNANT MELANOMA BY NEURAL NETWORK ENSEMBLE-BASED SYSTEM UTILISING HAND-CRAFTED SKIN LESION FEATURES
PublicationMalignant melanomas are the most deadly type of skin cancer but detected early have high chances for successful treatment. In the last twenty years, the interest of automated melanoma recognition detection and classification dynamically increased partially because of public datasets appearing with dermatoscopic images of skin lesions. Automated computer-aided skin cancer detection in dermatoscopic images is a very challenging task...
-
Statistical Data Pre-Processing and Time Series Incorporation for High-Efficacy Calibration of Low-Cost NO2 Sensor Using Machine Learning
PublicationAir pollution stands as a significant modern-day challenge impacting life quality, the environment, and the economy. It comprises various pollutants like gases, particulate matter, biological molecules, and more, stemming from sources such as vehicle emissions, industrial operations, agriculture, and natural events. Nitrogen dioxide (NO2), among these harmful gases, is notably prevalent in densely populated urban regions. Given...
-
Decoding imagined speech for EEG-based BCI
PublicationBrain–computer interfaces (BCIs) are systems that transform the brain's electrical activity into commands to control a device. To create a BCI, it is necessary to establish the relationship between a certain stimulus, internal or external, and the brain activity it provokes. A common approach in BCIs is motor imagery, which involves imagining limb movement. Unfortunately, this approach allows few commands. As an alternative, this...
-
Machine-learning-based precise cost-efficient NO2 sensor calibration by means of time series matching and global data pre-processing
PublicationAir pollution remains a considerable contemporary challenge affecting life quality, the environment, and economic well-being. It encompasses an array of pollutants—gases, particulate matter, biological molecules—emanating from sources such as vehicle emissions, industrial activities, agriculture, and natural occurrences. Nitrogen dioxide (NO2), a harmful gas, is particularly abundant in densely populated urban areas. Given its...
-
Active Annotation in Evaluating the Credibility of Web-Based Medical Information: Guidelines for Creating Training Data Sets for Machine Learning
PublicationMethods Results Discussion References Abbreviations Copyright Abstract Background: The spread of false medical information on the web is rapidly accelerating. Establishing the credibility of web-based medical information has become a pressing necessity. Machine learning offers a solution that, when properly deployed, can be an effective tool in fighting medical misinformation on the web. Objective: The aim of this study is to...
-
Underground Water Level Prediction in Remote Sensing Images Using Improved Hydro Index Value with Ensemble Classifier
PublicationThe economic sustainability of aquifers across the world relies on accurate and rapid estimates of groundwater storage changes, but this becomes difficult due to the absence of insitu groundwater surveys in most areas. By closing the water balance, hydrologic remote sensing measures offer a possible method for quantifying changes in groundwater storage. However, it is uncertain to what extent remote sensing data can provide an...
-
TF-IDF weighted bag-of-words preprocessed text documents from Simple English Wikipedia
Open Research DataThe SimpleWiki2K-scores dataset contains TF-IDF weighted bag-of-words preprocessed text documents (raw strings are not available) [feature matrix] and their multi-label assignments [label-matrix]. Label scores for each document are also provided for an enhanced multi-label KNN [1] and LEML [2] classifiers. The aim of the dataset is to establish a benchmark...