Filters
total: 494
filtered: 481
-
Catalog
Chosen catalog filters
Search results for: PARALLEL PERFORMANCE
-
Tuning matrix-vector multiplication on GPU
PublicationA matrix times vector multiplication (matvec) is a cornerstone operation in iterative methods of solving large sparse systems of equations such as the conjugate gradients method (cg), the minimal residual method (minres), the generalized residual method (gmres) and exerts an influence on overall performance of those methods. An implementation of matvec is particularly demanding when one executes computations on a GPU (Graphics...
-
Implementation of spatial/polarization diversity for improved-performance circularly polarized multiple-input-multiple-output ultra-wideband antenna
PublicationIn this paper, spatial and polarization diversities are simultaneously implemented in an ultra-wideband (UWB) multiple-input-multiple-output (MIMO) antenna to reduce the correlation between the parallel-placed radiators. The keystone of the antenna is systematically modified coplanar ground planes that enable excitation of circular polarization (CP). To realize one sense of circular polarization as well as ultra-wideband operation,...
-
Molecular Simulations Using Boltzmann’s Thermally Activated Diffusion - Implementation on ARUZ – Massively-parallel FPGA-based Machine
Publication -
Redundantly Actuated 3RRR Parallel Planar Manipulator - Numerical Analyses of its Dynamics Sensitivity on Modifications of its Platform’s Inertia Parameters
PublicationIn the paper, numerical analyses, as well as dynamics of a complex mechanism, are presented. Two objectives are crucial for the paper: inverse dynamic model is needed (dedicated to be use in the model predictive controller); an identification method is searched (some trajectory parameters are controlled, when specific trajectory is tracked under an open-loop model-based control), as selected parameters must be identified for the...
-
Modelling and simulation of GPU processing in the MERPSYS environment
PublicationIn this work, we evaluate an analytical GPU performance model based on Little's law, that expresses the kernel execution time in terms of latency bound, throughput bound, and achieved occupancy. We then combine it with the results of several research papers, introduce equations for data transfer time estimation, and finally incorporate it into the MERPSYS framework, which is a general-purpose simulator for parallel and distributed...
-
Investigation of Mechanical and Microstructural Properties of Welded Specimens of AA6061-T6 Alloy with Friction Stir Welding and Parallel Friction Stir Welding Methods
PublicationThe present study investigates the effect of two parameters of process type and tool offset on tensile, microhardness, and microstructure properties of AA6061-T6 aluminum alloy joints. Three methods of Friction Stir Welding (FSW), Advancing Parallel-Friction Stir Welding (AP-FSW), and Retreating Parallel-Friction Stir Welding (RP-FSW) were used. In addition, four modes of 0.5, 1, 1.5, and 2 mm of tool offset were used in two welding...
-
Air Pollution Research Based on Spider Web and Parallel Continuous Particulate Monitoring—A Comparison Study Coupled with Identification of Sources
Publication -
Improved conformational space annealing method to treat β-structure with the UNRES force-field and to enhance scalability of parallel implementation
Publication -
Checkpointing of Parallel MPI Applications using MPI One-sided API with Support for Byte-addressable Non-volatile RAM
PublicationThe increasing size of computational clusters results in an increasing probability of failures, which in turn requires application checkpointing in order to survive those failures. Traditional checkpointing requires data to be copied from application memory into persistent storage medium, which increases application execution time as it is usually done in a separate step. In this paper we propose to use emerging byte-addressable...
-
A Task-Scheduling Approach for Efficient Sparse Symmetric Matrix-Vector Multiplication on a GPU
PublicationIn this paper, a task-scheduling approach to efficiently calculating sparse symmetric matrix-vector products and designed to run on Graphics Processing Units (GPUs) is presented. The main premise is that, for many sparse symmetric matrices occurring in common applications, it is possible to obtain significant reductions in memory usage and improvements in performance when the matrix is prepared in certain ways prior to computation....
-
A Concept of Modeling and Optimization of Applications in Large Scale Systems
PublicationThe chapter presents the idea that includes modeling and subsequent optimization of application execution on large scale parallel and distributed systems. The model considers performance, reliability and power consumption. It should allow easy modeling of various classes of applications while reflecting key parameters of both the applications and two classes of target systems: clusters and volunteer based systems. The chapter presents...
-
A new look at the statistical identification of nonstationary systems
PublicationThe paper presents a new, two-stage approach to identification of linear time-varying stochastic systems, based on the concepts of preestimation and postfiltering. The proposed preestimated parameter trajectories are unbiased but have large variability. Hence, to obtain reliable estimates of system parameters, the preestimated trajectories must be further filtered (postfiltered). It is shown how one can design and optimize such...
-
Grid Implementation of a Parallel Multiobjective Genetic Algorithm for Optimized Allocation of Chlorination Stations in Drinking Water Distribution Systems: Chojnice Case Study
PublicationSolving multiobjective optimization problems requires suitable algorithms to find a satisfactory approximation of a globally optimal Pareto front. Furthermore, it is a computationally demanding task. In this paper, the grid implementation of a distributed multiobjective genetic algorithm is presented. The distributed version of the algorithm is based on the island algorithm with forgetting island elitism used instead of a genetic...
-
Variable-structure algorithm for identification of quasi-periodically varying systems
PublicationThe paper presents a variable-structure version of a generalized notchfiltering (GANF) algorithm. Generalized notch filters are used for identification of quasi-periodically varying dynamic systems and can be considered an extension, to the system case, of classical adaptive notch filters. The proposed algorithm is a cascade of two GANF filters: a multiple-frequency "precise" filter bank, used for precise system tracking, and a...
-
The Use of Photographs in the Teaching/Learning of Descriptive Geometry
PublicationThe article presents the concept of enriching the Descriptive Geometry course with photographs and several simplified real-life engineering tasks. The photographic images used for the exercises are tightly linked to engineering structures, the given specialization and the surrounding world. The photo image as a record of central projection of a real space can be useful for presentation and analysis of the properties of perspective....
-
Design of a microrobotic wrist for needle laparoscopic surgery
PublicationThe paper addresses the design of a micro wrist for needle laparoscopic surgery (needlescopy) using MEMS technology and an original 3 degree of freedom, 3D architecture. Advancement in needlescopy drives the development of multi-dof micro-tools 1-2mmin diameter with 3D mobility but standard available fabricationtechniques are for 2.5D structures. The paper discusses thedevelopment steps and design solutions for the realization...
-
Identification of nonstationary multivariate autoregressive processes– Comparison of competitive and collaborative strategies for joint selection of estimation bandwidth and model order
PublicationThe problem of identification of multivariate autoregressive processes (systems or signals) with unknown and possibly time-varying model order and time-varying rate of parameter variation is considered and solved using parallel estimation approach. Under this approach, several local estimation algorithms, with different order and bandwidth settings, are run simultaneously and compared based on their predictive performance. First,...
-
A self-optimization mechanism for generalized adaptive notch smoother
PublicationTracking of nonstationary narrowband signals is often accomplished using algorithms called adaptive notch filters (ANFs). Generalized adaptive notch smoothers (GANSs) extend the concepts of adaptive notch filtering in two directions. Firstly, they are designed to estimate coefficients of nonstationary quasi-periodic systems, rather than signals. Secondly, they employ noncausal processing, which greatly improves their accuracy and...
-
Enhancing Resilience of FSO Networks to Adverse Weather Conditions
PublicationOptical wireless networks realized by means of gigabit optical wireless communication (OWC) systems are becoming, in a variety of applications, an important alternative, or a complementary solution, to their fiber-based counterparts. However, performance of the OWC systems can be considerably degraded in periods of unfavorable weather conditions, such as heavy fog, which temporarily reduce the effective capacity of the network....
-
Parallelization of large vector similarity computations in a hybrid CPU+GPU environment
PublicationThe paper presents design, implementation and tuning of a hybrid parallel OpenMP+CUDA code for computation of similarity between pairs of a large number of multidimensional vectors. The problem has a wide range of applications, and consequently its optimization is of high importance, especially on currently widespread hybrid CPU+GPU systems targeted in the paper. The following are presented and tested for computation of all vector...
-
Multi-source-supplied parallel hybrid propulsion of the inland passenger ship STA.H. Research work on energy efficiency of a hybrid propulsion system operating in the electric motor drive mode
PublicationIn the Faculty of Ocean Engineering and Ship Technology, Gdansk University of Technology, design has recently been developed of a small inland ship with hybrid propulsion and supply system. The ship will be propelled by a specially designed so called parallel hybrid propulsion system. The work was aimed at carrying out the energy efficiency analysis of a hybrid propulsion system operating in the electric motor drive mode and at...
-
Multiprocessor implementation of Parallel Multiobjective Genetic Algorithm for Optimized Allocation of Chlorination Stations in Drinking Water Distribution System - a new water quality model approach
Publication -
Multiprocessor Implementation of Parallel Multiobjective Genetic Algorithm for Optimized Allocation of Chlorination Stations in Drinking Water Distribution System a New Water Quality Model Approach
PublicationThe Critical Infrastructure Systems (CISs) have received in recent years a considerable attention due to their heavy impact on sustainable development of modern societies. Most CISs may be classied as large scale complex systems of network structure, in uenced by strong interactions form the surrounding environment, internal and external interconnections. The later is a result of inter-CIS dependencies. The control, monitoring...
-
Bioaugmentation of a sequencing batch reactor with Archaea for the treatment of reject water
PublicationIn this study, the bioaugmentation of a sequencing batch reactor (SBR) for the treatment of reject water from wastewater treatment plant was evaluated. For the bioaugmentation step a product containing an enrichment of microorganisms from the Archaea domain was used to enhance the performance of the reactor for treating reject water. The experiment was carried out in two parallel lab-scale sequencing batch reactors. The first one...
-
Process arrival pattern aware algorithms for acceleration of scatter and gather operations
PublicationImbalanced process arrival patterns (PAPs) are ubiquitous in many parallel and distributed systems, especially in HPC ones. The collective operations, e.g. in MPI, are designed for equal process arrival times (PATs), and are not optimized for deviations in their appearance. We propose eight new PAP-aware algorithms for the scatter and gather operations. They are binomial or linear tree adaptations introducing additional process...
-
Pulsed Laser Deposition of Bismuth Vanadate Thin Films—The Effect of Oxygen Pressure on the Morphology, Composition, and Photoelectrochemical Performance
PublicationThin layers of bismuth vanadate were deposited using the pulsed laser deposition technique on commercially available FTO (fluorine-doped tin oxide) substrates. Films were sputtered from a sintered, monoclinic BiVO4 pellet, acting as the target, under various oxygen pressures (from 0.1 to 2 mbar), while the laser beam was perpendicular to the target surface and parallel to the FTO substrate. The oxygen pressure strongly affects...
-
Repair Augmentation of Unstable, Complete Vertical Meniscal Tears With Bone Marrow Venting Procedure: A Prospective, Randomized, Double-Blind, Parallel-Group, Placebo-Controlled Study
Publication -
Distributed NVRAM Cache – Optimization and Evaluation with Power of Adjacency Matrix
PublicationIn this paper we build on our previously proposed MPI I/O NVRAM distributed cache for high performance computing. In each cluster node it incorporates NVRAMs which are used as an intermediate cache layer between an application and a file for fast read/write operations supported through wrappers of MPI I/O functions. In this paper we propose optimizations of the solution including handling of write requests with a synchronous mode,...
-
A Regular Expression Matching Application with Configurable Data Intensity for Testing Heterogeneous HPC Systems
PublicationModern High Performance Computing (HPC) systems are becoming increasingly heterogeneous in terms of utilized hardware, as well as software solutions. The problems, that we wish to efficiently solve using those systems have different complexity, not only considering magnitude, but also the type of complexity: computation, data or communication intensity. Developing new mechanisms for dealing with those complexities or choosing an...
-
Behavior Analysis and Dynamic Crowd Management in Video Surveillance System
PublicationA concept and practical implementation of a crowd management system which acquires input data by the set of monitoring cameras is presented. Two leading threads are considered. First concerns the crowd behavior analysis. Second thread focuses on detection of a hold-ups in the doorway. The optical flow combined with soft computing methods (neural network) is employed to evaluate the type of crowd behavior, and fuzzy logic aids detection...
-
Single and Dual-GPU Generalized Sparse Eigenvalue Solvers for Finding a Few Low-Order Resonances of a Microwave Cavity Using the Finite-Element Method
PublicationThis paper presents two fast generalized eigenvalue solvers for sparse symmetric matrices that arise when electromagnetic cavity resonances are investigated using the higher-order finite element method (FEM). To find a few loworder resonances, the locally optimal block preconditioned conjugate gradient (LOBPCG) algorithm with null-space deflation is applied. The computations are expedited by using one or two graphical processing...
-
Implementation of Molecular Dynamics and Its Extensions with the Coarse-Grained UNRES Force Field on Massively Parallel Systems: Toward Millisecond-Scale Simulations of Protein Structure, Dynamics, and Thermodynamics
Publication -
Modeling of a Quasi-Resonant DC Link Inverter Dedicated to Common-Mode Voltage and Ground Current Reduction
PublicationIn this paper, the modeling methodology of the AC drive system with a Parallel Quasi-Resonant DC Link Inverter (PQRDCLI) is described. A presented modeling approach is an attractive tool used for the effective evaluation of a common-mode (CM) voltage and grounds current reduction methods. Designed models of inverter, induction machine (IM), and cable are simple, thus the methods for parameter extraction are not complicated. Verification...
-
Empirical investigation on labour market interactions in an enlarged Europe
PublicationThis paper proposes an empirical assessment of economic interactions between the labour markets ofthe integrating EU over the period of time 1995-2005. Drawing on recently made available industrystatistics, we provide a sector level study (13 tradable sectors, including manufacturing and services),analysing the contemporary evolution of domestic and trade partners' employment levels. Given theintensification of trade relations...
-
Low-Power WSN System for Honey Bee Monitoring
PublicationThe paper presents a universal low-power system for biosensory data acquisition in scope of bees monitoring. We describe the architecture of the system, energy-saving components as well as we discuss the selection of used sensors. The work focuses on energy optimization in a scope of wireless communication. A custom protocol was implemented, which is the basis for presented energy-efficient devices. Data exchange process during...
-
Author Reply to “Regarding ‘Repair Augmentation of Unstable, Complete Vertical Meniscal Tears With Bone Marrow Venting Procedure: A Prospective, Randomized, Double-Blind, Parallel-Group, Placebo-Controlled Study’”
Publication -
Short-Term Outcomes of Percutaneous Trephination with a Platelet Rich Plasma Intrameniscal Injection for the Repair of Degenerative Meniscal Lesions. A Prospective, Randomized, Double-Blind, Parallel-Group, Placebo-Controlled Study
Publication -
Graphical presentation of the power of energy losses and power developed in the elements hydrostatic drive and control system. Part II. Rotational hydraulic motor speed parallel throtling control and volumetric control systems
PublicationPrzedstawiono interpretację graficzną mocy strat energetycznych występujących w elementach układów napędu i sterowania hydrostatycznego, a także mocy rozwijanych przez te elementy. Dokonano analizy układu indywidualnego ze sterowaniem dławieniowym równoległym prędkości silnika hydraulicznego obrotowego, układu indywidualnego ze sterowaniem objętościowym, pompą o zmiennej wydajności, prędkości silnika hydrailicznego obrotowego,...
-
Anterior Cruciate Ligament Reconstruction Using a 4-Strand Semitendinosus Tendon Graft or a Doubled Semitendinosus and Gracilis Tendon Graft: A 4.5-Year Prospective, Randomized, Double-Blind, Parallel-Group Study
Publication -
On the possible increasing of efficiency of ship power plant with the system combined of marine diesel engine, gas turbine and steam turbine in case of main engine cooperation with the gas turbine fed in parallel and the steam turbine
PublicationW pracy przedstawiono koncepcję układu kombinowanego okrętowego dużej mocy złożonego z wiodącego silnika głównego spalinowego silnika tłokowego i skojarzonych z nim turbiny gazowej mocy i układu turbiny parowej, wykorzystujących energię zawartą w spalinach wylotowych głównego silnika spalinowego. Rozpatrzono wariant układu kombinowanego, w którym główny jest silnik tłokowy skojarzony z turbiną gazową mocy i układem turbiny parowej,...
-
Efficacy, pharmacokinetics, and safety of the biosimilar CT-P10 compared with rituximab in patients with previously untreated advanced-stage follicular lymphoma: a randomised, double-blind, parallel-group, non-inferiority phase 3 trial
Publication -
Effect of asymmetric fluid flow distribution on flow boiling in a microchannel heat sink – An experimental investigation
PublicationFlow boiling in microchannels is emerging as an exclusive cooling solution for miniaturized high-power electronic devices alongside having other high heat flux applications. Size miniaturization at microscale strangely increases heat transfer performance as well as flow boiling instabilities. Many flow boiling instabilities are interrelated and result from imperfect hydrodynamic conditions. One of such problems is flow maldistribution among...
-
Thermal and technological aspects of double face grinding of Al2O3 ceramic materials
PublicationDouble face grinding with planetary kinematics is a process to manufacture workpieces with plan parallel functional surfaces, such as bearing rings or sealing shims. In order to increase the economic efficiency of this process, it has to be advanced permanently. The temperature in the contact zone of most grinding processes has a huge influence on the process efficiency and the workpiece qualities. In contrast to most grinding...
-
A Series-Inclined-Slot-Fed Circularly Polarized Antenna for 5G 28-GHz Applications
PublicationThis letter presents the design of a single-point-fed, geometrically simple circularly polarized (CP) antenna for 28 GHz Ka-band applications. The proposed antenna is based on a straight microstrip line printed on one side and coupled with the nearly square patches through a 45-degree inclined V-shape slot aperture on the other side. In order to generate circular polarization, the fundamental radiating mode is degenerated at a...
-
Technical Limitations in Merging Secular and Sacred Functions in Monumental Churches
PublicationThe abandonment of churches and their adaptation for secular purposes is a current subject in Europe and worldwide. Most cases involve objects that were desacralized and then rebuilt as a whole object for alternative functions. Thus far, the merging of secular and sacred functions in one monumental Catholic church has not raised any issues. The paper describes the case of St. Catherine’s Church in Gdansk, Poland, where sacred function...
-
Psychometric properties of the Bern illegitimate tasks scale using classical test and item response theories
PublicationCombining a classical test theory and an item response theory (IRT), this study aimed to investigate the psychometric properties of the Bern Illegitimate Tasks Scale (BITS) by measuring two conceptually separate dimensions capturing unnecessary tasks (perceived by employees as pointless) and unreasonable tasks (perceived as unfairly or inappropriately assigned). Data collected among Polish employees in two samples (N= 965 and N=...
-
Assessment of baby disposable diapers application for urine collection and determination of phthalate metabolites
PublicationThe baby disposable diapers were investigated as a sampling material for urine collection and validated for the evaluation of the exposure of children to xenobiotics. Phthalate metabolites detected in urine samples were chosen as proof-of-concept analytes. For the determination of phthalate metabolites in children’s urine samples, high performance liquid chromatography coupled with tandem mass spectrometry (HPLC-MS/MS) was used. Two...
-
Multi-layered tissue head phantoms for noninvasive optical diagnostics
PublicationExtensive research in the area of optical sensing for medical diagnostics requires development of tissue phantoms with optical properties similar to those of living human tissues. Development and improvement of in vivo optical measurement systems requires the use of stable tissue phantoms with known characteristics, which are mainly used for calibration of such systems and testing their performance over time. Optical and mechanical...
-
Dynamic coloring of graphs
PublicationDynamics is an inherent feature of many real life systems so it is natural to define and investigate the properties of models that reflect their dynamic nature. Dynamic graph colorings can be naturally applied in system modeling, e.g. for scheduling threads of parallel programs, time sharing in wireless networks, session scheduling in high-speed LAN's, channel assignment in WDM optical networks as well as traffic scheduling. In...
-
A GPU Solver for Sparse Generalized Eigenvalue Problems with Symmetric Complex-Valued Matrices Obtained Using Higher-Order FEM
PublicationThe paper discusses a fast implementation of the stabilized locally optimal block preconditioned conjugate gradient (sLOBPCG) method, using a hierarchical multilevel preconditioner to solve nonHermitian sparse generalized eigenvalue problems with large symmetric complex-valued matrices obtained using the higher-order finite-element method (FEM), applied to the analysis of a microwave resonator. The resonant frequencies of the low-order...