Search results for: PARALLEL ALGORITHMS
-
Testing for conformance of parallel programming pattern languages
PublicationThis paper reports on the project being run by TUG and IMAG, aimed at reducing the volume of tests required to exercise parallel programming language compilers and libraries. The idea is to use the ISO STEP standard scheme for conformance testing of software products. A detailed example illustrating the ongoing work is presented.
-
Bounds on the Cover Time of Parallel Rotor Walks
PublicationThe rotor-router mechanism was introduced as a deterministic alternative to the random walk in undirected graphs. In this model, a set of k identical walkers is deployed in parallel, starting from a chosen subset of nodes, and moving around the graph in synchronous steps. During the process, each node maintains a cyclic ordering of its outgoing arcs, and successively propagates walkers which visit it along its outgoing arcs in...
-
Mechanism of recognition of parallel G-quadruplexes by DEAH/RHAU helicase DHX36 explored by molecular dynamics simulations
PublicationBecause of high stability and slow unfolding rates of G-quadruplexes (G4), cells have evolved specialized helicases that disrupt these non-canonical DNA and RNA structures in an ATP-dependent manner. One example is DHX36, a DEAH-box helicase, which participates in gene expression and replication by recognizing and unwinding parallel G4s. Here, we studied the molecular basis for the high affinity and specificity of DHX36 for parallel-type...
-
Machine Learning in Multi-Agent Systems using Associative Arrays
PublicationIn this paper, a new machine learning algorithm for multi-agent systems is introduced. The algorithm is based on associative arrays, thus it becomes less complex and more efficient substitute of artificial neural networks and Bayesian networks, which is confirmed by performance measurements. Implementation of machine learning algorithm in multi-agent system for aided design of selected control systems allowed to improve the performance...
-
Acceleration of the discrete Green's function computations
PublicationResults of the acceleration of the 3-D discrete Green's function (DGF) computations on the multicore processor are presented. The code was developed in the multiple precision arithmetic with use of the OpenMP parallel programming interface. As a result, the speedup factor of three orders of magnitude compared to the previous implementation was obtained thus applicability of the DGF in FDTD simulations was significantly improved.
-
Propagation in rectangular waveguides with a pseudochiral Ω slab
PublicationThe transfer matrix approach is applied for analysis of waveguides loaded with a uniaxial pseudochiral Ω slab. In particular a pseudochiral parallel plate and rectangular guides are investigated. Based on the numerical analysis the influence of the pseudochirality on propagation characteristics and field distribution are examined. Other feature such as a field displacement phenomenon appearing in the both considered structures...
-
Auto-tuning methodology for configuration and application parameters of hybrid CPU + GPU parallel systems based on expert knowledge
PublicationAuto-tuning of configuration and application param- eters allows to achieve significant performance gains in many contemporary compute-intensive applications. Feasible search spaces of parameters tend to become too big to allow for exhaustive search in the auto-tuning process. Expert knowledge about the utilized computing systems becomes useful to prune the search space and new methodologies are needed in the face of emerging heterogeneous...
-
Three levels of fail-safe mode in MPI I/O NVRAM distributed cache
PublicationThe paper presents architecture and design of three versions for fail-safe data storage in a distributed cache using NVRAM in cluster nodes. In the first one, cache consistency is assured through additional buffering write requests. The second one is based on additional write log managers running on different nodes. The third one benefits from synchronization with a Parallel File System (PFS) for saving data into a new file which...
-
Steam turbines governors in power system restoration process
PublicationThe paper discusses problems related to electric power system restoration process. The turbine controller operating mode influence in small subsystem is analyzed. There are considered abilities, advantages and drawbacks of the controller's two operating modes: power control and speed (frequen-cy) control, to rebuild electric power system from a single generating unit to a few parallel running generators.
-
Two-Stage Identification of Locally Stationary Autoregressive Processes and its Application to the Parametric Spectrum Estimation
PublicationThe problem of identification of a nonstationary autoregressive process with unknown, and possibly time-varying, rate of parameter changes, is considered and solved using the parallel estimation approach. The proposed two-stage estimation scheme, which combines the local estimation approach with the basis function one, offers both quantitative and qualitative improvements compared with the currently used single-stage methods.
-
New approach to noncausal identification of nonstationary stochastic systems subject to both smooth and abrupt parameter changes
PublicationIn this paper we consider the problem of finiteintervalparameter smoothing for a class of nonstationary linearstochastic systems subject to both smooth and abrupt parameterchanges. The proposed parallel estimation scheme combines theestimates yielded by several exponentially weighted basis functionalgorithms. The resulting smoother automatically adjustsits smoothing bandwidth to the type and rate of nonstationarityof the identified...
-
Efficient parallel implementation of crowd simulation using a hybrid CPU+GPU high performance computing system
PublicationIn the paper we present a modern efficient parallel OpenMP+CUDA implementation of crowd simulation for hybrid CPU+GPU systems and demonstrate its higher performance over CPU-only and GPU-only implementations for several problem sizes including 10 000, 50 000, 100 000, 500 000 and 1 000 000 agents. We show how performance varies for various tile sizes and what CPU–GPU load balancing settings shall be preferred for various domain...
-
Unusual divergence of magnetoacoustic beams
PublicationTwo-dimensional magnetosonic beams directed along a line forming a constant angle h with the equilibrium straight magnetic field are considered. Perturbations in a plasma are described by the system of ideal magnetohydrodynamic equations. The dynamics of perturbations in a beam are different in the cases of fast and slow modes, and it is determined by h and equilibrium parameters of a plasma. In particular, a beam divergence may...
-
10-Methyl- and 9,10-dimethyl acridinium methyl sulfate
PublicationThe title compounds, C(14)H(12)N(+).CH(3)O(4)S(-), (I), and C(15)H(14)N(+).CH(3)O(4)S(-), (II), respectively, crystallize with the planar 10-methylacridinium or 9,10-dimethylacridinium cations arranged in layers, parallel to the twofold axis in (I) and perpendicular to the 2(1) axis in (II). Adjacent cations in both compounds are packed in a 'head-to-tail' manner. The methyl sulfate anion only exhibits planar symmetry in (II)....
-
Network-aware Data Prefetching Optimization of Computations in a Heterogeneous HPC Framework
PublicationRapid development of diverse computer architectures and hardware accelerators caused that designing parallel systems faces new problems resulting from their heterogeneity. Our implementation of a parallel system called KernelHive allows to efficiently run applications in a heterogeneous environment consisting of multiple collections of nodes with different types of computing devices. The execution engine of the system is open for...
-
A CMOS Pixel With Embedded ADC, Digital CDS and Gain Correction Capability for Massively Parallel Imaging Array
PublicationIn the paper, a CMOS pixel has been proposed for imaging arrays with massively parallel image acquisition and simultaneous compensation of dark signal nonuniformity (DSNU) as well as photoresponse nonuniformity (PRNU). In our solution the pixel contains all necessary functional blocks: a photosensor and an analog-to-digital converter (ADC) with built-in correlated double sampling (CDS) integrated together. It is implemented in...
-
The evaluation of the vibration measurement usability of electronic indicator lemag "premet C"
PublicationThe measuring possibilities of modern compression and combustion pressure analyzers are extended with additional functions. One of them is parallel to the pressure measurement, the measurement of vibrations in the region of the cylinder head. The paper presents a general assessment of the vibration measurement function of the electronic indicator LEMAG "PREMET C". This feature is very rarely offered by manufacturers of these devices....
-
A New Method of Noncausal Identification of Time-varying Systems
PublicationThe paper shows that the problem of noncausal identification of a time-varying FIR (finite impulse response) sys- tem can be reformulated, and solved, as a problem of smoothing of the preestimated parameter trajectories. Characteristics of the smoothing filter should be chosen so as to provide the best trade- off between the bias and variance of the resulting estimates. It is shown that optimization of the smoothing operation can...
-
Parallel immune system for graph coloring
PublicationThis paper presents a parallel artificial immune system designed forgraph coloring. The algorithm is based on the clonal selection principle. Each processor operates on its own pool of antibodies and amigration mechanism is used to allow processors to exchange information. Experimental results show that migration improves the performance of the algorithm. The experiments were performed using a high performance cluster on a set...
-
Fully Adaptive Savitzky-Golay Type Smoothers
PublicationThe problem of adaptive signal smoothing is consid-ered and solved using the weighted basis function approach. Inthe special case of polynomial basis and uniform weighting theproposed method reduces down to the celebrated Savitzky-Golaysmoother. Data adaptiveness is achieved via parallel estimation.It is shown that for the polynomial and harmonic bases andcosinusoidal weighting sequences, the competing signal estimatescan be computed...
-
Channel Blockage and Flow Maldistribution during Unsteady Flow in a Model Microchannel Plate heat Exchanger
PublicationThis paper describes the problem of channel blockage as a result of flow maldistribution between the channels of a model mini channel plate heat exchanger consisting of one pass on each leg. Each leg of the heat exchanger contains 51 parallel and rectangular minichannels of four hydraulic diameters namely 461 μm, 571 μm, 750 μm and 823 μm. In addition, a more complex geometry has been investigated where for the sake of breaking...
-
Edge-Guided Mode Performance and Applications in Nonreciprocal Millimeter-Wave Gyroelectric Components
PublicationThe analogies between the behavior of gyromagnetic and gyroelectric nonreciprocal structures, the use of the simple transfer matrix approach, and the edge-guided (EG) wave property, supported in a parallel plate model for integrated magnetized semiconductor waveguide, are investigated in those frequency regions, where the effective permittivity is negative or positive. As with their ferrite counterparts, the leakage of the EG waves...
-
Infrared techniques for natural convection investigations in channels between two vertical, parallel, isothermal and symmetrically heated plates
PublicationThe effect of the gap width between two symmetrically heated vertical, parallel, isothermal plates on intensity of natural convective heat transfer in a gas (Pr = 0.71) was experimentally studied using the balance and gradient methods. In the former method heat fluxes were determined based on measurements of the voltage and electric current supplying the heaters placed inside the walls. In the latter, heat fluxes were calculated...
-
Modeling the effect of parasitic capacitances on the dead-time distortion in multilevel NPC inverters
PublicationA simple model is derived and verified for evaluating the effect of parasitic capacitances on the dead-time related voltage distortion in multilevel NPC voltage source inverters. The model permits well-defined and precise compensation of dead-time distortion, exhibiting meaningful improvement on compensation methods neglecting the effects of parasitic capacitances. A simple formula is given for evaluating the capacitances as serial/parallel...
-
OpenGL accelerated method of the material matrix generation for FDTD simulations
PublicationThis paper presents the accelerated technique of the material matrix generation from CAD models utilized by the finite-difference time-domain (FDTD) simulators. To achieve high performance of these computations, the parallel-processing power of a graphics processing unit was employed with the use of the OpenGL library. The method was integrated with the developed FDTD solver, providing approximately five-fold speedup of the material...
-
The bridge over Regalia River in Szczecin - design and construction.
PublicationNowoclowa Route is the largest projekt in Szczecin, that consists of 11 km of roads 3,3 km bridges and viaducts, including three parallel bridges across Regalica River (the east arm of Odra River), 535 m long, with the spans: 59+90+90+116+116+64m. The bridge structure consist of two steel plate girders composite with reinforced concrete deck slab. The design and the construction of the bridge are described in the paper.
-
From Sequential to Parallel Implementation of NLP Using the Actor Model
PublicationThe article focuses on presenting methods allowing easy parallelization of an existing, sequential Natural Language Processing (NLP) application within a multi-core system. The actor-based solution implemented with the Akka framework has been applied and compared to an application based on Task Parallel Library (TPL) and to the original sequential application. Architectures, data and control flows are described along with execution...
-
Structural and dynamic insights on the EmrE protein with TPP+ and related substrates through molecular dynamics simulations
PublicationEmrE is a bacterial transporter protein that forms an anti-parallel homodimer with four transmembrane helices in each monomer. EmrE transports positively charged aromatic compounds, such as TPP+ and its derivatives. We performed molecular dynamics (MD) simulations of EmrE in complex with TPP+, MeTPP+, and MBTPP+ embedded in a membrane. The detailed molecular properties and interactions were analysed for all EmrE-ligand complexes....
-
Marek Kubale prof. dr hab. inż.
PeopleDetails concerning: Qualifications, Experiences, Editorial boards, Ph.D. theses supervised, Books, and Recent articles can be found at http://eti.pg.edu.pl/katedra-algorytmow-i-modelowania-systemow/Marek_KubaleGoogle ScholarSylwetka prof. Marka Kubalego Prof. Marek Kubale pracuje na Wydziale ETI Politechniki Gdańskiej nieprzerwanie od roku 1969. W tym czasie napisał ponad 150 prac naukowych, w tym ponad 40 z listy JCR. Ponadto...
-
General Provisioning Strategy for Local Specialized Cloud Computing Environments
PublicationThe well-known management strategies in cloud computing based on SLA requirements are considered. A deterministic parallel provisioning algorithm has been prepared and used to show its behavior for three different requirements: load balancing, consolidation, and fault tolerance. The impact of these strategies on the total execution time of different sets of services is analyzed for randomly chosen sets of data. This makes it possible...
-
Degradation of polyurethanes in Compost Under Natural Conditions
PublicationThe estimation of degradibility of different polyurethanes under natural weather depending conditions in compost pile was the subject of the studies. The incubation of polymer samples took place for a period up to 24 months. The characteristic parameters of the compost: temperature, pH, moisture content, and activity of dehydrogenasis were monitored and their influence on degradation of polyuiretahnes was discussed. The compostability...
-
Overflowing tests at the Polish DredgDikes research dike – stability of the dike surface against erosion
PublicationIn the project DredgDikes the different research dike embankments were tested with respect to overflowing water induced erosion. Therefore, flumes were installed on the land side embankments in which the effect of overflowing water on the vegetated surface was investigated. On the Polish DredgDikes research dike near Gdansk, Poland, two parallel flumes were installed and the surface of the dike made of different mixtures of...
-
Modern Arrangement for Reduction of Voltage Perturbations
PublicationThe contents of this chapter encompass general problems and the most important issues of power-supply-quality improvement in AC systems. In the context of the above, consideration is given to evaluation of bilateral interactions of receivers with an electrical power-distribution system and methods of their reduction. Also are discussed the basis of operation of the most important compensation-filtration devices and their applications...
-
The chapter analyses the K-Means algorithm in its parallel setting. We provide detailed description of the algorithm as well as the way we paralellize the computations. We identified complexity of the particular steps of the algorithm that allows us to build the algorithm model in MERPSYS system. The simulations with the MERPSYS have been performed for different size of the data as well as for different number of the processors used for the computations. The results we got using the model have been compared to the results obtained from real computational environment.
PublicationThe chapter analyses the K-Means algorithm in its parallel setting. We provide detailed description of the algorithm as well as the way we paralellize the computations. We identified complexity of the particular steps of the algorithm that allows us to build the algorithm model in MERPSYS system. The simulations with the MERPSYS have been performed for different size of the data as well as for different number of the processors used...
-
Improving web user experience with caching user interface
PublicationOften, Web technologies are used to operate or to configure network-enabled equipment, to configure and administer modular applications, or as teaching environments. The comfort of human work requires a similar response time in these applications as in the Internet. To improve response time, various forms of caching at different levels are employed. To improve the user experience in regard to response time when performing specific...
-
On the preestimation technique and its application to identification of nonstationary systems
PublicationThe problem of noncausal identification of a nonstationary stochastic FIR (finite impulse response) sys- tem is reformulated, and solved, as a problem of smoothing of preestimated parameter trajectories. Three approaches to preestimation are critically analyzed and compared. It is shown that optimization of the smoothing operation can be performed adaptively using the parallel estimation technique. The new approach is computationally...
-
Seawater intrusion due to pumping mitigated by natural freshwater flux: a case study in Władysławowo, northern Poland
PublicationThe paper presents a case study of seawater intrusion into a coastal aquifer, caused by a groundwater intake located close to the seashore in Władysławowo, northern Poland. Evolution of the basic hydrogeochemical parameters for the 50-year period from 1964 to 2014 indicates progressing encroachment of saline seawater into the aquifer. However, the spatial pattern of salinity was influenced by the variability of hydraulic gradient...
-
An Ultra-Low-Energy Analog Comparator for A/D Converters in CMOS Image Sensors
PublicationThis paper proposes a new solution of an ultra-low-energy analog comparator, dedicated to slope analog-to-digital converters (ADC), particularly suited for CMOS image sensors (CISs) featuring a large number of ADCs. For massively parallel imaging arrays, this number may be as high as tens-hundreds of thousands ADCs. As each ADC includes an analog comparator, the number of these comparators in CIS is always high. Detailed analysis...
-
A multithreaded CUDA and OpenMP based power‐aware programming framework for multi‐node GPU systems
PublicationIn the paper, we have proposed a framework that allows programming a parallel application for a multi-node system, with one or more GPUs per node, using an OpenMP+extended CUDA API. OpenMP is used for launching threads responsible for management of particular GPUs and extended CUDA calls allow to manage CUDA objects, data and launch kernels. The framework hides inter-node MPI communication from the programmer who can benefit from...
-
BUILDINGS AND CONSTRUCTIONS FROM BUILDING WASTE MATERIALS IN ACCORDANCE WITH THE NEW CIRCULAR ECONOMY TREND
PublicationMATERIALS April 18-22, 2022 / BOSTON, MA Abstract Book 3rd INTERNATIONAL CONFERENCE ON MATERIALS SCIENCE & ENGINEERING Day 5 Friday, April 22, 2022 Parallel Sessio II- (Virtual EST Zone) pp. 154 3rd INTERNATIONAL CONFERENCE ON MATERIALS SCIENCE & ENGINEERING APRIL 18-22, 2022 | BOSTON, MAAt: Boston, MAAffiliation: UNICEF at: USA BOSTONVolume: 2022
-
Sensorless predictive control of three-phase parallel active filter
PublicationThe paper presents the control system of parallel active power filter (APF) with predictive reference current calculation and model based predictive current control. The novel estimator and predictor of grid emf is proposed for AC voltage sensorless operation of APF, regardless of distortion of this voltage. Proposed control system provides control of APF current with high precision and dynamics limited only by filter circuit parameters....
-
Single and Series of Multi-valued Decision Diagrams in Representation of Structure Function
PublicationStructure function, which defines dependency of performance of the system on performance of its components, is a key part of system description in reliability analysis. In this paper, we compare two approaches for representation of the structure function. The first one is based on use of a single Multi-valued Decision Diagram (MDD) and the second on use of a series of MDDs. The obtained results indicate that the series of MDDs...
-
Multi-source-supplied parallel hybrid propulsion of the inland passenger ship STA.H. Research work on energy efficiency of a hybrid propulsion system operating in the electric motor drive mode
PublicationIn the Faculty of Ocean Engineering and Ship Technology, Gdansk University of Technology, design has recently been developed of a small inland ship with hybrid propulsion and supply system. The ship will be propelled by a specially designed so called parallel hybrid propulsion system. The work was aimed at carrying out the energy efficiency analysis of a hybrid propulsion system operating in the electric motor drive mode and at...
-
Experiences from operation of various expansion devices in small scale ORC
PublicationThe main aim of this paper was to present various expansion devices for an application in the small scale ORC system. The investigations were carried out in two parallel directions. One direction was to design and construct a device dedicated to the analyzed ORC system. The second direction was to adapt existing expansion devices for the needs of the analyzed ORC system. Four various devices were described and presented together...
-
Towards Effective Processing of Large Text Collections
PublicationIn the article we describe the approach to parallelimplementation of elementary operations for textual data categorization.In the experiments we evaluate parallel computations ofsimilarity matrices and k-means algorithm. The test datasets havebeen prepared as graphs created from Wikipedia articles relatedwith links. When we create the clustering data packages, wecompute pairs of eigenvectors and eigenvalues for visualizationsof...
-
Benchmarking Parallel Chess Search in Stockfish on Intel Xeon and Intel Xeon Phi Processors
PublicationThe paper presents results from benchmarking the parallel multithreaded Stockfish chess engine on selected multi- and many-core processors. It is shown how the strength of play for an n-thread version compares to 1-thread version on both Intel Xeon and latest Intel Xeon Phi x200 processors. Results such as the number of wins, losses and draws are presented and how these change for growing numbers of threads. Impact of using particular...
-
Low harmonic multipulse voltage converters using coupled reactors
PublicationThis paper presents a novel approach to the multi pulse voltage converters (VC), especially voltage source inverters (VSI) and matrix converters (MC) based on several typical identical modules connected in parallel using coupled reactors. Such arrangements resulting in lower voltage distortions at extremely low switching frequency. The proposed arrangement was validated by simulation. Laboratory models of 18- and 24pulse 3-level...
-
Method of reconstructing two-dimensional velocity fields on the basis of temperature field values measured with a thermal imaging camera
PublicationThis paper describes a novel numerical reconstruction procedure (NRP) of the velocity field during natural convective heat transfer from a two-sided, isothermal, heated vertical plate based only on the known temperature field obtained, e.g. with a thermal imaging camera. It has been demonstrated that with a knowledge of temperature distributions, the NRP enables the reconstruction of velocity fields by solving the Navier-Stokes...
-
Assessment of OpenMP Master–Slave Implementations for Selected Irregular Parallel Applications
PublicationThe paper investigates various implementations of a master–slave paradigm using the popular OpenMP API and relative performance of the former using modern multi-core workstation CPUs. It is assumed that a master partitions available input into a batch of predefined number of data chunks which are then processed in parallel by a set of slaves and the procedure is repeated until all input data has been processed. The paper experimentally...
-
Parallelization of large vector similarity computations in a hybrid CPU+GPU environment
PublicationThe paper presents design, implementation and tuning of a hybrid parallel OpenMP+CUDA code for computation of similarity between pairs of a large number of multidimensional vectors. The problem has a wide range of applications, and consequently its optimization is of high importance, especially on currently widespread hybrid CPU+GPU systems targeted in the paper. The following are presented and tested for computation of all vector...