Search results for: parallel processing
-
KernelHive: a new workflow-based framework for multilevel high performance computing using clusters and workstations with CPUs and GPUs
PublicationThe paper presents a new open-source framework called KernelHive for multilevel parallelization of computations among various clusters, cluster nodes, and finally, among both CPUs and GPUs for a particular application. An application is modeled as an acyclic directed graph with a possibility to run nodes in parallel and automatic expansion of nodes (called node unrolling) depending on the number of computation units available....
-
Parallel Computations of Text Similarities for Categorization Task
PublicationIn this chapter we describe an approach to the parallel implementation of similarity computations in high-dimensional spaces. The similarity computations have been used for textual data categorization. We create the test datasets from Wikipedia articles, which together with their hyper references form a graph used in our experiments. Similarities based on Euclidean distance and the cosine measure have been used to process the data with the k-means algorithm....
-
Modern Platform for Parallel Algorithms Testing: Java on Intel Xeon Phi
PublicationParallel algorithms are a popular method of increasing system performance. Apart from showing their properties using asymptotic analysis, proof-of-concept implementation and practical experiments are often required. In order to speed up the development and provide a simple and easily accessible testing environment that enables execution of reliable experiments, the paper proposes a platform with a multi-core computational accelerator:...
-
Modeling SPMD Application Execution Time
PublicationParallel applications in the Single Program Multiple Data (SPMD) paradigm split huge amounts of data among multiple processors that work in parallel on small data packets. As the individual data packets are not independent, the processors must interact with each other to exchange results of the calculations with their adjacent partners and take these results into account in their own computations. An example of SPMD is geometric parallelism...
-
Two Stage SVM and kNN Text Documents Classifier
PublicationThe paper presents an approach to the large scale text documents classification problem in parallel environments. A two stage classifier is proposed, based on a combination of k-nearest neighbors and support vector machines classification methods. The details of the classifier and the parallelisation of classification, learning and prediction phases are described. The classifier makes use of our method named one-vs-near. It is...
-
Numerical Study on Mitigation of Flow Maldistribution in Parallel Microchannel Heat Sink: Channels Variable Width Versus Variable Height Approach
PublicationA microchannel heat sink on the one hand enjoys the benefits of heat transfer performance intensified severalfold, but on the other hand suffers an aggravated form of otherwise trifling limitations associated with imperfect hydrodynamics and heat transfer behavior. Flow maldistribution is one such limitation; it exaggerates temperature nonuniformity across parallel microchannels, leading to an increase in maximum base temperature. Recently, variable...
-
ENERGY EFFICIENT AND ENVIRONMENTALLY FRIENDLY HYBRID CONVERSION OF INLAND PASSENGER VESSEL
PublicationThe development and growing availability of modern technologies, along with more and more severe environment protection standards which frequently take a form of legal regulations, are the reason why attempts are made to find a quiet and economical propulsion system not only for newly built watercraft units, but also for modernised ones. Correct selection of the propulsion and supply system for a given vessel affects significantly...
-
Parallel Programming for Modern High Performance Computing Systems
PublicationIn view of the growing presence and popularity of multicore and manycore processors, accelerators, and coprocessors, as well as clusters using such computing devices, the development of efficient parallel applications has become a key challenge to be able to exploit the performance of such systems. This book covers the scope of parallel programming for modern high performance computing systems. It first discusses selected and...
-
MERPSYS: An environment for simulation of parallel application execution on large scale HPC systems
PublicationIn this paper we present a new environment called MERPSYS that allows simulation of parallel application execution time on cluster-based systems. The environment offers a modeling application using the Java language extended with methods representing message passing type communication routines. It also offers a graphical interface for building a system model that incorporates various hardware components such as CPUs, GPUs, interconnects...
-
A New Approach for the Mitigating of Flow Maldistribution in Parallel Microchannel Heat Sink
PublicationThe problem of flow maldistribution is very critical in microchannel heat sinks (MCHS). It induces temperature nonuniformity, which may ultimately lead to the breakdown of associated system. In the present communication, a novel approach for the mitigation of flow maldistribution problem in parallel MCHS has been proposed using variable width microchannels. Numerical simulation of copper made parallel MCHS consisting of 25 channels...
-
Runtime Visualization of Application Progress and Monitoring of a GPU-enabled Parallel Environment
PublicationThe paper presents design, implementation and real life uses of a visualization subsystem for a distributed framework for parallelization of workflow-based computations among clusters with nodes that feature both CPUs and GPUs. Firstly, the proposed system presents a graphical view of the infrastructure with clusters, nodes and compute devices along with parameters and runtime graphs of load, memory available, fan speeds etc. Secondly,...
-
Dynamic Data Management Among Multiple Databases for Optimization of Parallel Computations in Heterogeneous HPC Systems
PublicationRapid development of diverse computer architectures and hardware accelerators means that designing parallel systems faces new problems resulting from their heterogeneity. Our implementation of a parallel system called KernelHive makes it possible to run applications efficiently in a heterogeneous environment consisting of multiple collections of nodes with different types of computing devices. The execution engine of the system is open for...
-
The Quick Measure of a Nurbs Surface Curvature for Accurate Triangular Meshing
PublicationNURBS surfaces are the most widely used surfaces for three-dimensional models in CAD/CAE programs. When a model for FEM calculation is prepared with a CAD program, it must eventually be meshed. There are many algorithms for meshing planar regions. Some of them may be used for meshing surfaces, but the curvature of the surface must be taken into account to avoid a poor-quality mesh. The mesh must be denser in the...
-
Survey of Methodologies, Approaches, and Challenges in Parallel Programming Using High-Performance Computing Systems
PublicationThis paper provides a review of contemporary methodologies and APIs for parallel programming, with representative technologies selected in terms of target system type (shared memory, distributed, and hybrid), communication patterns (one-sided and two-sided), and programming abstraction level. We analyze representatives in terms of many aspects including programming model, languages, supported platforms, license, optimization goals,...
-
Performance and Power-Aware Modeling of MPI Applications for Cluster Computing
PublicationThe paper presents modeling of performance and power consumption when running parallel applications on modern cluster-based systems. The model includes basic so-called blocks representing either computations or communication. The latter includes both point-to-point and collective communication. Real measurements were performed using MPI applications and routines run on three different clusters with both Infiniband and Gigabit Ethernet...
-
Modeling energy consumption of parallel applications
PublicationThe paper presents modeling and simulation of energy consumption of two types of parallel applications: geometric Single Program Multiple Data (SPMD) and divide-and-conquer (DAC). Simulation is performed in a new MERPSYS environment. Model of an application uses the Java language with extension representing message exchange between processes working in parallel. Simulation is performed by running threads representing distinct process...
-
Testing for conformance of parallel programming pattern languages
PublicationThis paper reports on the project being run by TUG and IMAG, aimed at reducing the volume of tests required to exercise parallel programming language compilers and libraries. The idea is to use the ISO STEP standard scheme for conformance testing of software products. A detailed example illustrating the ongoing work is presented.
-
Mechanism of recognition of parallel G-quadruplexes by DEAH/RHAU helicase DHX36 explored by molecular dynamics simulations
PublicationBecause of high stability and slow unfolding rates of G-quadruplexes (G4), cells have evolved specialized helicases that disrupt these non-canonical DNA and RNA structures in an ATP-dependent manner. One example is DHX36, a DEAH-box helicase, which participates in gene expression and replication by recognizing and unwinding parallel G4s. Here, we studied the molecular basis for the high affinity and specificity of DHX36 for parallel-type...
-
Bounds on the Cover Time of Parallel Rotor Walks
PublicationThe rotor-router mechanism was introduced as a deterministic alternative to the random walk in undirected graphs. In this model, a set of k identical walkers is deployed in parallel, starting from a chosen subset of nodes, and moving around the graph in synchronous steps. During the process, each node maintains a cyclic ordering of its outgoing arcs, and successively propagates walkers which visit it along its outgoing arcs in...
-
Genetic Positioning of Fire Stations Utilizing Grid-computing Platform
PublicationThe chapter presents a model for determining near-optimal locations of fire stations based on the topography of a given area and the location of forests, rivers, lakes and other elements of the site. The model is based on the principles of genetic algorithms and utilizes the power of the grid to distribute and execute in parallel the most performance-demanding computations involved in the algorithm.
-
Machine Learning in Multi-Agent Systems using Associative Arrays
PublicationIn this paper, a new machine learning algorithm for multi-agent systems is introduced. The algorithm is based on associative arrays, which makes it a less complex and more efficient substitute for artificial neural networks and Bayesian networks, as confirmed by performance measurements. Implementation of the machine learning algorithm in a multi-agent system for aided design of selected control systems made it possible to improve the performance...
-
Acceleration of the discrete Green's function computations
PublicationResults of the acceleration of the 3-D discrete Green's function (DGF) computations on the multicore processor are presented. The code was developed in the multiple precision arithmetic with use of the OpenMP parallel programming interface. As a result, the speedup factor of three orders of magnitude compared to the previous implementation was obtained thus applicability of the DGF in FDTD simulations was significantly improved.
-
Propagation in rectangular waveguides with a pseudochiral Ω slab
PublicationThe transfer matrix approach is applied to the analysis of waveguides loaded with a uniaxial pseudochiral Ω slab. In particular, a pseudochiral parallel plate and rectangular guides are investigated. Based on the numerical analysis, the influence of the pseudochirality on propagation characteristics and field distribution is examined. Other features such as a field displacement phenomenon appearing in both considered structures...
-
Auto-tuning methodology for configuration and application parameters of hybrid CPU + GPU parallel systems based on expert knowledge
PublicationAuto-tuning of configuration and application parameters allows to achieve significant performance gains in many contemporary compute-intensive applications. Feasible search spaces of parameters tend to become too big to allow for exhaustive search in the auto-tuning process. Expert knowledge about the utilized computing systems becomes useful to prune the search space and new methodologies are needed in the face of emerging heterogeneous...
-
Two-Stage Identification of Locally Stationary Autoregressive Processes and its Application to the Parametric Spectrum Estimation
PublicationThe problem of identification of a nonstationary autoregressive process with unknown, and possibly time-varying, rate of parameter changes, is considered and solved using the parallel estimation approach. The proposed two-stage estimation scheme, which combines the local estimation approach with the basis function one, offers both quantitative and qualitative improvements compared with the currently used single-stage methods.
-
Steam turbines governors in power system restoration process
PublicationThe paper discusses problems related to the electric power system restoration process. The influence of the turbine controller operating mode in a small subsystem is analyzed. The abilities, advantages and drawbacks of the controller's two operating modes, power control and speed (frequency) control, are considered for rebuilding an electric power system from a single generating unit to a few generators running in parallel.
-
Unusual divergence of magnetoacoustic beams
PublicationTwo-dimensional magnetosonic beams directed along a line forming a constant angle h with the equilibrium straight magnetic field are considered. Perturbations in a plasma are described by the system of ideal magnetohydrodynamic equations. The dynamics of perturbations in a beam are different in the cases of fast and slow modes, and it is determined by h and equilibrium parameters of a plasma. In particular, a beam divergence may...
-
Efficient parallel implementation of crowd simulation using a hybrid CPU+GPU high performance computing system
PublicationIn the paper we present a modern efficient parallel OpenMP+CUDA implementation of crowd simulation for hybrid CPU+GPU systems and demonstrate its higher performance over CPU-only and GPU-only implementations for several problem sizes including 10 000, 50 000, 100 000, 500 000 and 1 000 000 agents. We show how performance varies for various tile sizes and what CPU–GPU load balancing settings shall be preferred for various domain...
-
10-Methyl- and 9,10-dimethyl acridinium methyl sulfate
PublicationThe title compounds, C(14)H(12)N(+).CH(3)O(4)S(-), (I), and C(15)H(14)N(+).CH(3)O(4)S(-), (II), respectively, crystallize with the planar 10-methylacridinium or 9,10-dimethylacridinium cations arranged in layers, parallel to the twofold axis in (I) and perpendicular to the 2(1) axis in (II). Adjacent cations in both compounds are packed in a 'head-to-tail' manner. The methyl sulfate anion only exhibits planar symmetry in (II)....
-
Network-aware Data Prefetching Optimization of Computations in a Heterogeneous HPC Framework
PublicationRapid development of diverse computer architectures and hardware accelerators means that designing parallel systems faces new problems resulting from their heterogeneity. Our implementation of a parallel system called KernelHive makes it possible to run applications efficiently in a heterogeneous environment consisting of multiple collections of nodes with different types of computing devices. The execution engine of the system is open for...
-
A CMOS Pixel With Embedded ADC, Digital CDS and Gain Correction Capability for Massively Parallel Imaging Array
PublicationIn the paper, a CMOS pixel has been proposed for imaging arrays with massively parallel image acquisition and simultaneous compensation of dark signal nonuniformity (DSNU) as well as photoresponse nonuniformity (PRNU). In our solution the pixel contains all necessary functional blocks: a photosensor and an analog-to-digital converter (ADC) with built-in correlated double sampling (CDS) integrated together. It is implemented in...
-
A New Method of Noncausal Identification of Time-varying Systems
PublicationThe paper shows that the problem of noncausal identification of a time-varying FIR (finite impulse response) system can be reformulated, and solved, as a problem of smoothing of the preestimated parameter trajectories. Characteristics of the smoothing filter should be chosen so as to provide the best trade-off between the bias and variance of the resulting estimates. It is shown that optimization of the smoothing operation can...
-
The evaluation of the vibration measurement usability of electronic indicator lemag "premet C"
PublicationThe measuring possibilities of modern compression and combustion pressure analyzers are extended with additional functions. One of them is parallel to the pressure measurement, the measurement of vibrations in the region of the cylinder head. The paper presents a general assessment of the vibration measurement function of the electronic indicator LEMAG "PREMET C". This feature is very rarely offered by manufacturers of these devices....
-
Parallel immune system for graph coloring
PublicationThis paper presents a parallel artificial immune system designed for graph coloring. The algorithm is based on the clonal selection principle. Each processor operates on its own pool of antibodies and a migration mechanism is used to allow processors to exchange information. Experimental results show that migration improves the performance of the algorithm. The experiments were performed using a high performance cluster on a set...
-
Fully Adaptive Savitzky-Golay Type Smoothers
PublicationThe problem of adaptive signal smoothing is considered and solved using the weighted basis function approach. In the special case of polynomial basis and uniform weighting the proposed method reduces down to the celebrated Savitzky-Golay smoother. Data adaptiveness is achieved via parallel estimation. It is shown that for the polynomial and harmonic bases and cosinusoidal weighting sequences, the competing signal estimates can be computed...
-
Channel Blockage and Flow Maldistribution during Unsteady Flow in a Model Microchannel Plate Heat Exchanger
PublicationThis paper describes the problem of channel blockage as a result of flow maldistribution between the channels of a model mini channel plate heat exchanger consisting of one pass on each leg. Each leg of the heat exchanger contains 51 parallel and rectangular minichannels of four hydraulic diameters namely 461 μm, 571 μm, 750 μm and 823 μm. In addition, a more complex geometry has been investigated where for the sake of breaking...
-
Edge-Guided Mode Performance and Applications in Nonreciprocal Millimeter-Wave Gyroelectric Components
PublicationThe analogies between the behavior of gyromagnetic and gyroelectric nonreciprocal structures, the use of the simple transfer matrix approach, and the edge-guided (EG) wave property, supported in a parallel plate model for integrated magnetized semiconductor waveguide, are investigated in those frequency regions, where the effective permittivity is negative or positive. As with their ferrite counterparts, the leakage of the EG waves...
-
Infrared techniques for natural convection investigations in channels between two vertical, parallel, isothermal and symmetrically heated plates
PublicationThe effect of the gap width between two symmetrically heated vertical, parallel, isothermal plates on intensity of natural convective heat transfer in a gas (Pr = 0.71) was experimentally studied using the balance and gradient methods. In the former method heat fluxes were determined based on measurements of the voltage and electric current supplying the heaters placed inside the walls. In the latter, heat fluxes were calculated...
-
The bridge over Regalica River in Szczecin - design and construction.
PublicationNowoclowa Route is the largest project in Szczecin. It consists of 11 km of roads and 3.3 km of bridges and viaducts, including three parallel bridges across the Regalica River (the east arm of the Odra River), 535 m long, with spans of 59+90+90+116+116+64 m. The bridge structure consists of two steel plate girders composite with a reinforced concrete deck slab. The design and the construction of the bridge are described in the paper.
-
Modeling the effect of parasitic capacitances on the dead-time distortion in multilevel NPC inverters
PublicationA simple model is derived and verified for evaluating the effect of parasitic capacitances on the dead-time related voltage distortion in multilevel NPC voltage source inverters. The model permits well-defined and precise compensation of dead-time distortion, exhibiting meaningful improvement on compensation methods neglecting the effects of parasitic capacitances. A simple formula is given for evaluating the capacitances as serial/parallel...
-
New Approach to Noncausal Identification of Nonstationary Stochastic FIR Systems Subject to Both Smooth and Abrupt Parameter Changes
PublicationIn this technical note, we consider the problem of finite-interval parameter smoothing for a class of nonstationary linear stochastic systems subject to both smooth and abrupt parameter changes. The proposed parallel estimation scheme combines the estimates yielded by several exponentially weighted basis function algorithms. The resulting smoother automatically adjusts its smoothing bandwidth to the type and rate of nonstationarity...
-
Structural and dynamic insights on the EmrE protein with TPP+ and related substrates through molecular dynamics simulations
PublicationEmrE is a bacterial transporter protein that forms an anti-parallel homodimer with four transmembrane helices in each monomer. EmrE transports positively charged aromatic compounds, such as TPP+ and its derivatives. We performed molecular dynamics (MD) simulations of EmrE in complex with TPP+, MeTPP+, and MBTPP+ embedded in a membrane. The detailed molecular properties and interactions were analysed for all EmrE-ligand complexes....
-
General Provisioning Strategy for Local Specialized Cloud Computing Environments
PublicationThe well-known management strategies in cloud computing based on SLA requirements are considered. A deterministic parallel provisioning algorithm has been prepared and used to show its behavior for three different requirements: load balancing, consolidation, and fault tolerance. The impact of these strategies on the total execution time of different sets of services is analyzed for randomly chosen sets of data. This makes it possible...
-
Towards an efficient multi-stage Riemann solver for nuclear physics simulations
PublicationRelativistic numerical hydrodynamics is an important tool in high energy nuclear science. However, such simulations are extremely demanding in terms of computing power. This paper focuses on improving the speed of solving the Riemann problem with the MUSTA-FORCE algorithm by employing the CUDA parallel programming model. We also propose a new approach to 3D finite difference algorithms, which employ a GPU that uses surface memory....
-
Overflowing tests at the Polish DredgDikes research dike – stability of the dike surface against erosion
PublicationIn the project DredgDikes the different research dike embankments were tested with respect to overflowing water induced erosion. Therefore, flumes were installed on the land side embankments in which the effect of overflowing water on the vegetated surface was investigated. On the Polish DredgDikes research dike near Gdansk, Poland, two parallel flumes were installed and the surface of the dike made of different mixtures of...
-
Degradation of polyurethanes in Compost Under Natural Conditions
PublicationThe subject of the studies was the estimation of the degradability of different polyurethanes in a compost pile under natural, weather-dependent conditions. The incubation of polymer samples took place for a period of up to 24 months. The characteristic parameters of the compost (temperature, pH, moisture content, and dehydrogenase activity) were monitored and their influence on the degradation of polyurethanes was discussed. The compostability...
-
Design of weighted PID controllers for control of the Stewart-Gough platform
PublicationStewart-Gough platform (SGP) is a popular parallel type manipulator that involves a 6 degrees of freedom (DOF) motion. In this paper, the process of mathematical modelling of SGP is presented. Two selected control algorithms that use PID controllers and weighted PID controllers are designed. Both control systems using these algorithms are implemented in MATLAB environment as well as on the actual SGP. Parameters of the controllers...
-
On the preestimation technique and its application to identification of nonstationary systems
PublicationThe problem of noncausal identification of a nonstationary stochastic FIR (finite impulse response) system is reformulated, and solved, as a problem of smoothing of preestimated parameter trajectories. Three approaches to preestimation are critically analyzed and compared. It is shown that optimization of the smoothing operation can be performed adaptively using the parallel estimation technique. The new approach is computationally...
-
Modern Arrangement for Reduction of Voltage Perturbations
PublicationThis chapter covers general problems and the most important issues of power-supply-quality improvement in AC systems. In this context, consideration is given to the evaluation of bilateral interactions of receivers with an electrical power-distribution system and to methods of their reduction. Also discussed are the basis of operation of the most important compensation-filtration devices and their applications...
-
Seawater intrusion due to pumping mitigated by natural freshwater flux: a case study in Władysławowo, northern Poland
PublicationThe paper presents a case study of seawater intrusion into a coastal aquifer, caused by a groundwater intake located close to the seashore in Władysławowo, northern Poland. Evolution of the basic hydrogeochemical parameters for the 50-year period from 1964 to 2014 indicates progressing encroachment of saline seawater into the aquifer. However, the spatial pattern of salinity was influenced by the variability of hydraulic gradient...
-
Improving web user experience with caching user interface
PublicationOften, Web technologies are used to operate or to configure network-enabled equipment, to configure and administer modular applications, or as teaching environments. The comfort of human work requires a similar response time in these applications as in the Internet. To improve response time, various forms of caching at different levels are employed. To improve the user experience in regard to response time when performing specific...
-
PublicationThe chapter analyses the K-Means algorithm in its parallel setting. We provide a detailed description of the algorithm as well as the way we parallelize the computations. We identified the complexity of the particular steps of the algorithm, which allows us to build the algorithm model in the MERPSYS system. The simulations with MERPSYS have been performed for different sizes of the data as well as for different numbers of processors used for the computations. The results we got using the model have been compared to the results obtained from a real computational environment.
-
A multithreaded CUDA and OpenMP based power‐aware programming framework for multi‐node GPU systems
PublicationIn the paper, we have proposed a framework that allows programming a parallel application for a multi-node system, with one or more GPUs per node, using an OpenMP+extended CUDA API. OpenMP is used for launching threads responsible for management of particular GPUs and extended CUDA calls allow to manage CUDA objects, data and launch kernels. The framework hides inter-node MPI communication from the programmer who can benefit from...
-
Sensorless predictive control of three-phase parallel active filter
PublicationThe paper presents the control system of parallel active power filter (APF) with predictive reference current calculation and model based predictive current control. The novel estimator and predictor of grid emf is proposed for AC voltage sensorless operation of APF, regardless of distortion of this voltage. Proposed control system provides control of APF current with high precision and dynamics limited only by filter circuit parameters....
-
Single and Series of Multi-valued Decision Diagrams in Representation of Structure Function
PublicationStructure function, which defines dependency of performance of the system on performance of its components, is a key part of system description in reliability analysis. In this paper, we compare two approaches for representation of the structure function. The first one is based on use of a single Multi-valued Decision Diagram (MDD) and the second on use of a series of MDDs. The obtained results indicate that the series of MDDs...
-
BUILDINGS AND CONSTRUCTIONS FROM BUILDING WASTE MATERIALS IN ACCORDANCE WITH THE NEW CIRCULAR ECONOMY TREND
Publication3rd International Conference on Materials Science & Engineering, April 18-22, 2022, Boston, MA. Abstract Book, Day 5, Friday, April 22, 2022, Parallel Session II (Virtual EST Zone), pp. 154. At: Boston, MA. Affiliation: UNICEF at: USA BOSTON. Volume: 2022
-
Multi-source-supplied parallel hybrid propulsion of the inland passenger ship STA.H. Research work on energy efficiency of a hybrid propulsion system operating in the electric motor drive mode
PublicationIn the Faculty of Ocean Engineering and Ship Technology, Gdansk University of Technology, design has recently been developed of a small inland ship with hybrid propulsion and supply system. The ship will be propelled by a specially designed so called parallel hybrid propulsion system. The work was aimed at carrying out the energy efficiency analysis of a hybrid propulsion system operating in the electric motor drive mode and at...
-
Experiences from operation of various expansion devices in small scale ORC
PublicationThe main aim of this paper was to present various expansion devices for an application in the small scale ORC system. The investigations were carried out in two parallel directions. One direction was to design and construct a device dedicated to the analyzed ORC system. The second direction was to adapt existing expansion devices for the needs of the analyzed ORC system. Four various devices were described and presented together...
-
Benchmarking Parallel Chess Search in Stockfish on Intel Xeon and Intel Xeon Phi Processors
PublicationThe paper presents results from benchmarking the parallel multithreaded Stockfish chess engine on selected multi- and many-core processors. It is shown how the strength of play for an n-thread version compares to 1-thread version on both Intel Xeon and latest Intel Xeon Phi x200 processors. Results such as the number of wins, losses and draws are presented and how these change for growing numbers of threads. Impact of using particular...
-
Low harmonic multipulse voltage converters using coupled reactors
PublicationThis paper presents a novel approach to multipulse voltage converters (VC), especially voltage source inverters (VSI) and matrix converters (MC), based on several typical identical modules connected in parallel using coupled reactors. Such arrangements result in lower voltage distortions at extremely low switching frequency. The proposed arrangement was validated by simulation. Laboratory models of 18- and 24-pulse 3-level...
-
Method of reconstructing two-dimensional velocity fields on the basis of temperature field values measured with a thermal imaging camera
PublicationThis paper describes a novel numerical reconstruction procedure (NRP) of the velocity field during natural convective heat transfer from a two-sided, isothermal, heated vertical plate based only on the known temperature field obtained, e.g. with a thermal imaging camera. It has been demonstrated that with a knowledge of temperature distributions, the NRP enables the reconstruction of velocity fields by solving the Navier-Stokes...
-
Assessment of OpenMP Master–Slave Implementations for Selected Irregular Parallel Applications
PublicationThe paper investigates various implementations of a master–slave paradigm using the popular OpenMP API and relative performance of the former using modern multi-core workstation CPUs. It is assumed that a master partitions available input into a batch of predefined number of data chunks which are then processed in parallel by a set of slaves and the procedure is repeated until all input data has been processed. The paper experimentally...
-
Chained machine learning model for predicting load capacity and ductility of steel fiber–reinforced concrete beams
PublicationOne of the main issues associated with steel fiber–reinforced concrete (SFRC) beams is the ability to anticipate their flexural response. With a comprehensive grid search, several stacked models (i.e., chained, parallel) consisting of various machine learning (ML) algorithms and artificial neural networks (ANNs) were developed to predict the flexural response of SFRC beams. The flexural performance of SFRC beams under bending was...
-
Parallelization of large vector similarity computations in a hybrid CPU+GPU environment
PublicationThe paper presents design, implementation and tuning of a hybrid parallel OpenMP+CUDA code for computation of similarity between pairs of a large number of multidimensional vectors. The problem has a wide range of applications, and consequently its optimization is of high importance, especially on currently widespread hybrid CPU+GPU systems targeted in the paper. The following are presented and tested for computation of all vector...
-
Surface irregularities as a complex signal of tool representation together with uneven displacement in respect to the workpiece
PublicationIn a dynamic machining process, distortion in surface irregularity is a very complex phenomenon. Surface irregularities form a periodic representation of the tool profile with various kinds of disturbance in a broad range of changes in the height and length of the profile. To discern these irregularity disturbances, interactions of the tool in the form of changes perpendicular and parallel relative to the workpiece were analyzed...
-
Small Vessel with Inboard Engine Retrofitting Concepts; Real Boat Tests, Laboratory Hybrid Drive Tests and Theoretical Studies
PublicationThe development of modern technologies and their increasing availability, as well as the falling costs of highly efficient propulsion systems and power sources, have resulted in electric or hybrid propulsion systems’ growing popularity for use on watercraft. Presented in the paper are design and lab tests of a prototype parallel hybrid propulsion system. It describes a concept of retrofitting a conventionally powered nine-meter-long...
-
Decentralized control of a different rated parallel UPS systems
PublicationThe paper presents a single-phase uninterruptible power supply (UPS) system with galvanically separated DC-AC-DC-AC converters operating in parallel. A communication system between converters, based on the CAN physical layer, has been developed and applied; it allows the use of decentralized master-slave control, providing a high availability factor of the whole UPS system. The control system of particular converters has been developed...
-
Quasi-resonant DC-link voltage inverter with enhanced zero-voltage switching control
PublicationA new topology modification of the parallel quasi-resonant circuit for a dc-link voltage inverter enables regulation of the zero voltage dc-link subperiods and the dc-link voltage gradient settings. The proposed circuit is based on four MOSFET switches with free-wheeling diodes for controlled quasi-resonant recharging between L-C tank in order to assure inverter zero voltage switching (ZVS) conditions. Design optimization of the...
-
Uniform Model Interface for Assurance Case Integration with System Models
PublicationAssurance cases are developed and maintained in parallel with corresponding system models and therefore need to reference each other. Managing the correctness and consistency of interrelated safety argument and system models is essential for system dependability and is a nontrivial task. The model interface presented in this paper enables a uniform process of establishing and managing assurance case references to various types...
-
DEPO: A dynamic energy‐performance optimizer tool for automatic power capping for energy efficient high‐performance computing
PublicationIn the article we propose an automatic power capping software tool DEPO that allows one to perform runtime optimization of performance and energy related metrics. For an assumed application model with an initialization phase followed by a running phase with uniform compute and memory intensity, the tool performs automatic tuning engaging one of the two exploration algorithms—linear search (LS) and golden section search (GSS), finds...
-
Implementation of spatial/polarization diversity for improved-performance circularly polarized multiple-input-multiple-output ultra-wideband antenna
PublicationIn this paper, spatial and polarization diversities are simultaneously implemented in an ultra-wideband (UWB) multiple-input-multiple-output (MIMO) antenna to reduce the correlation between the parallel-placed radiators. The keystone of the antenna is systematically modified coplanar ground planes that enable excitation of circular polarization (CP). To realize one sense of circular polarization as well as ultra-wideband operation,...
-
Development and application of asphalt binder relaxation test in different dynamic shear rheometers
PublicationIn this study, a novel relaxation test is proposed to evaluate asphalt binder low temperature properties using a Dynamic Shear Rheometer (DSR) with parallel plates of 4 mm in diameter. Three rheometers from three different manufacturers are used to analyze seven asphalt binders. Different material parameters are derived which are useful to evaluate and discriminate different asphalt binders. Test results of all three instruments...
-
Initial Report on Numerical Modeling of Blood Flow in Myocardial Bridge Region of Coronary Artery: Concept of Model Validation
PublicationThe paper presents a numerical method of blood flow simulation within the coronary artery covered by the myocardial bridge. The myocardial bridge is a congenital coronary abnormality caused by the blood vessel location under one of the heart muscles. In this case, the blood flow within the vessel is partially disturbed which can cause several consequences. The presented numerical simulation allowed us to estimate the blood flow...
-
Linking Fashion and Tourism: From Body to Clothing and Lifestyle
PublicationThere are many profound links between fashion and tourism. This chapter provides a critical reflection, mainly from a philosophical, historical, and linguistic perspective, on the dynamic relationship and parallel evolution between these two sectors. It explains how their interconnectedness forms and mirrors contemporary society. This chapter classifies the connections between the two, starting with the person, her body, and the...
-
A Concept of Modeling and Optimization of Applications in Large Scale Systems
PublicationThe chapter presents the idea that includes modeling and subsequent optimization of application execution on large scale parallel and distributed systems. The model considers performance, reliability and power consumption. It should allow easy modeling of various classes of applications while reflecting key parameters of both the applications and two classes of target systems: clusters and volunteer based systems. The chapter presents...
-
Resonant DC link inverters for AC motor drive systems – critical evaluation
PublicationIn this survey paper, resonant and quasiresonant DC link inverters are reexamined for AC motor drive applications. Critical evaluation of representative topologies is based on simulation and waveform analysis to characterize current/voltage stress of components, control timing constraints and feasibility. A special concern over inverter common-mode voltage and voltage gradient du/dt limitation capacity is discussed for motor bearing...
-
On Anti-Plane Surface Waves Considering Highly Anisotropic Surface Elasticity Constitutive Relations
PublicationWithin the framework of the highly anisotropic surface elasticity model we discuss the propagation of a new type of surface waves, namely anti-plane surface waves. By the highly anisotropic surface elasticity model we mean the model with a surface strain energy density which depends on an incomplete set of second derivatives of displacements. From the physical point of view this model corresponds to a coating made of a family of parallel...
-
Scheduling of identical jobs with bipartite incompatibility graphs on uniform machines. Computational experiments
PublicationWe consider the problem of scheduling unit-length jobs on three or four uniform parallel machines to minimize the schedule length or total completion time. We assume that the jobs are subject to some types of mutual exclusion constraints, modeled by a bipartite graph of a bounded degree. The edges of the graph correspond to the pairs of jobs that cannot be processed on the same machine. Although the problem is generally NP-hard,...
-
Modeling and Simulation for Exploring Power/Time Trade-off of Parallel Deep Neural Network Training
PublicationIn the paper we tackle the bi-objective execution time and power consumption optimization problem concerning execution of parallel applications. We propose using a discrete-event simulation environment for exploring this power/time trade-off in the form of a Pareto front. The solution is verified by a case study based on a real deep neural network training application for automatic speech recognition. A simulation lasting over 2 hours...
-
Improving Effectiveness of SVM Classifier for Large Scale Data
PublicationThe paper presents our approach to SVM implementation in a parallel environment. We describe how the classification learning and prediction phases were parallelised. We also propose a method for limiting the number of necessary computations during classifier construction. Our method, named one-vs-near, is an extension of the typical one-vs-all approach that is used for binary classifiers to work with multiclass problems. We perform experiments...
-
Ammonium O,O'-diethyl dithiophosphate
PublicationIn the title compound, NH4+·(C2H5O)2PS2−, the ammonium cation is connected by four charge-assisted N−H···S hydrogen bonds to four tetrahedral O,O'-diethyl dithiophosphate anions, forming layers parallel to (100). The polar and non-polar constituents of the layers are stacked alternately along [100]. Interlacing of the external ethyl groups...
-
On noncausal weighted least squares identification of nonstationary stochastic systems
PublicationIn this paper, we consider the problem of noncausal identification of nonstationary, linear stochastic systems, i.e., identification based on prerecorded input/output data. We show how several competing weighted (windowed) least squares parameter smoothers, differing in memory settings, can be combined together to yield a better and more reliable smoothing algorithm. The resulting parallel estimation scheme automatically adjusts...
-
On noncausal identification of nonstationary stochastic systems
PublicationIn this paper we consider the problem of noncausal identification of nonstationary, linear stochastic systems, i.e., identification based on prerecorded input/output data. We show how several competing weighted least squares parameter smoothers, differing in memory settings, can be combined together to yield a better and more reliable smoothing algorithm. The resulting parallel estimation scheme automatically adjusts its smoothing...
-
Comparison of EHD devices with parallel and in series spiked electrodes
PublicationIn this paper two electrohydrodynamic (EHD) devices for gas pumping and cleaning are presented. In both cases, corona discharge was used to induce an airflow in the device. The discharge was generated between the spiked electrodes, set in parallel (the first case) or in series (the second case), and the plate electrodes. An asymmetric electric field and the generated discharge result in unidirectional gas flow through the EHD device....
-
Risk Analysis by a Probabilistic Model of the Measurement Process
PublicationThe aim of the article is to present the testing methodology and the results of examining the probabilistic model of the measurement process. The case study concerns the determination of the risk of an incorrect decision in the assessment of the compliance of products by measurement. The measurand is characterized by the generalized Rayleigh distribution. The model of the measurement process was tested in parallel mode by six risk...
-
Selected studies of flow maldistribution in a minichannel plate heat exchanger
PublicationAn analysis of the state of the art in research on minichannel heat exchangers, especially on the topic of flow maldistribution in multiple channels, has been carried out. Studies on a minichannel plate heat exchanger with 51 parallel minichannels with four hydraulic diameters, i.e., 461 μm, 574 μm, 667 μm, and 750 μm, have been presented. Flow at the instant of filling the microchannel with water at low flow rates has been visualized. The...
-
Process zone in the Single Cantilever Beam under transverse loading. - Part I: Theoretical analysis
PublicationA Single Cantilever Beam (SCB) specimen loaded with a transverse force parallel to the crack front is proposed for the analysis of crack propagation phenomena under mixed mode conditions. The stress redistribution in the adhesive layer in the vicinity of the crack front, as well as the beam deformation, are estimated using a Timoshenko beam on elastic foundation model. This model emphasizes the Mode II contribution due to flexural beam...
-
Investigation of Mechanical and Microstructural Properties of Welded Specimens of AA6061-T6 Alloy with Friction Stir Welding and Parallel Friction Stir Welding Methods
PublicationThe present study investigates the effect of two parameters, process type and tool offset, on the tensile, microhardness, and microstructure properties of AA6061-T6 aluminum alloy joints. Three methods, Friction Stir Welding (FSW), Advancing Parallel-Friction Stir Welding (AP-FSW), and Retreating Parallel-Friction Stir Welding (RP-FSW), were used. In addition, four tool offsets of 0.5, 1, 1.5, and 2 mm were used in two welding...
-
Real-Time Bleeding Detection in Gastrointestinal Tract Endoscopic Examinations Video
PublicationThe article presents a novel approach to medical video data analysis and recognition of bleedings. Emphasis has been put on adapting pre-existing algorithms dedicated to the detection of bleedings for real-time usage in a medical doctor’s office during an endoscopic examination. A real-time system for analyzing endoscopic videos has been designed according to the most significant requirements of medical doctors. The main goal of...
-
Design and experimental validation of a single-stage PV string inverter with optimal number of interleaved buck-boost cells.
PublicationIncreasing converter power density is a problem of topical interest. This paper discusses an interleaved approach to increasing efficiency in the buck-boost stage of an inverter with an unfolding circuit, in terms of losses in semiconductors, output voltage ripples and power density. Main trends in power converter development are reviewed. A loss model was designed and used for the proposed solution to find an optimal number...
-
Fixed Pattern Noise Reduction and Linearity Improvement in Time-Mode CMOS Image Sensors
PublicationIn the paper, a digital clock stopping technique for gain and offset correction in time-mode analog-to-digital converters (ADCs) has been proposed. The technique is dedicated to imagers with massively parallel image acquisition working in the time mode where compensation of dark signal non-uniformity (DSNU) as well as photo-response non-uniformity (PRNU) is critical. Fixed pattern noise (FPN) reduction has been experimentally validated...
-
Microstrip four-port circulator using a ferrite coupled line section
PublicationThis paper describes an alternative configuration of a four-port circulator realized in a microstrip ferrite coupled line technology. The proposed fully planar device employs two three-port circulators consisting of a ferrite coupled line junction and T junction. Both circulators are connected through the same arm, hence, the problem of anti-parallel magnetization met in this type of circulators is avoided without the increase...
-
Comment on "On accurate capacitance characterization of organic photovoltaic cells"
PublicationIn the 100th volume of Applied Physics Letters, Carr and Chaudhary presented a work on capacitance characterization of organic photovoltaic cells. The work concerns small signal measurements of various organic photovoltaic structures. The authors, however, limit their considerations to one part of the small signal response, namely to capacitance measured either in parallel mode or in series mode. This approach generally does not...
-
Multi-pulse VSC arrangements with coupled reactors
PublicationThis paper presents a novel approach to the multipulse VSC (Voltage Source Converter) arrangements based on several conventional inverter modules connected in parallel by using coupled reactors. This solution reduces the THD of the output voltage, despite the low switching frequency of transistors. The advantage of the proposed solution is also a relatively small rated power of the reactors. Proposed new arrangements for different...
-
Benchmarking Deep Neural Network Training Using Multi- and Many-Core Processors
PublicationIn the paper we provide thorough benchmarking of deep neural network (DNN) training on modern multi- and many-core Intel processors in order to assess performance differences for various deep learning as well as parallel computing parameters. We present performance of DNN training for Alexnet, Googlenet, Googlenet_v2 as well as Resnet_50 for various engines used by the deep learning framework, for various batch sizes. Furthermore,...
-
A new look at the statistical identification of nonstationary systems
PublicationThe paper presents a new, two-stage approach to identification of linear time-varying stochastic systems, based on the concepts of preestimation and postfiltering. The proposed preestimated parameter trajectories are unbiased but have large variability. Hence, to obtain reliable estimates of system parameters, the preestimated trajectories must be further filtered (postfiltered). It is shown how one can design and optimize such...
-
Fast implementation of FDTD-compatible green's function on multicore processor
PublicationIn this letter, numerically efficient implementation of the finite-difference time domain (FDTD)-compatible Green's function on a multicore processor is presented. Recently, closed-form expression of this discrete Green's function (DGF) was derived, which simplifies its application in the FDTD simulations of radiation and scattering problems. Unfortunately, the new DGF expression involves binomial coefficients, whose computations...
-
Fully enzymatic mediatorless fuel cell with efficient naphthylated carbon nanotube-laccase composite cathodes
PublicationAn efficient, mediator-free enzymatic glucose/O2 biofuel cell with an oxygen intensive anode based on glucose dehydrogenase is presented. In the device, the power of the biofuel cell and electrode potentials of each of the enzymatic electrodes were monitored in parallel under the biofuel cell working conditions. The carbon nanotube composite biocathode demonstrates an almost constant electrode potential vs. saturated calomel electrode...
-
A Novel Synthesis Technique for Microwave Bandpass Filters with Frequency-Dependent Couplings
PublicationThis paper presents a novel synthesis technique for microwave bandpass filters with frequency-dependent couplings. The proposed method is based on the systematic extraction of a dispersive coupling coefficient using an optimization technique based on the zeros and poles of scattering parameters representing two coupled resonators. The application of this method of synthesis is illustrated using two examples involving four and five-pole...
-
Use of ICT infrastructure for teaching HPC
PublicationIn this paper we look at modern ICT infrastructure as well as the curriculum used for conducting a contemporary course on high performance computing taught over several years at the Faculty of Electronics, Telecommunications and Informatics, Gdansk University of Technology, Poland. We describe the infrastructure in the context of teaching parallel programming at the cluster level using MPI and at the node level using OpenMP and CUDA. We present...