Search results for: MULTI-CORE CPU
-
Parallelization of Selected Algorithms on Multi-core CPUs, a Cluster and in a Hybrid CPU+Xeon Phi Environment
Publication: In the paper we present parallel implementations as well as execution times and speed-ups of three different algorithms run in various environments such as on a workstation with multi-core CPUs and a cluster. The parallel codes, implementing the master-slave model in C+MPI, differ in computation to communication ratios. The considered problems include: a genetic algorithm with various ratios of master processing time to communication...
-
Multi Queue Approach for Network Services Implemented for Multi Core CPUs
Publication: Multiple-core processors have already become the dominant design for general-purpose CPUs. Incarnations of this technology are present in solutions dedicated to areas such as computer graphics, signal processing and computer networking. Since the key functionality of network core components is fast packet servicing, multicore technology, due to its multitasking ability, seems useful to support packet processing. Dedicated...
-
Performance assessment of OpenMP constructs and benchmarks using modern compilers and multi-core CPUs
Publication: Considering ongoing developments of modern CPUs, especially in the context of increasing numbers of cores, cache memory and architectures, as well as of compilers, there is a constant need for benchmarking representative and frequently run workloads. The key metric is speed-up as the computational power of modern CPUs stems mainly from using multiple cores. In this paper, we show and discuss results from running codes such as:...
-
Multi-core and Multiprocessor Implementation of Numerical Integration in Finite Element Method
Publication: The paper presents techniques for accelerating the numerical integration process that appears in the Finite Element Method. The acceleration is achieved by taking advantage of multi-core and multiprocessor devices. It is shown that a multi-core implementation with OpenMP and GPU acceleration using the CUDA architecture allow one to achieve speedups by a factor of 5 and 10 on a CPU and GPUs, respectively.
-
Benchmarking Deep Neural Network Training Using Multi- and Many-Core Processors
Publication: In the paper we provide thorough benchmarking of deep neural network (DNN) training on modern multi- and many-core Intel processors in order to assess performance differences for various deep learning as well as parallel computing parameters. We present performance of DNN training for Alexnet, Googlenet, Googlenet_v2 as well as Resnet_50 for various engines used by the deep learning framework, for various batch sizes. Furthermore,...
-
Modified Preisach model of hysteresis in multi air gap ferrite core medium frequency transformer
Publication: This article presents the modified Preisach model of hysteresis for a 3-phase medium frequency transformer in a 100 kW dual active bridge converter. The transformer magnetic core is assembled out of ferrite I-cores, which results in multiple parasitic air gaps. For this transformer, the hysteresis loops were measured and parameters of the Preisach model were determined. The Preisach distribution function is approximated with a...
-
Multi-core processing system for real-time image processing in embedded computer vision applications
Publication: The article describes the architecture of a multi-core programmable system for real-time image processing. Image data are processed simultaneously by all processors. The system enables low-level image processing, e.g. background subtraction, detection of moving objects, geometric transformations, indexing of detected objects, assessment of their shape and basic analysis of motion trajectories. English abstract: This paper...
-
Analyzing energy/performance trade-offs with power capping for parallel applications on modern multi and many core processors
Publication: In the paper we present extensive results from analyzing energy/performance trade-offs with power capping observed on four different modern CPUs, for three different parallel applications: 2D heat distribution, numerical integration and Fast Fourier Transform. The CPUs tested represent both multi-core type CPUs such as Intel® Xeon® E5, desktop and mobile i7 as well as many-core Intel® Xeon Phi™ x200 but also server, desktop...
-
Multi-functional monodispersed SiO2-TiO2 core-shell nanostructure and TEOS in the consolidation of archaeological lime mortars surfaces
Publication: Traditional archaeological lime mortars are susceptible to many environmental conditions such as the impact of water (rain, humidity, groundwater, etc.), temperature variation, wind and/or pollution. Accordingly, this research aims to provide a newly assessed multifunctional nano-coating for the protection of archaeological lime mortar. For this, the study combined physicochemical and mechanical characterizations...
-
Effective Permeability of Multi Air Gap Ferrite Core 3-Phase Medium Frequency Transformer in Isolated DC-DC Converters
Publication: The magnetizing inductance of the medium frequency transformer (MFT) impacts the performance of the isolated dc-dc power converters. The ferrite material is considered for high power transformers but it requires an assembly of type “I” cores resulting in a multi air gap structure of the magnetic core. The authors claim that the multiple air gaps are randomly distributed and that the average air gap length is unpredictable at the...
-
Finite element matrix generation on a GPU
Publication: This paper presents an efficient technique for fast generation of sparse systems of linear equations arising in computational electromagnetics in a finite element method using higher order elements. The proposed approach employs a graphics processing unit (GPU) for both numerical integration and matrix assembly. The performance results obtained on a test platform consisting of a Fermi GPU (1x Tesla C2075) and a CPU (2x twelve-core...
-
DEPO: A dynamic energy‐performance optimizer tool for automatic power capping for energy efficient high‐performance computing
Publication: In the article we propose an automatic power capping software tool DEPO that allows one to perform runtime optimization of performance and energy related metrics. For an assumed application model with an initialization phase followed by a running phase with uniform compute and memory intensity, the tool performs automatic tuning engaging one of the two exploration algorithms, linear search (LS) and golden section search (GSS), finds...
-
Tuning a Hybrid GPU-CPU V-Cycle Multilevel Preconditioner for Solving Large Real and Complex Systems of FEM Equations
Publication: This letter presents techniques for tuning an accelerated preconditioned conjugate gradient solver with a multilevel preconditioner. The solver is optimized for a fast solution of sparse systems of equations arising in computational electromagnetics in a finite element method using higher-order elements. The goal of the tuning is to increase the throughput while at the same time reducing the memory requirements in order to allow...
-
Assessment of OpenMP Master–Slave Implementations for Selected Irregular Parallel Applications
Publication: The paper investigates various implementations of a master–slave paradigm using the popular OpenMP API and their relative performance on modern multi-core workstation CPUs. It is assumed that a master partitions available input into a batch of a predefined number of data chunks, which are then processed in parallel by a set of slaves, and the procedure is repeated until all input data has been processed. The paper experimentally...
-
Implementation of Coprocessor for Integer Multiple Precision Arithmetic on Zynq Ultrascale+ MPSoC
Publication: Recently, we have opened the source code of a coprocessor for multiple-precision arithmetic (MPA). In this contribution, the implementation and benchmarking results for this MPA coprocessor are presented on a modern Zynq Ultrascale+ multiprocessor system on chip, which combines a field-programmable gate array with a quad-core ARM Cortex-A53 64-bit central processing unit (CPU). In our benchmark, a single coprocessor can be up to 4.5 times...
-
IP Core of Coprocessor for Multiple-Precision-Arithmetic Computations
Publication: In this paper, we present an IP core of coprocessor supporting computations requiring integer multiple-precision arithmetic (MPA). Whilst standard 32/64-bit arithmetic is sufficient to solve many computing problems, there are still applications that require higher numerical precision. Hence, the purpose of the developed coprocessor is to support and offload central processing unit (CPU) in such computations. The developed digital...
-
Generation of large finite-element matrices on multiple graphics processors
Publication: This paper presents techniques for generating very large finite-element matrices on a multicore workstation equipped with several graphics processing units (GPUs). To overcome the low memory size limitation of the GPUs, and at the same time to accelerate the generation process, we propose to generate the large sparse linear systems arising in finite-element analysis in an iterative manner on several GPUs and to use the graphics...
-
Open-Source Coprocessor for Integer Multiple Precision Arithmetic
Publication: This paper presents an open-source digital circuit of the coprocessor for an integer multiple-precision arithmetic (MPA). The purpose of this coprocessor is to support a central processing unit (CPU) by offloading computations requiring integer precision higher than 32/64 bits. The coprocessor is developed using the very high speed integrated circuit hardware description language (VHDL) as an intellectual property (IP) core. Therefore,...
-
FPGA Acceleration of Matrix-Assembly Phase of RWG-Based MoM
Publication: In this letter, the field-programmable-gate-array accelerated implementation of matrix-assembly phase of the method of moments (MoM) is presented. The solution is based on a discretization of the frequency-domain mixed potential integral equation using the Rao-Wilton-Glisson basis functions and their extension to wire-to-surface junctions. To take advantage of the given hardware resources (i.e., Xilinx Alveo U200 accelerator card),...
-
Acceleration of Electromagnetic Simulations on Reconfigurable FPGA Card
Publication: In this contribution, the hardware acceleration of electromagnetic simulations on the reconfigurable field-programmable-gate-array (FPGA) card is presented. In the developed implementation of scientific computations, the matrix-assembly phase of the method of moments (MoM) is accelerated on the Xilinx Alveo U200 card. The computational method involves discretization of the frequency-domain mixed potential integral equation using...
-
Tuning matrix-vector multiplication on GPU
Publication: Matrix-vector multiplication (matvec) is a cornerstone operation in iterative methods for solving large sparse systems of equations, such as the conjugate gradients method (cg), the minimal residual method (minres) and the generalized residual method (gmres), and it influences the overall performance of those methods. An implementation of matvec is particularly demanding when one executes computations on a GPU (Graphics...
-
Benchmarking Performance of a Hybrid Intel Xeon/Xeon Phi System for Parallel Computation of Similarity Measures Between Large Vectors
Publication: The paper deals with parallelization of computing similarity measures between large vectors. Such computations are important components within many applications and consequently are of high importance. Rather than focusing on optimization of the algorithm itself, assuming specific measures, the paper assumes a general scheme for finding similarity measures for all pairs of vectors and investigates optimizations for scalability...
-
Optymalizacja zasobów chmury obliczeniowej z wykorzystaniem inteligentnych agentów w zdalnym nauczaniu [Optimization of cloud computing resources using intelligent agents in remote teaching]
Publication: The dissertation concerns the optimization of cloud computing resources in which intelligent agents are applied to remote teaching. The issue is important in education, where modern technologies such as the Internet of Things, augmented and virtual reality, and deep learning are used in a cloud computing environment. The issue is also important when a pandemic forces the use of remote teaching on a large scale...
-
A memory efficient and fast sparse matrix vector product on a Gpu
Publication: This paper proposes a new sparse matrix storage format which allows an efficient implementation of a sparse matrix vector product on a Fermi Graphics Processing Unit (GPU). Unlike previous formats it has both low memory footprint and good throughput. The new format, which we call Sliced ELLR-T has been designed specifically for accelerating the iterative solution of a large sparse and complex-valued system of linear equations arising...
-
Implementation of Addition and Subtraction Operations in Multiple Precision Arithmetic
Publication: In this paper, we present a digital circuit of arithmetic unit implementing addition and subtraction operations in multiple-precision arithmetic (MPA). This adder-subtractor unit is a part of MPA coprocessor supporting and offloading the central processing unit (CPU) in computations requiring precision higher than 32/64 bits. Although addition and subtraction operations of two n-digit numbers require O(n) operations, the efficient...
-
Evaluation the effectiveness of virtual machine integrated with CPU
Publication: The paper presents the effectiveness of an example CPU with an integrated virtual machine. The idea and implementation of the virtual machine are shown. The following sections describe the reference CPU and a sample virtual machine. Finally, the optimality of the translation process is analysed.
-
Auto-tuning methodology for configuration and application parameters of hybrid CPU + GPU parallel systems based on expert knowledge
Publication: Auto-tuning of configuration and application parameters makes it possible to achieve significant performance gains in many contemporary compute-intensive applications. Feasible search spaces of parameters tend to become too big to allow for exhaustive search in the auto-tuning process. Expert knowledge about the utilized computing systems becomes useful to prune the search space and new methodologies are needed in the face of emerging heterogeneous...
-
Wielopoziomowy przekształtnik trakcyjny SiC z izolacją od sieci 3kV DC realizowaną za pomocą transformatorów 30kHz do napędów EZT [Multilevel SiC traction converter with isolation from the 3 kV DC network provided by 30 kHz transformers for EMU drives]
Publication: The paper presents a multilevel isolated cascaded DC-AC converter with 1.2 kV SiC MOSFET transistors, intended for the drives of electric multiple units (EMUs). The proposed converter design, intended for operation from a 3 kV DC traction network, meets the assumptions of a Power Electronic Traction Transformer. The modular design based on low-voltage...
-
Parallelization of large vector similarity computations in a hybrid CPU+GPU environment
Publication: The paper presents design, implementation and tuning of a hybrid parallel OpenMP+CUDA code for computation of similarity between pairs of a large number of multidimensional vectors. The problem has a wide range of applications, and consequently its optimization is of high importance, especially on currently widespread hybrid CPU+GPU systems targeted in the paper. The following are presented and tested for computation of all vector...
-
Implementation of FDTD-compatible Green's function on heterogeneous CPU-GPU parallel processing system
Publication: This paper presents an implementation of the FDTD-compatible Green's function on a heterogeneous parallel processing system. The developed implementation simultaneously applies the computational power of the central processing unit (CPU) and the graphics processing unit (GPU) to the computational tasks best suited to each architecture. Recently, a closed-form expression for this discrete Green's function (DGF) was derived, which facilitates...
-
Efficient parallel implementation of crowd simulation using a hybrid CPU+GPU high performance computing system
Publication: In the paper we present a modern efficient parallel OpenMP+CUDA implementation of crowd simulation for hybrid CPU+GPU systems and demonstrate its higher performance over CPU-only and GPU-only implementations for several problem sizes including 10 000, 50 000, 100 000, 500 000 and 1 000 000 agents. We show how performance varies for various tile sizes and what CPU–GPU load balancing settings should be preferred for various domain...
-
Nodal models of Pressurized Water Reactor core for control purposes – A comparison study
Publication: The paper focuses on the presentation and comparison of basic nodal and expanded multi-nodal models of the Pressurized Water Reactor (PWR) core, which includes neutron kinetics, heat transfer between fuel and coolant, and internal and external reactivity feedback processes. In the expanded multi-nodal model, the authors introduce a novel approach to the implementation of thermal power distribution phenomena into the multi-node...
-
On Wrinkling in Sandwich Panels with an Orthotropic Core
Publication: This paper deals with the local loss of stability (wrinkling) problem of a thin facing of a sandwich panel. Classical solutions to the problem of facing instability resting on a homogeneous and isotropic substructure (a core) are compared. The relations between strain energy components associated with different forms of core deformations are discussed. Next, a new solution for the orthotropic core is presented in detail, which...
-
Study on CPU and RAM Resource Consumption of Mobile Devices using Streaming Services
Publication: Streaming multimedia services have become very popular in recent years, due to the development of wireless networks. With the growing number of mobile devices worldwide, service providers offer dedicated applications that make it possible to deliver on-demand audio and video content anytime and everywhere. The aim of this study was to compare different streaming services and investigate their impact on the CPU and RAM resources, with respect...
-
Investigation of Parallel Data Processing Using Hybrid High Performance CPU + GPU Systems and CUDA Streams
Publication: The paper investigates parallel data processing in a hybrid CPU+GPU(s) system using multiple CUDA streams for overlapping communication and computations. This is crucial for efficient processing of data, in particular incoming data stream processing that would naturally be forwarded using multiple CUDA streams to GPUs. Performance is evaluated for various compute time to host-device communication time ratios, numbers of CUDA streams,...
-
Method of determining the residual fluxes in transformer core
Publication: The article presents a method of calculating the residual induction in transformer columns. The method is based on measuring the magnetic induction at selected points around the transformer core. The values of residual induction are calculated as a linear combination of the measurement results.
-
Bioactive core material for porous load-bearing implants
Publication: The current state of knowledge on biodegradable materials is reviewed. Among a variety of investigated materials, those composed of polymers and ceramics may be considered the only candidates for a core material in porous titanium alloy. Collagen and chitosan among natural polymers, polyhydroxy acids among synthetic polymers, and hydroxyapatite and tricalcium phosphate among ceramics are proposed for further research. Three essential...
-
Core-Shell Nanoparticles with Hyperbranched Poly(arylene-oxindole) Interiors
Publication: Core-shell type star polymers composed of poly(tert-butyl acrylate) (poly(t-BuA)) arms and 100% hyperbranched poly(arylene-oxindole) interiors were synthesized via the "core-first" method. Atom transfer radical polymerization of t-BuA initiated by 2-bromopropionyl terminal groups of the hyperbranched core was applied for the synthesis of the stars. The resultant star structures were characterized by gel permeation chromatography...
-
TiO2-based magnetic nanocomposites with core-shell structure
Publication: The main aim of the doctoral dissertation was preparation and characterization of photocatalysts, with particular emphasis on modified titanium (IV) oxide photocatalysts, which can be applied for the degradation of organic pollutants not susceptible to biodegradation. A particularly important aspect of the work was the development of preparation method of nanocomposites with the magnetic core-shell and photocatalyst shell (TiO2)...
-
Gaze tracking in multi-display environment
Publication: This paper presents the basic ideas of eye and gaze tracking in a multiple-display environment. The algorithm for display detection and identification is described, as well as the rules for gaze interaction in a multi-display environment. The core of the method is to use special LED markers and eye and scene tracking glasses. The scene tracking camera registers the marker positions, which are then represented as a cloud of points. Analyzing the...
-
Multi-level Virtualization and Its Impact on System Performance in Cloud Computing
Publication: The results of benchmarking tests of multi-level virtualized environments are presented. The performance impact of hardware virtualization, container-type isolation and programming-level abstraction is analysed. The comparison is made on the basis of a proposed score metric that allows different aspects of performance to be compared. There is general performance (CPU and memory), networking, disk operations and application-like...
-
Przestrzeń publiczna a przemiany miasta [Public space and the transformation of the city]
Publication: The work consists of twelve chapters. The first part introduces the issues of public space. In the second chapter the author recalls selected theoretical concepts, important for the subject of the research, based on a multi-paradigm model. The third chapter revolves around the ancient model of public space, which, owing to the extraordinary wealth of solutions in this area, derives precisely from that root....
-
Assessment of dynamic characteristics of thin cylindrical sandwich panels with magnetorheological core
Publication: Based on the equivalent single-layer linear theory for laminated shells, free and forced vibrations of thin cylindrical sandwich panels with magnetorheological core are studied. Five variants of available magnetorheological elastomers differing in their composition and physical properties are considered for smart viscoelastic core. Coupled differential equations in terms of displacements based on the generalized kinematic hypotheses...
-
Core-Periphery Model
Publication
-
Numerical investigation of the core eccentricity effect on wave propagation in embedded waveguide
Publication: The paper presents results of theoretical and numerical investigation of guided wave propagation in two-layer bars with geometric imperfections in the form of an eccentric location of the steel core. A steel rod 1 cm in diameter embedded in a composite mortar-type cover with an external diameter of 5 cm has been taken into consideration. Several different rods with variable size of eccentricity are analysed. Results for rods with...
-
The influence of core material on strength properties of hybrid sandwich panels
Publication: With high fuel prices and more restrictive safety and environmental regulations (including environment protection), increased interest in sandwich structures is being observed. One of the solutions with growing application potential is the steel sandwich panel. The construction consists of very thin steel plates (about 2 mm) and stiffeners between them. The main advantage of using such a solution is a very high strength to weight...
-
Manufacturing and measurements of triple-core, double-core, and twin-core single-mode soft-glass optical fibers
Publication
-
EM-Driven Multi-Objective Optimization of Antenna Structures in Multi-Dimensional Design Spaces
Publication: Feasible multi-objective optimization of antenna structures is presented. An initial set of Pareto optimal solutions is found using a multi-objective evolutionary algorithm (MOEA) working with a fast surrogate antenna model obtained by kriging interpolation of coarse-discretization EM simulation data. To make the surrogate construction computationally feasible in multi-dimensional design space, the space subset containing non-dominated...
-
Multi agent grid systems
Publication: This chapter presents the idea of merging grid and volunteer systems with multi-agent systems. It gives some basics concerning multi-agent systems and the most widely followed standard. Existing systems of this kind are discussed in order to finally present the possibilities of introducing agents into the Comcute system.
-
On the mechanism of photocatalytic reactions on CuxO@TiO2 core–shell photocatalysts
Publication: Titania (titanium(IV) oxide) is a highly active, stable, cheap and abundant photocatalyst, and is thus commonly applied in various environmental applications. However, two main shortcomings of titania, i.e., charge carrier recombination and inactivity under visible-light (vis) irradiation, should be overcome for widespread commercialization. Accordingly, titania has been doped, surface modified and coupled with various ions/compounds,...