Search results for: CPU

Evaluation the effectiveness of virtual machine integrated with CPU

Publication

T. Bieliński

- Year 2013

The paper presents the effectiveness of an example CPU with an integrated virtual machine. The idea and implementation of the virtual machine are shown. The following sections describe the reference CPU and a sample virtual machine. Finally, the optimality of the translation process is analysed.

Parallelization of Selected Algorithms on Multi-core CPUs, a Cluster and in a Hybrid CPU+Xeon Phi Environment

Publication

- Advances in Intelligent Systems and Computing - Year 2017

In the paper we present parallel implementations as well as execution times and speed-ups of three different algorithms run in various environments such as on a workstation with multi-core CPUs and a cluster. The parallel codes, implementing the master-slave model in C+MPI, differ in computation to communication ratios. The considered problems include: a genetic algorithm with various ratios of master processing time to communication...

Full text available to download

Parallelization of large vector similarity computations in a hybrid CPU+GPU environment

Publication

P. Czarnul

- JOURNAL OF SUPERCOMPUTING - Year 2018

The paper presents design, implementation and tuning of a hybrid parallel OpenMP+CUDA code for computation of similarity between pairs of a large number of multidimensional vectors. The problem has a wide range of applications, and consequently its optimization is of high importance, especially on currently widespread hybrid CPU+GPU systems targeted in the paper. The following are presented and tested for computation of all vector...

Full text available to download

Study on CPU and RAM Resource Consumption of Mobile Devices using Streaming Services

Publication

- Year 2021

Streaming multimedia services have become very popular in recent years, due to the development of wireless networks. With the growing number of mobile devices worldwide, service providers offer dedicated applications that deliver on-demand audio and video content anytime and everywhere. The aim of this study was to compare different streaming services and investigate their impact on the CPU and RAM resources, with respect...

Full text to download in external service

Implementation of FDTD-compatible Green's function on heterogeneous CPU-GPU parallel processing system

Publication

T. Stefański

- Progress in Electromagnetics Research-PIER - Year 2013

This paper presents an implementation of the FDTD-compatible Green's function on a heterogeneous parallel processing system. The developed implementation simultaneously utilizes the computational power of the central processing unit (CPU) and the graphics processing unit (GPU) for the computational tasks best suited to each architecture. Recently, a closed-form expression for this discrete Green's function (DGF) was derived, which facilitates...

Full text to download in external service

The Speedup Analysis in GEM Detector Based Acquisition System Algorithms with CPU and PCIe Cards

Publication

R. Krawczyk
P. Linczuk
P. Kolasinski
A. Wojenski
G. Kasprowicz
K. Pozniak
R. Romaniuk
W. Zabolotny
P. Zienkiewicz
T. Czarski... and 2 others

- Acta Physica Polonica B Proceedings Supplement - Year 2016

Full text to download in external service

Efficient parallel implementation of crowd simulation using a hybrid CPU+GPU high performance computing system

Publication

- SIMULATION MODELLING PRACTICE AND THEORY - Year 2023

In the paper we present a modern efficient parallel OpenMP+CUDA implementation of crowd simulation for hybrid CPU+GPU systems and demonstrate its higher performance over CPU-only and GPU-only implementations for several problem sizes including 10 000, 50 000, 100 000, 500 000 and 1 000 000 agents. We show how performance varies for various tile sizes and what CPU–GPU load balancing settings shall be preferred for various domain...

Full text available to download

Investigation of Parallel Data Processing Using Hybrid High Performance CPU + GPU Systems and CUDA Streams

Publication

P. Czarnul

- COMPUTING AND INFORMATICS - Year 2020

The paper investigates parallel data processing in a hybrid CPU+GPU(s) system using multiple CUDA streams for overlapping communication and computations. This is crucial for efficient processing of data, in particular incoming data stream processing that would naturally be forwarded using multiple CUDA streams to GPUs. Performance is evaluated for various compute time to host-device communication time ratios, numbers of CUDA streams,...

Full text available to download

Auto-tuning methodology for configuration and application parameters of hybrid CPU + GPU parallel systems based on expert knowledge

Publication

- Year 2020

Auto-tuning of configuration and application parameters allows to achieve significant performance gains in many contemporary compute-intensive applications. Feasible search spaces of parameters tend to become too big to allow for exhaustive search in the auto-tuning process. Expert knowledge about the utilized computing systems becomes useful to prune the search space and new methodologies are needed in the face of emerging heterogeneous...

Full text available to download

Tuning a Hybrid GPU-CPU V-Cycle Multilevel Preconditioner for Solving Large Real and Complex Systems of FEM Equations

Publication

- IEEE Antennas and Wireless Propagation Letters - Year 2011

This letter presents techniques for tuning an accelerated preconditioned conjugate gradient solver with a multilevel preconditioner. The solver is optimized for a fast solution of sparse systems of equations arising in computational electromagnetics in a finite element method using higher-order elements. The goal of the tuning is to increase the throughput while at the same time reducing the memory requirements in order to allow...

Full text to download in external service

Multi Queue Approach for Network Services Implemented for Multi Core CPUs

Publication

- Journal of Telecommunications and Information Technology - Year 2011

Multiple core processors have already become the dominant design for general purpose CPUs. Incarnations of this technology are present in solutions dedicated to areas such as computer graphics, signal processing and computer networking. Since the key functionality of network core components is fast packet servicing, multicore technology, due to its multitasking ability, seems useful to support packet processing. Dedicated...

Full text available to download

Performance assessment of OpenMP constructs and benchmarks using modern compilers and multi-core CPUs

Publication

- Year 2023

Considering ongoing developments of both modern CPUs, especially in the context of increasing numbers of cores, cache memory and architectures as well as compilers there is a constant need for benchmarking representative and frequently run workloads. The key metric is speed-up as the computational power of modern CPUs stems mainly from using multiple cores. In this paper, we show and discuss results from running codes such as:...

Full text to download in external service

Investigation of Performance and Energy Consumption of Tokenization Algorithms on Multi-core CPUs Under Power Capping

Publication

- Year 2024

In this paper we investigate performance-energy optimization of tokenizer algorithm training using power capping. We focus on parallel, multi-threaded implementations of Byte Pair Encoding (BPE), Unigram, WordPiece, and WordLevel run on two systems with different multi-core CPUs: Intel Xeon 6130 and desktop Intel i7-13700K. We analyze execution times and energy consumption for various numbers of threads and various power caps and...

Full text available to download

Optimization of Execution Time under Power Consumption Constraints in a Heterogeneous Parallel System with GPUs and CPUs

Publication

- Year 2014

The paper proposes an approach for parallelization of computations across a collection of clusters with heterogeneous nodes with both GPUs and CPUs. The proposed system partitions input data into chunks and assigns them to particular devices for processing using OpenCL kernels defined by the user. The system is able to minimize the execution time of the application while maintaining the power consumption of the utilized GPUs and...

Full text to download in external service

KernelHive: a new workflow-based framework for multilevel high performance computing using clusters and workstations with CPUs and GPUs

Publication

- CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE - Year 2016

The paper presents a new open-source framework called KernelHive for multilevel parallelization of computations among various clusters, cluster nodes, and finally, among both CPUs and GPUs for a particular application. An application is modeled as an acyclic directed graph with a possibility to run nodes in parallel and automatic expansion of nodes (called node unrolling) depending on the number of computation units available....

Full text to download in external service

Preconditioners with Low Memory Requirements for Higher-Order Finite-Element Method Applied to Solving Maxwell’s Equations on Multicore CPUs and GPUs

Publication

A. Dziekoński
G. Fotyga
M. Mrozowski

- IEEE Access - Year 2018

This paper discusses two fast implementations of the conjugate gradient iterative method using a hierarchical multilevel preconditioner to solve the complex-valued, sparse systems obtained using the higher order finite-element method applied to the solution of the time-harmonic Maxwell equations. In the first implementation, denoted PCG-V, a classical V-cycle is applied and the system of equations on the lowest level is solved...

Full text available to download

Implementation of Coprocessor for Integer Multiple Precision Arithmetic on Zynq Ultrascale+ MPSoC

Publication

T. Stefański
K. Rudnicki
W. Żebrowski

- Year 2021

Recently, we have opened the source code of coprocessor for multiple-precision arithmetic (MPA). In this contribution, the implementation and benchmarking results for this MPA coprocessor are presented on modern Zynq Ultrascale+ multiprocessor system on chip, which combines field-programmable gate array with quad-core ARM Cortex-A53 64-bit central processing unit (CPU). In our benchmark, a single coprocessor can be up to 4.5 times...

Full text to download in external service

Jacobi and gauss-seidel preconditioned complex conjugate gradient method with GPU acceleration for finite element method

Publication

- Year 2010

In this paper two implementations of iterative solvers for solving complex symmetric and sparse systems resulting from finite element method applied to wave equation are discussed. The problem under investigation is a dielectric resonator antenna (DRA) discretized by FEM with vector elements of the second order (LT/QN). The solvers use the preconditioned conjugate gradient (pcg) method implemented on Graphics Processing Unit (GPU)...

Full text to download in external service

Finite element matrix generation on a GPU

Publication

- Progress in Electromagnetics Research-PIER - Year 2012

This paper presents an efficient technique for fast generation of sparse systems of linear equations arising in computational electromagnetics in a finite element method using higher order elements. The proposed approach employs a graphics processing unit (GPU) for both numerical integration and matrix assembly. The performance results obtained on a test platform consisting of a Fermi GPU (1x Tesla C2075) and a CPU (2x twelve-core...

Full text to download in external service

Characterizing the Scalability of Graph Convolutional Networks on Intel® PIUMA

Publication

M. J. Adiletta
J. J. Tithi
E. Farsarakis
G. Gerogiannis
R. Adolf
R. Benke
S. Kashyap
S. Hsia
K. Lakhotia
F. Petrini... and 2 others

- Year 2023

Large-scale Graph Convolutional Network (GCN) inference on traditional CPU/GPU systems is challenging due to a large memory footprint, sparse computational patterns, and irregular memory accesses with poor locality. Intel’s Programmable Integrated Unified Memory Architecture (PIUMA) is designed to address these challenges for graph analytics. In this paper, a detailed characterization of GCNs is presented using the Open-Graph Benchmark...

Full text to download in external service

Acceleration of Electromagnetic Simulations on Reconfigurable FPGA Card

Publication

T. Topa
A. Noga
T. Stefański

- Year 2023

In this contribution, the hardware acceleration of electromagnetic simulations on the reconfigurable field-programmable-gate-array (FPGA) card is presented. In the developed implementation of scientific computations, the matrix-assembly phase of the method of moments (MoM) is accelerated on the Xilinx Alveo U200 card. The computational method involves discretization of the frequency-domain mixed potential integral equation using...

Full text to download in external service

GPU-Accelerated LOBPCG Method with Inexact Null-Space Filtering for Solving Generalized Eigenvalue Problems in Computational Electromagnetics Analysis with Higher-Order FEM

Publication

- Communications in Computational Physics - Year 2017

This paper presents a GPU-accelerated implementation of the Locally Optimal Block Preconditioned Conjugate Gradient (LOBPCG) method with an inexact null-space filtering approach to find eigenvalues in electromagnetics analysis with higher-order FEM. The performance of the proposed approach is verified using the Kepler (Tesla K40c) graphics accelerator, and is compared to the performance of the implementation based on functions from...

Full text to download in external service

Parallel implementation of the DGF-FDTD method on GPU Using the CUDA technology

Publication

- Year 2016

The discrete Green's function (DGF) formulation of the finite-difference time-domain method (FDTD) is accelerated on a graphics processing unit (GPU) by means of the Compute Unified Device Architecture (CUDA) technology. In the developed implementation of the DGF-FDTD method, a new analytic expression for dyadic DGF derived based on scalar DGF is employed in computations. The DGF-FDTD method on GPU returns solutions that are compatible...

Full text to download in external service

Acceleration of the DGF-FDTD method on GPU using the CUDA technology

Publication

- Year 2015

We present a parallel implementation of the discrete Green's function formulation of the finite-difference time-domain (DGF-FDTD) method on a graphics processing unit (GPU). The compute unified device architecture (CUDA) parallel computing platform is applied in the developed implementation. For the sake of example, arrays of Yagi-Uda antennas were simulated with the use of DGF-FDTD on GPU. The efficiency of parallel computations...

Full text to download in external service

FPGA Acceleration of Matrix-Assembly Phase of RWG-Based MoM

Publication

T. Topa
A. Noga
T. Stefański

- IEEE Antennas and Wireless Propagation Letters - Year 2022

In this letter, the field-programmable-gate-array accelerated implementation of matrix-assembly phase of the method of moments (MoM) is presented. The solution is based on a discretization of the frequency-domain mixed potential integral equation using the Rao-Wilton-Glisson basis functions and their extension to wire-to-surface junctions. To take advantage of the given hardware resources (i.e., Xilinx Alveo U200 accelerator card),...

Full text available to download

Serce dzielnicy w stanie embrionalnego rozwoju - śrómieście reurbanizowanego wielkiego osiedla w polskim mieście metropolitalnym = Heart of the districts in embrional faze of development -city core structures for the large scale housing in Polish metropolitan city

Publication

G. Rembarz

- Czasopismo Techniczne - Year 2008

Over the last 10 years, an intensification of infill development in the city centre of Gdańsk has been observed. On the one hand, this continues the reconstruction of the historic city centre; on the other, it is a revitalization of districts dominated by buildings from the beginning of the century, covering former military and industrial inner-city areas, among others the former Gdańsk Shipyard and the barracks in Wrzeszcz. Numerous commercial projects lead to significant densification with functions...

Full text available to download

Open-Source Coprocessor for Integer Multiple Precision Arithmetic

Publication

K. Rudnicki
T. Stefański
W. Żebrowski

- Electronics - Year 2020

This paper presents an open-source digital circuit of the coprocessor for an integer multiple-precision arithmetic (MPA). The purpose of this coprocessor is to support a central processing unit (CPU) by offloading computations requiring integer precision higher than 32/64 bits. The coprocessor is developed using the very high speed integrated circuit hardware description language (VHDL) as an intellectual property (IP) core. Therefore,...

Full text available to download

Generation of large finite-element matrices on multiple graphics processors

Publication

- INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN ENGINEERING - Year 2013

This paper presents techniques for generating very large finite-element matrices on a multicore workstation equipped with several graphics processing units (GPUs). To overcome the low memory size limitation of the GPUs, and at the same time to accelerate the generation process, we propose to generate the large sparse linear systems arising in finite-element analysis in an iterative manner on several GPUs and to use the graphics...

Full text to download in external service

Analyzing energy/performance trade-offs with power capping for parallel applications on modern multi and many core processors

Publication

- Annals of Computer Science and Information Systems - Year 2018

In the paper we present extensive results from analyzing energy/performance trade-offs with power capping observed on four different modern CPUs, for three different parallel applications such as 2D heat distribution, numerical integration and Fast Fourier Transform. The CPUs tested represent both multi-core type CPUs such as Intel® Xeon® E5, desktop and mobile i7 as well as many-core Intel® Xeon Phi™ x200 but also server, desktop...

Full text available to download

Single and Dual-GPU Generalized Sparse Eigenvalue Solvers for Finding a Few Low-Order Resonances of a Microwave Cavity Using the Finite-Element Method

Publication

- RADIOENGINEERING - Year 2018

This paper presents two fast generalized eigenvalue solvers for sparse symmetric matrices that arise when electromagnetic cavity resonances are investigated using the higher-order finite element method (FEM). To find a few low-order resonances, the locally optimal block preconditioned conjugate gradient (LOBPCG) algorithm with null-space deflation is applied. The computations are expedited by using one or two graphical processing...

Full text available to download

Multi-core and Multiprocessor Implementation of Numerical Integration in Finite Element Method

Publication

- Year 2012

The paper presents techniques for accelerating the numerical integration process that appears in the Finite Element Method. The acceleration is achieved by taking advantage of multi-core and multiprocessor devices. It is shown that a multi-core implementation with OpenMP and GPU acceleration using the CUDA architecture achieve speedups by a factor of 5 and 10 on a CPU and GPUs, respectively.

Block Conjugate Gradient Method with Multilevel Preconditioning and GPU Acceleration for FEM Problems in Electromagnetics

Publication

- IEEE Antennas and Wireless Propagation Letters - Year 2018

In this paper a GPU-accelerated block conjugate gradient solver with multilevel preconditioning is presented for solving large system of sparse equations with multiple right hand-sides (RHSs) which arise in the finite-element analysis of electromagnetic problems. We demonstrate that blocking reduces the time to solution significantly and allows for better utilization of the computing power of GPUs, especially when the system matrix...

Full text to download in external service

Fast clutter cancellation for noise radars via waveform design

Publication

M. Meller

- IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS - Year 2014

Canceling clutter is an important, but very expensive part of signal processing in noise radars. It is shown that considerable improvements can be made to a simple least squares canceler if minor constraints are imposed onto noise waveform. Using a combination of FPGA and CPU, the proposed scheme is capable of canceling both stationary clutter and moving targets in real-time, even for high sampling rates.

Full text to download in external service

Document Agents with the Intelligent Negotiations Capability

Publication

- Year 2015

The paper focuses on augmenting proactive document-agents with built-in intelligence to enable them to recognize the execution context provided by devices visited during the business process, and to reach a collaboration agreement despite their conflicting requirements. We propose a solution based on neural networks to improve simple multi-issue negotiation between the document and the device, practically with no excessive cost...

Multi-level Virtualization and Its Impact on System Performance in Cloud Computing

Publication

- Communications in Computer and Information Science - Year 2016

The results of benchmarking tests of multi-level virtualized environments are presented. The performance impact of hardware virtualization, container-type isolation and programming-level abstraction is analysed. The comparison is based on a proposed score metric that allows different aspects of performance to be compared. These aspects are general performance (CPU and memory), networking, disk operations and application-like...

Full text available to download

Towards an efficient multi-stage Riemann solver for nuclear physics simulations

Publication

S. Cygert
J. Porter-Sobieraj
D. Kikoła
J. Sikorski
M. Słodkowski

- Year 2013

Relativistic numerical hydrodynamics is an important tool in high energy nuclear science. However, such simulations are extremely demanding in terms of computing power. This paper focuses on improving the speed of solving the Riemann problem with the MUSTA-FORCE algorithm by employing the CUDA parallel programming model. We also propose a new approach to 3D finite difference algorithms, which employ a GPU that uses surface memory....

Full text to download in external service

AUTOMATED NEGOTIATIONS OVER COLLABORATION PROTOCOL AGREEMENTS

Publication

J. Kaczorek

- Year 2015

The dissertation focuses on the augmentation of proactive document-agents with built-in intelligence to recognize execution context provided by devices visited during a business process, and to reach collaboration agreement despite conflicting requirements. The proposed solution, based on intelligent bargaining using neural networks to improve simple multi-issue negotiation between the document and the device, requires practically...

Optimization of Data Assignment for Parallel Processing in a Hybrid Heterogeneous Environment Using Integer Linear Programming

Publication

- COMPUTER JOURNAL - Year 2021

In the paper we investigate a practical approach to application of integer linear programming for optimization of data assignment to compute units in a multi-level heterogeneous environment with various compute devices, including CPUs, GPUs and Intel Xeon Phis. The model considers an application that processes a large number of data chunks in parallel on various compute units and takes into account computations, communication including...

Full text available to download

GPU Acceleration of Multilevel Solvers for Analysis of Microwave Components With Finite Element Method

Publication

- IEEE MICROWAVE AND WIRELESS COMPONENTS LETTERS - Year 2011

The letter discusses a fast implementation of the conjugate gradient iterative method with an E-field multilevel preconditioner applied to solving real symmetric and sparse systems obtained with the vector finite element method. In order to accelerate computations, a graphics processing unit (GPU) was used and a significant speed-up (2.61-fold) was achieved compared to a central processing unit (CPU) based approach. These results...

Full text to download in external service

Benchmarking Performance of a Hybrid Intel Xeon/Xeon Phi System for Parallel Computation of Similarity Measures Between Large Vectors

Publication

P. Czarnul

- INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING - Year 2016

The paper deals with parallelization of computing similarity measures between large vectors. Such computations are important components within many applications and consequently are of high importance. Rather than focusing on optimization of the algorithm itself, assuming specific measures, the paper assumes a general scheme for finding similarity measures for all pairs of vectors and investigates optimizations for scalability...

Full text available to download

GPU based implementation of Temperature-Vegetation Dryness Index for AVHRR3 Satellite Data

Publication

- Year 2014

The paper presents an implementation of the TVDI (Temperature-Vegetation-Dryness Index) algorithm on a GPU (Graphics Processing Unit). Calculation of this index is based on LST (Land Surface Temperature) and NDVI (Normalized Difference Vegetation Index). The discussed results are based on multi-spectral imagery retrieved from AVHRR3 sensors for the area of Poland. All phases of the TVDI implementation on the GPU are modified with respect to the CUDA platform....

Using Disparity Map for Moving Object Position Estimation in Pan Tilt Camera Images

Publication

T. Kocejko
J. Rumiński
J. Kang-Hyun

- Year 2022

In this paper we present an algorithm for rapid estimation of a moving object's position in images acquired from a pan-tilt camera. Detecting a moving object in an image acquired from a moving camera can be quite challenging. Standard methods that rely on analyzing two consecutive frames are not applicable due to the changing background. To overcome this problem we decided to evaluate the possibility of calculating a disparity...

Full text to download in external service

DEPO: A dynamic energy‐performance optimizer tool for automatic power capping for energy efficient high‐performance computing

Publication

- SOFTWARE-PRACTICE & EXPERIENCE - Year 2022

In the article we propose an automatic power capping software tool DEPO that allows one to perform runtime optimization of performance and energy related metrics. For an assumed application model with an initialization phase followed by a running phase with uniform compute and memory intensity, the tool performs automatic tuning engaging one of the two exploration algorithms—linear search (LS) and golden section search (GSS), finds...

Full text to download in external service

Implementation of TVDI calculation for coastal zone

Publication

T. Bieliński

- Year 2015

The paper shows an implementation of the TVDI (Temperature-Vegetation-Dryness Index) algorithm on a GPU (Graphics Processing Unit). Calculation of this index is based on LST (Land Surface Temperature) and NDVI (Normalized Difference Vegetation Index). The discussed results are based on multi-spectral imagery retrieved from AVHRR3 sensors for the area of Poland, especially from the region of the Gdańsk coastal zone. All phases of TVDI implementation...

Modeling and Simulation for Exploring Power/Time Trade-off of Parallel Deep Neural Network Training

Publication

P. Rościszewski

- Procedia Computer Science - Year 2017

In the paper we tackle bi-objective execution time and power consumption optimization problem concerning execution of parallel applications. We propose using a discrete-event simulation environment for exploring this power/time trade-off in the form of a Pareto front. The solution is verified by a case study based on a real deep neural network training application for automatic speech recognition. A simulation lasting over 2 hours...

Full text available to download

Optymalizacja zasobów chmury obliczeniowej z wykorzystaniem inteligentnych agentów w zdalnym nauczaniu = Optimization of cloud computing resources using intelligent agents in remote teaching

Publication

P. Dryja

- Year 2023

The dissertation concerns the optimization of cloud computing resources using intelligent agents in remote teaching. The problem is important in education, where modern technologies such as the Internet of Things, augmented and virtual reality, and deep learning are used in a cloud computing environment. It is also important when a pandemic forces remote teaching to be used on a large scale...

Full text available to download

Big Data and the Internet of Things in Edge Computing for Smart City

Publication

J. Balicki
H. Balicka
P. Dryja
M. Tyszka

- Year 2019

Requests expressing collective human expectations and outcomes from city service tasks can be partially satisfied by processing Big Data provided to a city cloud via the Internet of Things. To improve the efficiency of the city clouds an edge computing has been introduced regarding Big Data mining. This intelligent and efficient distributed system can be developed for citizens that are supposed to be informed and educated by the...

Full text to download in external service

Tuning matrix-vector multiplication on GPU

Publication

- Zeszyty Naukowe Wydziału ETI Politechniki Gdańskiej. Technologie Informacyjne - Year 2010

A matrix times vector multiplication (matvec) is a cornerstone operation in iterative methods of solving large sparse systems of equations such as the conjugate gradients method (cg), the minimal residual method (minres), the generalized residual method (gmres) and exerts an influence on overall performance of those methods. An implementation of matvec is particularly demanding when one executes computations on a GPU (Graphics...

Analysis of cores affinity within the containerized environment based on selected IOT middleware - observations and recommendations

Publication

R. Kałaska

- TASK Quarterly - Year 2023

The Internet of Things gets bigger and bigger audiences. This topic is really popular in science and also in industry. There are many fields for research. One of them is efficient deployment against resource utilization. Another one is containerization within IoT platforms. One of the commonalities of these two topics is different CPU affinity against containerized platforms to get the best performance. There were plenty of papers...

Full text to download in external service

Linux scheduler improvement for time demanding network applications, running on Communication Platform Systems

Publication

- Journal of Telecommunications and Information Technology - Year 2009

Communication Platform Systems, e.g. ATCA standard blades located in a standardized chassis, provide high-level communication services between system peripherals. Each ATCA blade brings dedicated functionality to the system but can also exist as a separate host responsible for servicing a set of tasks. According to the platform philosophy, these parts of the system can be quite independent of other solutions provided by competitors....

Full text available to download
