Search results for: GPU ENERGY OPTIMIZATION

Search results for: GPU ENERGY OPTIMIZATION

results on page:
embed this view on your website

Filters

total: 4881

clear all filters disabled

displaying 1000 best results Help

Jacobi and gauss-seidel preconditioned complex conjugate gradient method with GPU acceleration for finite element method
Publication
- Year 2010
In this paper two implementations of iterative solvers for solving complex symmetric and sparse systems resulting from finite element method applied to wave equation are discussed. The problem under investigation is a dielectric resonator antenna (DRA) discretized by FEM with vector elements of the second order (LT/QN). The solvers use the preconditioned conjugate gradient (pcg) method implemented on Graphics Processing Unit (GPU)...

Full text to download in external service
A GPU Solver for Sparse Generalized Eigenvalue Problems with Symmetric Complex-Valued Matrices Obtained Using Higher-Order FEM
Publication
- A. Dziekoński
- M. Mrozowski
- IEEE Access - Year 2018
The paper discusses a fast implementation of the stabilized locally optimal block preconditioned conjugate gradient (sLOBPCG) method, using a hierarchical multilevel preconditioner to solve nonHermitian sparse generalized eigenvalue problems with large symmetric complex-valued matrices obtained using the higher-order finite-element method (FEM), applied to the analysis of a microwave resonator. The resonant frequencies of the low-order...

Full text available to download
Implementation of FDTD-compatible Green's function on heterogeneous CPU-GPU parallel processing system
Publication
- T. Stefański
- Progress in Electromagnetics Research-PIER - Year 2013
This paper presents an implementation of the FDTD-compatible Green's function on a heterogeneous parallel processing system. The developed implementation simultaneously utilizes computational power of the central processing unit (CPU) and the graphics processing unit (GPU) to the computational tasks best suited to each architecture. Recently, closed-form expression for this discrete Green's function (DGF) was derived, which facilitates...

Full text to download in external service
Tuning matrix-vector multiplication on GPU
Publication
- A. Dziekoński
- M. Mrozowski
- Zeszyty Naukowe Wydziału ETI Politechniki Gdańskiej. Technologie Informacyjne - Year 2010
A matrix times vector multiplication (matvec) is a cornerstone operation in iterative methods of solving large sparse systems of equations such as the conjugate gradients method (cg), the minimal residual method (minres), the generalized residual method (gmres) and exerts an influence on overall performance of those methods. An implementation of matvec is particularly demanding when one executes computations on a GPU (Graphics...
Performance evaluation of parallel background subtraction on GPU platforms
Publication
- G. Szwoch
- Elektronika : konstrukcje, technologie, zastosowania - Year 2015
Implementation of the background subtraction algorithm on parallel GPUs is presented. The algorithm processes video streams and extracts foreground pixels. The work focuses on optimizing parallel algorithm implementation by taking into account specific features of the GPU architecture, such as memory access, data transfers and work group organization. The algorithm is implemented in both OpenCL and CUDA. Various optimizations of...

Full text to download in external service
Efficient parallel implementation of crowd simulation using a hybrid CPU+GPU high performance computing system
Publication
- J. Skrzypczak
- P. Czarnul
- SIMULATION MODELLING PRACTICE AND THEORY - Year 2023
In the paper we present a modern efficient parallel OpenMP+CUDA implementation of crowd simulation for hybrid CPU+GPU systems and demonstrate its higher performance over CPU-only and GPU-only implementations for several problem sizes including 10 000, 50 000, 100 000, 500 000 and 1 000 000 agents. We show how performance varies for various tile sizes and what CPU–GPU load balancing settings shall be preferred for various domain...

Full text to download in external service
Single and Dual-GPU Generalized Sparse Eigenvalue Solvers for Finding a Few Low-Order Resonances of a Microwave Cavity Using the Finite-Element Method
Publication
- A. Dziekoński
- M. Mrozowski
- RADIOENGINEERING - Year 2018
This paper presents two fast generalized eigenvalue solvers for sparse symmetric matrices that arise when electromagnetic cavity resonances are investigated using the higher-order finite element method (FEM). To find a few loworder resonances, the locally optimal block preconditioned conjugate gradient (LOBPCG) algorithm with null-space deflation is applied. The computations are expedited by using one or two graphical processing...

Full text available to download
A multithreaded CUDA and OpenMP based power‐aware programming framework for multi‐node GPU systems
Publication
- P. Czarnul
- CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE - Year 2023
In the paper, we have proposed a framework that allows programming a parallel application for a multi-node system, with one or more GPUs per node, using an OpenMP+extended CUDA API. OpenMP is used for launching threads responsible for management of particular GPUs and extended CUDA calls allow to manage CUDA objects, data and launch kernels. The framework hides inter-node MPI communication from the programmer who can benefit from...

Full text available to download
GPU based implementation of Temperature-Vegetation Dryness Index for AVHRR3 Satellite Data
Publication
- T. Bieliński
- A. Chybicki
- Year 2014
Paper presents an implementation of TVDI (Temperature-Vegetation-Dryness Index) algorithm on GPU (Graphics Processing Unit). Calculation of this index is based on LST (Land Surface Temperature) and NDVI (Normalized Difference Vegetation Index). Discussed results are based on multi-spectral imagery retrieved from AVHRR3 sensors for area of Poland. All phases of TVDI implementation on GPU are modified in respect to CUDA platform....
Parallel Background Subtraction in Video Streams Using OpenCL on GPU Platforms
Publication
- G. Szwoch
- Year 2014
Implementation of the background subtraction algorithm using OpenCL platform is presented. The algorithm processes live stream of video frames from the surveillance camera in on-line mode. Processing is performed using a host machine and a parallel computing device. The work focuses on optimizing an OpenCL algorithm implementation for GPU devices by taking into account specific features of the GPU architecture, such as memory access,...

Full text to download in external service
GPU-accelerated finite element method
Publication
- Year 2016
In this paper the results of the acceleration of computations involved in analysing electromagnetic problems by means of the finite element method (FEM), obtained with graphics processors (GPU), are presented. A 4.7-fold acceleration was achieved thanks to the massive parallelization of the most time-consuming steps of FEM, namely finite-element matrix-generation and the solution of a sparse system of linear equations with the...

Full text to download in external service
Investigation of Parallel Data Processing Using Hybrid High Performance CPU + GPU Systems and CUDA Streams
Publication
- P. Czarnul
- COMPUTING AND INFORMATICS - Year 2020
The paper investigates parallel data processing in a hybrid CPU+GPU(s) system using multiple CUDA streams for overlapping communication and computations. This is crucial for efficient processing of data, in particular incoming data stream processing that would naturally be forwarded using multiple CUDA streams to GPUs. Performance is evaluated for various compute time to host-device communication time ratios, numbers of CUDA streams,...

Full text available to download
Energy-Aware Scheduling for High-Performance Computing Systems: A Survey
Publication
- ENERGIES - Year 2023
High-performance computing (HPC), according to its name, is traditionally oriented toward performance, especially the execution time and scalability of the computations. However, due to the high cost and environmental issues, energy consumption has already become a very important factor that needs to be considered. The paper presents a survey of energy-aware scheduling methods used in a modern HPC environment, starting with the...

Full text available to download
Performance Evaluation of Selected Parallel Object Detection and Tracking Algorithms on an Embedded GPU Platform
Publication
- G. Szwoch
- M. Szczodrak
- Year 2017
Performance evaluation of selected complex video processing algorithms, implemented on a parallel, embedded GPU platform Tegra X1, is presented. Three algorithms were chosen for evaluation: a GMM-based object detection algorithm, a particle filter tracking algorithm and an optical flow based algorithm devoted to people counting in a crowd flow. The choice of these algorithms was based on their computational complexity and parallel...

Full text to download in external service
Multi-GPU-powered UNRES package for physics-based coarse-grained simulations of structure, dynamics, and thermodynamics of protein systems at biological size- and timescales
Publication
- C. Czaplewski
- P. Czarnul
- H. Krawczyk
- A. Lipska
- E. Lubecka
- K. Ocetkiewicz
- J. Proficz
- A. Sieradzan
- R. Ślusarz
- J. Liwo
- BIOPHYSICAL JOURNAL - Year 2024
Coarse-grained models are nowadays extensively used in biomolecular simulations owing to the tremendous extension of size- and time-scale of simulations. The physics-based UNRES (UNited RESidue) model of proteins developed in our laboratory has only two interaction sites per amino-acid residue (united peptide groups and united side chains) and implicit solvent. However, owing to rigorous physics-based derivation, which enabled...

Full text to download in external service
Block Conjugate Gradient Method with Multilevel Preconditioning and GPU Acceleration for FEM Problems in Electromagnetics
Publication
- A. Dziekoński
- M. Mrozowski
- IEEE Antennas and Wireless Propagation Letters - Year 2018
In this paper a GPU-accelerated block conjugate gradient solver with multilevel preconditioning is presented for solving large system of sparse equations with multiple right hand-sides (RHSs) which arise in the finite-element analysis of electromagnetic problems. We demonstrate that blocking reduces the time to solution significantly and allows for better utilization of the computing power of GPUs, especially when the system matrix...

Full text to download in external service
Auto-tuning methodology for configuration and application parameters of hybrid CPU + GPU parallel systems based on expert knowledge
Publication
- P. Czarnul
- P. Rościszewski
- Year 2020
Auto-tuning of configuration and application param- eters allows to achieve significant performance gains in many contemporary compute-intensive applications. Feasible search spaces of parameters tend to become too big to allow for exhaustive search in the auto-tuning process. Expert knowledge about the utilized computing systems becomes useful to prune the search space and new methodologies are needed in the face of emerging heterogeneous...

Full text available to download
Tuning a Hybrid GPU-CPU V-Cycle Multilevel Preconditioner for Solving Large Real and Complex Systems of FEM Equations
Publication
- IEEE Antennas and Wireless Propagation Letters - Year 2011
This letter presents techniques for tuning an accelerated preconditioned conjugate gradient solver with a multilevel preconditioner. The solver is optimized for a fast solution of sparse systems of equations arising in computational electromagnetics in a finite element method using higher-order elements. The goal of the tuning is to increase the throughput while at the same time reducing the memory requirements in order to allow...

Full text to download in external service
Sign Language Recognition Using Convolution Neural Networks
Publication
- Year 2024
The objective of this work was to provide an app that can automatically recognize hand gestures from the American Sign Language (ASL) on mobile devices. The app employs a model based on Convolutional Neural Network (CNN) for gesture classification. Various CNN architectures and optimization strategies suitable for devices with limited resources were examined. InceptionV3 and VGG-19 models exhibited negligibly higher accuracy than...

Full text available to download
Optymalizacja efektywności hamowania odzyskowego w transporcie szynowym przez sterowanie czasem przyjazdu na stację
Publication
- M. Urbaniak
- E. Kardas-Cinal
- Problemy Kolejnictwa - Year 2018
Artykuł nawiązuje do poprzednich prac autorów, w których przedstawiono model organizacji ruchu kooperujących pociągów z uwzględnieniem optymalizacji wykorzystania energii zwracanej do sieci jezdnej. W przedstawionej pracy zmodyfikowano model zmieniając główną zmienną sterującą, mającą wpływ na efektywne wykorzystanie energii, z czasu odjazdu na czas przyjazdu pociągu na stację lub przystanek. Optymalizacja dokonywana jest przez...

Full text available to download
Optymalizacja rozkładu jazdy na kolei z uwzględnieniem efektywności hamowania odzyskowego.
Publication
- M. Urbaniak
- Logistyka - Year 2015
Na wstępie artykułu przybliżono czytelnikowi, czym jest rozkład jazdy na sieci kolejowej, na czym polega jego optymalizacja oraz odwołano się do literatury opisującej proces jego konstrukcji. W dalszej części przedstawiono kryteria optymalizacji rozkładu jazdy i zaproponowano podejście od strony efektywności wykorzystania energii pochodzącej z hamowania rekuperacyjnego, realizowanego metodą odzysku bezpośrednio do sieci trakcyjnej....

Full text available to download
Towards an efficient multi-stage Riemann solver for nuclear physics simulations
Publication
- S. Cygert
- J. Porter-Sobieraj
- D. Kikoła
- J. Sikorski
- M. Słodkowski
- Year 2013
Relativistic numerical hydrodynamics is an important tool in high energy nuclear science. However, such simulations are extremely demanding in terms of computing power. This paper focuses on improving the speed of solving the Riemann problem with the MUSTA-FORCE algorithm by employing the CUDA parallel programming model. We also propose a new approach to 3D finite difference algorithms, which employ a GPU that uses surface memory....

Full text to download in external service
Performance evaluation of the parallel object tracking algorithm employing the particle filter
Publication
- G. Szwoch
- Year 2016
An algorithm based on particle filters is employed to track moving objects in video streams from fixed and non-fixed cameras. Particle weighting is based on color histograms computed in the iHLS color space. Particle computations are parallelized with CUDA framework. The algorithm was tested on various GPU devices: a desktop GPU card, a mobile chipset and two embedded GPU platforms. The processing speed depending on the number...
Implementation of TVDI calculation for coastal zone
Publication
- T. Bieliński
- Year 2015
Paper will show an implementation of TVDI (Temperature-Vegetation-Dryness Index) algorithm on GPU (Graphics Processing Unit). Calculation of this index is based on LST (Land Surface Temperature) and NDVI (Normalized Difference Vegetation Index). Discussed results are based on multi-spectral imagery retrieved from AVHRR3 sensors for area of Poland, especially from region of Gdańsk coastal zone. All phases of TVDI implementation...
How to render FDTD computations more effective using agraphics accelerator.
Publication
- IEEE TRANSACTIONS ON MAGNETICS - Year 2009
Graphics processing units (GPUs) for years have been dedicated mostly to real time rendering. Recently leading GPU manufactures have extended their research area and decided to support also graphics computing. In this paper, we describe an impact of new GPU features on development process of an efficient finite difference time domain (FDTD) implementation.

Full text to download in external service
A memory efficient and fast sparse matrix vector product on a Gpu
Publication
- Progress in Electromagnetics Research-PIER - Year 2011
This paper proposes a new sparse matrix storage format which allows an efficient implementation of a sparse matrix vector product on a Fermi Graphics Processing Unit (GPU). Unlike previous formats it has both low memory footprint and good throughput. The new format, which we call Sliced ELLR-T has been designed specifically for accelerating the iterative solution of a large sparse and complex-valued system of linear equations arising...

Full text to download in external service
Modelling and simulation of GPU processing in the MERPSYS environment
Publication
- T. Gajger
- P. Czarnul
- Scalable Computing: Practice and Experience - Year 2018
In this work, we evaluate an analytical GPU performance model based on Little's law, that expresses the kernel execution time in terms of latency bound, throughput bound, and achieved occupancy. We then combine it with the results of several research papers, introduce equations for data transfer time estimation, and finally incorporate it into the MERPSYS framework, which is a general-purpose simulator for parallel and distributed...

Full text available to download
GPU-Accelerated LOBPCG Method with Inexact Null-Space Filtering for Solving Generalized Eigenvalue Problems in Computational Electromagnetics Analysis with Higher-Order FEM
Publication
- Communications in Computational Physics - Year 2017
This paper presents a GPU-accelerated implementation of the Locally Optimal Block Preconditioned Conjugate Gradient (LOBPCG) method with an inexact nullspace filtering approach to find eigenvalues in electromagnetics analysis with higherorder FEM. The performance of the proposed approach is verified using the Kepler (Tesla K40c) graphics accelerator, and is compared to the performance of the implementation based on functions from...

Full text to download in external service
Advanced Potential Energy Surfaces for Molecular Simulation
Publication
- A. Albaugh
- H. Boateng
- R. Bradshaw
- O. Demerdash
- J. Dziedzic
- Y. Mao
- D. Margul
- J. Swails
- Q. Zeng
- D. Case... and 10 others
- JOURNAL OF PHYSICAL CHEMISTRY B - Year 2016
Advanced potential energy surfaces are defined as theoretical models that explicitly include many-body effects that transcend the standard fixed-charge, pairwise-additive paradigm typically used in molecular simulation. However, several factors relating to their software implementation have precluded their widespread use in condensed-phase simulations: the computational cost of the theoretical models, a paucity of approximate models...

Full text available to download
Możliwości ograniczenia zużycia energii napędowej urządzeń przez optymalizację doboru wymienników ciepła, właściwą konfigurację i kontrolę przepływu płynów roboczych
Publication
- R. Andrzejczyk
- T. Muszyński
- Technika Chłodnicza i Klimatyzacyjna - Year 2012
Artykuł poświęcony możliwości ograniczenia zużycia energii napędowej urządzeń na drodze optymalizacji doboru wymienników ciepła, właściwej konfiguracji i kontroli przepływu płynów roboczych. Przedstawiono w nim ocenę zużycia energii w systemach energetycznych o największej energochłonności w realiach Polski. Zwrócono uwagę na możliwość wykorzystania wymienników ciepła o wysokiej efektywności dla zmniejszenia oporu przenoszenia...
A Task-Scheduling Approach for Efficient Sparse Symmetric Matrix-Vector Multiplication on a GPU
Publication
- SIAM JOURNAL ON SCIENTIFIC COMPUTING - Year 2015
In this paper, a task-scheduling approach to efficiently calculating sparse symmetric matrix-vector products and designed to run on Graphics Processing Units (GPUs) is presented. The main premise is that, for many sparse symmetric matrices occurring in common applications, it is possible to obtain significant reductions in memory usage and improvements in performance when the matrix is prepared in certain ways prior to computation....

Full text to download in external service
Characterizing the Scalability of Graph Convolutional Networks on Intel® PIUMA
Publication
- M. J. Adiletta
- J. J. Tithi
- E. Farsarakis
- G. Gerogiannis
- R. Adolf
- R. Benke
- S. Kashyap
- S. Hsia
- K. Lakhotia
- F. Petrini... and 2 others
- Year 2023
Large-scale Graph Convolutional Network (GCN) inference on traditional CPU/GPU systems is challenging due to a large memory footprint, sparse computational patterns, and irregular memory accesses with poor locality. Intel’s Programmable Integrated Unffied Memory Architecture (PIUMA) is designed to address these challenges for graph analytics. In this paper, a detailed characterization of GCNs is presented using the Open-Graph Benchmark...

Full text to download in external service
STEROWNIK MIKROSIECI ELEKTROENERGETYCZNEJ
Publication
- Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki Politechniki Gdańskiej - Year 2015
W artykule rozpatruje się konstrukcję sterownika mikrosieci elektroenergetycznej. Sterownik zarządza zasobamienergii elektrycznej w celu pokrycia zapotrzebowania lokalnych gospodarstw domowych z uwzględnieniem kwestii ekonomicznych. Przedstawiono strukturę sterowania, zdefiniowano zadanie optymalizacji, dokonano badań symulacyjnych dla przykładowej mikrosieci o zróżnicowanych sposobach generowania i magazynowania. Zaproponowano...

Full text available to download
Implementation of FDTD-Compatible Green's Function on Graphics Processing Unit
Publication
- T. Stefański
- K. Krzyżanowska
- IEEE Antennas and Wireless Propagation Letters - Year 2012
In this letter, implementation of the finite-difference time domain (FDTD)-compatible Green's function on a graphics processing unit (GPU) is presented. Recently, closed-form expression for this discrete Green's function (DGF) was derived, which facilitates its applications in the FDTD simulations of radiation and scattering problems. Unfortunately, implementation of the new DGF formula in software requires a multiple precision...

Full text to download in external service
Model organizacji ruchu na sieci kolejowej z uwzględnieniem rekuperacji energii
Publication
- M. Urbaniak
- Year 2018
Na wstępie przeanalizowano aktualny stan wiedzy z zakresu metod wykorzystywania energii z rekuperacji oraz istniejących modeli optymalizujących ich efektywność. Na tej podstawie za główny cel pracy wyznaczono opracowanie metody modyfikacji kolejowego rozkładu jazdy, która doprowadzi do zwiększenia efektywności wykorzystania energii pochodzącej z rekuperacji. W związku z powyższym postawiono tezę, że możliwe jest zwiększenie efektywności...
Generation of large finite-element matrices on multiple graphics processors
Publication
- INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN ENGINEERING - Year 2013
This paper presents techniques for generating very large finite-element matrices on a multicore workstation equipped with several graphics processing units (GPUs). To overcome the low memory size limitation of the GPUs, and at the same time to accelerate the generation process, we propose to generate the large sparse linear systems arising in finite-element analysis in an iterative manner on several GPUs and to use the graphics...

Full text to download in external service
Wybrane zagadnienia optymalizacji organizacji ruchu kolejowego w celu minimalizacji kosztów energii elektrycznej
Publication
- M. Urbaniak
- M. Jacyna
- PRACE NAUKOWE POLITECHNIKI WARSZAWSKIEJ. SERIA: TRANSPORT - Year 2016
W artykule przedstawiono podział kosztów w transporcie kolejowym z uwzględnieniem kosztów wewnętrznych przedsiębiorstwa, do których zaliczają się między innymi koszty dostępu do infrastruktury, czy koszty energii. Stwierdzono, że przy odpowiedniej organizacji ruchu pociągów na sieci kolejowej, bez ponoszenia dodatkowych nakładów na infrastrukturę i specjalistyczne urządzenia, można znacznie ograniczyć zużycie energii, a co za tym...

Full text available to download
ZASTOSOWANIA DRONÓW I SENSORÓW WIZYJNYCH I AKUSTYCZNYCH DO ZDALNEJ DETEKCJI I LOKALIZACJI OBIEKTÓW I ZDARZEŃ
Publication
- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Year 2016
W referacie przedstawiono wybrane sensory akustyczne i wizyjne i propozycje ich zastosowania do wykrywania i lokalizacji obiektów i zdarzeń z pokładu drona. Opisano pokrótce zastosowane algorytmy analizy strumieni, przedstawiono wyniki badań stworzonych prototypów i metod, zaimplementowanych na wydajnych układach GPU
Multi-core and Multiprocessor Implementation of Numerical Integration in Finite Element Method
Publication
- Year 2012
The paper presents techniques for accelerating a numerical integration process which appears in the Finite Element Method. The acceleration is achieved by taking advantages of multi-core and multiprocessor devices. It is shown that using multi-core implementation with OpenMP and a GPU acceleration using CUDA architecture allows one to achieve the speedups by a factor of 5 and 10 on a CPU and GPUs, respectively.
ENERGY

Journals

ISSN: 0360-5442 , eISSN: 1873-6785
Using GPUs for Parallel Stencil Computations in Relativistic Hydrodynamic Simulation
Publication
- S. Cygert
- D. Kikoła
- J. Porter-Sobieraj
- J. Sikorski
- M. Słodkowski
- Year 2014
This paper explores the possibilities of using a GPU for complex 3D finite difference computation. We propose a new approach to this topic using surface memory and compare it with 3D stencil computations carried out via shared memory, which is currently considered to be the best approach. The case study was performed for the extensive computation of collisions between heavy nuclei in terms of relativistic hydrodynamics.

Full text to download in external service
Modelowanie reorganizacji ruchu w transporcie szynowym zwiększające efektywne wykorzystanie energii z hamowania odzyskowego
Publication
- M. Urbaniak
- E. Kardas-Cinal
- PRACE NAUKOWE POLITECHNIKI WARSZAWSKIEJ. SERIA: TRANSPORT - Year 2017
We wstępie artykułu przedstawiono metody wykorzystania energii elektrycznej odzyskanej w procesie hamowania elektrodynamicznego. Szczególną uwagę zwrócono na metodę zwrotu odzyskanej energii do sieci jezdnej i wykorzystania jej przez inne pojazdy szynowe, której efektywne zastosowanie niejednokrotnie wymaga reorganizacji ruchu. W pracy przeanalizowano opisany w literaturze model organizacji ruchu w transporcie szynowym, który uwzględnia...

Full text available to download
Nowoczesne koncepcje integracji usług w systemie BeesyCluster
Publication
- P. Czarnul
- Year 2010
Opisano funkcje aktualnej wersji systemu BeesyCluster jakowarstwy pośredniej w dostępie do rozproszonych zasobów wraz podsystemami integracji usług, wyboru usług oraz ich wykonania. Zaprezentowano rozszerzenia podsystemu integracji usług zorientowane na green computing. Omówiono problemy inteligentnego wyszukiwania usług, wykorzystanie GPU, współpracę z urządzeniami mobilnymi oraz przetwarzanie w przestrzeniach inteligentnych.Dodatkowo...
Optymalizacja wydajności obliczeniowej metody elementów skończonych w architekturze CUDA
Publication
- A. Dziekoński
- Year 2015
Celem niniejszej rozprawy oraz stypendium odbytego w ramach projektu było opracowanie numerycznie efektywnego rozwiązania algorytmicznego i sprzętowego, które umożliwia przyspieszenie analizy problemów elektromagnetycznych metodą elementów skończonych (MES) z funkcjami bazowymi wysokiego rzędu. Metoda elementów skończonych w dziedzinie częstotliwości stanowi wydajne i uniwersalne narzędzie analizy układów mikrofalowych (rys....
Preconditioners with Low Memory Requirements for Higher-Order Finite-Element Method Applied to Solving Maxwell’s Equations on Multicore CPUs and GPUs
Publication
- A. Dziekoński
- G. Fotyga
- M. Mrozowski
- IEEE Access - Year 2018
This paper discusses two fast implementations of the conjugate gradient iterative method using a hierarchical multilevel preconditioner to solve the complex-valued, sparse systems obtained using the higher order finite-element method applied to the solution of the time-harmonic Maxwell equations. In the first implementation, denoted PCG-V, a classical V-cycle is applied and the system of equations on the lowest level is solved...

Full text available to download
What entrepreneurs think about tax optimization?
Open Research Data
open access
- P. Kasprzak
- P. Dębniak
The study conducted on a group of 259 entrepreneurs concerned the behavioral attitudes of business owners regarding their opinion on tax optimization. From the study we will learn, among others, how tax optimization is defined according to entrepreneurs, their attitude towards it, as well as what optimization actions they have taken so far.
Grzegorz Boczkaj dr hab. inż.

People

Department of Sanitary Engineering
ENGINEERING OPTIMIZATION

Journals

ISSN: 0305-215X , eISSN: 1029-0273
A Regular Expression Matching Application with Configurable Data Intensity for Testing Heterogeneous HPC Systems
Publication
- Year 2014
Modern High Performance Computing (HPC) systems are becoming increasingly heterogeneous in terms of utilized hardware, as well as software solutions. The problems, that we wish to efficiently solve using those systems have different complexity, not only considering magnitude, but also the type of complexity: computation, data or communication intensity. Developing new mechanisms for dealing with those complexities or choosing an...
The assessment of renewable energy in Poland on the background of the world renewable energy sector
Publication
- B. Igliński
- M. B. Pietrzak
- U. Kiełkowska
- M. Skrzatek
- G. Kumar
- G. Piechota
- ENERGY - Year 2022
The issues of the article are associated with the development of the renewable energy source (RES) sector in the world and in Poland. The subject is undoubtedly connected with the problem of the energy transformation taking place in most countries nowadays. Energy transformation processes are mainly associated with an increase in the share of energy production from RES and increased awareness of energy use by end consumers. This...

Full text available to download

Search

Filters

Catalog

Search results for: GPU ENERGY OPTIMIZATION

Grzegorz Boczkaj dr hab. inż.