Filters
total: 4557
-
Catalog
- Publications: 3813 results after filtering
- Journals: 231 results after filtering
- Conferences: 20 results after filtering
- People: 133 results after filtering
- Projects: 15 results after filtering
- Online Courses: 164 results after filtering
- Events: 8 results after filtering
- Research Data: 173 results after filtering
showing the top 1000 results
Search results for: gpu energy optimization
-
Dynamic GPU power capping with online performance tracing for energy efficient GPU computing using DEPO tool
Publication: GPU accelerators have become essential to the recent advance in computational power of high-performance computing (HPC) systems. Current HPC systems’ reaching an approximately 20–30 mega-watt power demand has resulted in increasing CO2 emissions, energy costs and necessitate increasingly complex cooling systems. This is a very real challenge. To address this, new mechanisms of software power control could be employed. In this...
-
GPU-Accelerated 3D Mesh Deformation for Optimization Based on the Finite Element Method
Publication: This paper discusses a strategy for speeding up the mesh deformation process in the design-by-optimization of high-frequency components involving electromagnetic field simulations using the 3D finite element method (FEM). The mesh deformation is assumed to be described by a linear elasticity model of a rigid body; therefore, each time the shape of the device is changed, an auxiliary elasticity finite-element problem must be solved....
-
Communication and Load Balancing Optimization for Finite Element Electromagnetic Simulations Using Multi-GPU Workstation
Publication: This paper considers a method for accelerating finite-element simulations of electromagnetic problems on a workstation using graphics processing units (GPUs). The focus is on finite-element formulations using higher order elements and tetrahedral meshes that lead to sparse matrices too large to be dealt with on a typical workstation using direct methods. We discuss the problem of rapid matrix generation and assembly, as well as...
-
Solar Photovoltaic Energy Optimization and Challenges
Publication: The study paper focuses on solar energy optimization approaches, as well as the obstacles and concerns that come with them. This study discusses the most current advancements in solar power generation devices in order to provide a reference for decision-makers in the field of solar plant construction throughout the world. These technologies are divided into three groups: photovoltaic, thermal, and hybrid (thermal/photovoltaic)....
-
Performance and Energy Aware Training of a Deep Neural Network in a Multi-GPU Environment with Power Capping
Publication: In this paper we demonstrate that it is possible to obtain considerable improvement of performance and energy aware metrics for training of deep neural networks using a modern parallel multi-GPU system, by enforcing selected, non-default power caps on the GPUs. We measure the power and energy consumption of the whole node using a professional, certified hardware power meter. For a high performance workstation with 8 GPUs, we were...
-
GPU Power Capping for Energy-Performance Trade-Offs in Training of Deep Convolutional Neural Networks for Image Recognition
Publication: In the paper we present performance-energy trade-off investigation of training Deep Convolutional Neural Networks for image recognition. Several representative and widely adopted network models, such as Alexnet, VGG-19, Inception V3, Inception V4, Resnet50 and Resnet152 were tested using systems with Nvidia Quadro RTX 6000 as well as Nvidia V100 GPUs. Using GPU power capping we found other than default configurations minimizing...
-
Optimization of the UNRES Force Field by Hierarchical Design of the Potential-Energy Landscape. 3. Use of Many Proteins in Optimization
Publication -
Performance/energy aware optimization of parallel applications on GPUs under power capping
Publication: In the paper we present an approach and results from application of the modern power capping mechanism available for NVIDIA GPUs to the benchmarks such as NAS Parallel Benchmarks BT, SP and LU as well as cublasgemm-benchmark which are widely used for assessment of high performance computing systems’ performance. Specifically, depending on the benchmarks, various power cap configurations are best for desired trade-off of performance...
-
Optimization of Train Energy Cooperation Using Scheduled Service Time Reserve
Publication: The main aim of the paper was to develop an innovative approach to the preliminary estimation possibility of train energy cooperation based on data from timetables, without traction calculations. The article points out the need to strive for sustainable and environmentally friendly transport. It was pointed out that rail transport using electric traction is one of the more ecological branches of transport. It also offers a number...
-
Optimization of using recuperative braking energy on a double-track railway line
Publication: In the introduction, possible ways of reusing energy from recuperation are presented. Next, the paper investigates the possibility of using regenerative braking in the range allowed by the detailed timetable by adopting the method of transferring the recovered electric energy directly to the catenary and immediate use of this energy by another train at the same power section. In the main part of the work, it is shown, that the...
-
Optimization of Parameters in Macromolecular Potential Energy Functions by Conformational Space Annealing
Publication -
Modification and Optimization of the United-Residue (UNRES) Potential Energy Function for Canonical Simulations. I. Temperature Dependence of the Effective Energy Function and Tests of the Optimization Method with Single Training Proteins
Publication -
Energy Systems-Optimization Modeling Simulation and Economic Aspects
Journals -
Optimization of the efficiency of braking energy recovery in rail transport by changing arrival time
Publication: The article refers to the previous work of the authors, in which the model of traffic organization of cooperating trains including the optimization of the use of energy returned to the catenary was presented. In the presented article, the model was modified by changing the main control variable, which affects the efficient use of energy. Departure time was changed for the arrival time of the train to the stop or station. The optimization...
-
Mitigating the Energy Consumption and the Carbon Emission in the Building Structures by Optimization of the Construction Processes
Publication: For decades, among other industries, the construction sector has accounted for high energy consumption and emissions. As the energy crisis and climate change have become a growing concern, mitigating energy usage is a significant issue. The operational and end of life phases are all included in the building life cycle stages. Although the operation stage accounts for more energy consumption with higher carbon emissions, the...
-
Geometry optimization of steroid sulfatase inhibitors - the influence on the free binding energy with STS
Publication: In the paper we review the application of two techniques (molecular mechanics and quantum mechanics) to study the influence of geometry optimization of the steroid sulfatase inhibitors on the values of descriptors coded their chemical structure and their free binding energy with the STS protein. We selected 22 STS-inhibitors and compared their structures optimized with MM+, PM7 and DFT B3LYP/6–31++G* approaches considering separately...
-
Optimization of the Geothermal Energy for District Heating in the Polish Tatras Region: A Case Study
Publication -
Recent improvements in prediction of protein structure by global optimization of a potential energy function
Publication -
Efficient parallel algorithms in global optimization of potential energy functions for peptides, proteins, and crystals
Publication -
Energy consumption optimization in wastewater treatment plants: Machine learning for monitoring incineration of sewage sludge
Publication: Biomass management in terms of energy consumption optimization has become a recent challenge for developed countries. Nevertheless, the multiplicity of materials and operating parameters controlling energy consumption in wastewater treatment plants necessitates the need for sophisticated well-organized disciplines in order to minimize energy consumption and dissipation. Sewage sludge (SS) disposal management is the key stage of...
-
Improving the energy balance in wastewater treatment plants by optimization of aeration control and application of new technologies
Publication: The methods to improve the energy balance of a wastewater treatment plant (WWTP) by optimization of aeration process control and application of innovative nitrogen removal technologies were overviewed in the study. The control of aeration based on the ABAC (Ammonia-Based Aeration Control) system allows not only for significant savings in electricity consumption, but it can also increase the efficiency of the denitrification process....
-
Generalized regression neural network and fitness dependent optimization: Application to energy harvesting of centralized TEG systems
Publication: The thermoelectric generator (TEG) system has attracted extensive attention because of its applications in centralized solar heat utilization and recoverable heat energy. The operating efficiency of the TEG system is highly affected by operating conditions. In a series-parallel structure, due to diverse temperature differences, the TEG modules show non-linear performance. Due to the non-uniform temperature distribution (NUTD) condition,...
-
Modeling and optimization of chemical-treated torrefaction of wheat straw to improve energy density by response surface methodology
Publication: Today, torrefaction is important technique for extending the potential of biomass for improvement of energy density. The independent variables investigated for torrefaction study were temperature, retention time, acid concentration, and particle size. The experiment was designed by central composite design (CCD) method using design expert (version 11). The three dependent variables were higher heating value (HHV), energy enhancement...
-
Analyzing Wind Energy Potential Using Efficient Global Optimization: A Case Study for the City Gdańsk in Poland
Publication: Wind energy (WE), which is one of the renewable energy (RE) sources for generating electricity, has been making a significant contribution to obtaining clean and green energy in recent years. Fitting an appropriate statistical distribution to the wind speed (WS) data is crucial in analyzing and estimating WE potential. Once the best suitable statistical distribution for WS data is determined, WE potential and potential yield could...
-
A dual-control strategy based on electrode material and electrolyte optimization to construct an asymmetric supercapacitor with high energy density
Publication: Metal-organic frames (MOFs) are regarded as excellent candidates for supercapacitors that have attracted much attention because of their diversity, adjustability and porosity. However, both poor structural stability in aqueous alkaline electrolytes and the low electrical conductivity of MOF materials constrain their practical implementation in supercapacitors. In this study, bimetallic CoNi-MOF were synthesized to enhance the electrical...
-
Advanced Supervisory Control System Implemented at Full-Scale WWTP—A Case Study of Optimization and Energy Balance Improvement
Publication: In modern and cost-effective Wastewater Treatment Plants (WWTPs), processes such as aeration, chemical feeds and sludge pumping are usually controlled by an operating system integrated with online sensors. The proper verification of these data-driven measurements and the control of different unit operations at the same time has a strong influence on better understanding and accurately optimizing the biochemical processes at WWTP, especially...
-
Aeration Process in Bioreactors as the Main Energy Consumer in a Wastewater Treatment Plant. Review of Solutions and Methods of Process Optimization
Publication: Due to the key role of the biological decomposition process of organic compounds in wastewater treatment, a very important thing is appropriate aeration of activated sludge, because microorganisms have to be supplied with an appropriate amount of oxygen. Aeration is one of the most energy-consuming processes in the conventional activated sludge systems of wastewater treatment technology (may consume from 50% to 90% of electricity...
-
Estimation, optimization and analysis based investigation of the energy consumption in machinability of ceramic-based metal matrix composite materials
Publication -
A Salp-Swarm Optimization based MPPT technique for harvesting maximum energy from PV systems under partial shading conditions
Publication -
Optimization of the UNRES Force Field by Hierarchical Design of the Potential-Energy Landscape. 2. Off-Lattice Tests of the Method with Single Proteins
Publication -
The Application of the Thermal Stabilization Prompted by the Ice Cover Expansion Considering the Energy Production Optimization in the Dam-Reservoir Coupled Systems on the Vistula River
Publication: In this study, the thermal stabilization of a water resource together with an energy production optimization in the power plant of the dam–reservoir coupled system is conducted. This coupled dam system is designed to consist of a primary (Włocławek) and secondary (Siarzewo) dam due to the erosion control aspect. The other beneficial aspect of this coupled dam design is to have an additional power plant, with the aim of achieving...
-
Optimization of the UNRES Force Field by Hierarchical Design of the Potential-Energy Landscape. 1. Tests of the Approach Using Simple Lattice Protein Models
Publication -
International Journal of Energy Optimization and Engineering
Journals -
Mutual Interaction between Temperature and DO Set Point on AOB and NOB Activity during Shortcut Nitrification in a Sequencing Batch Reactor in Terms of Energy Consumption Optimization
Publication: Recently, many wastewater treatment plants (WWTPs) have had to deal with serious problems related to the restrictive requirements regarding the effluent quality, as well as significant energy consumption associated with it. In this situation, mainstream deammonification and/or shortened nitrification-denitrification via nitrite (so-called “nitrite shunt”) is a new promising strategy. This study shows the mechanisms and operating conditions...
-
Chaotic Dynamics and Bifurcations in Impact Systems
Publication -
Paweł Czarnul dr hab. inż.
People: Paweł Czarnul obtained the habilitation degree (doktor habilitowany) in technical sciences in the discipline of computer science in 2015, and the doctoral degree in technical sciences in computer science (with distinction), conferred by the Council of the Faculty of Electronics, Telecommunications and Informatics of Gdańsk University of Technology, in 2003. His areas of interest include parallel and distributed processing, including parallel programming on compute clusters,...
-
Parallelization of large vector similarity computations in a hybrid CPU+GPU environment
Publication: The paper presents design, implementation and tuning of a hybrid parallel OpenMP+CUDA code for computation of similarity between pairs of a large number of multidimensional vectors. The problem has a wide range of applications, and consequently its optimization is of high importance, especially on currently widespread hybrid CPU+GPU systems targeted in the paper. The following are presented and tested for computation of all vector...
-
GPU-Accelerated Finite-Element Matrix Generation for Lossless, Lossy, and Tensor Media [EM Programmer's Notebook]
Publication: This paper presents an optimization approach for limiting memory requirements and enhancing the performance of GPU-accelerated finite-element matrix generation applied in the implementation of the higher-order finite-element method (FEM). It emphasizes the details of the implementation of the matrix-generation algorithm for the simulation of electromagnetic wave propagation in lossless, lossy, and tensor media. Moreover, the impact...
-
Accuracy, Memory and Speed Strategies in GPU-based Finite-Element Matrix-Generation
Publication: This paper presents strategies on how to optimize GPU-based finite-element matrix-generation that occurs in the finite-element method (FEM) using higher order curvilinear elements. The goal of the optimization is to increase the speed of evaluation and assembly of large finite-element matrices on a single GPU (Graphics Processing Unit) while maintaining the accuracy of numerical integration at the desired level. For this reason,...
-
Numerical optimization of planar antenna structures using trust-region algorithm with adaptively adjusted finite differences
Research Data: The dataset contains initial designs and optimization results for three planar structures that include quasi-patch antenna for WLAN applications, compact spline-parameterized monopole dedicated for ultra-wideband applications, as well as rectifier for energy harvesting with enhanced bandwidth. The numerical results for the first two structures are also...
-
GPU Acceleration of Multilevel Solvers for Analysis of Microwave Components With Finite Element Method
Publication: The letter discusses a fast implementation of the conjugate gradient iterative method with $\mathrm{E}$-field multilevel preconditioner applied to solving real symmetric and sparse systems obtained with vector finite element method. In order to accelerate computations, a graphics processing unit (GPU) was used and significant speed-up (2.61 fold) was achieved comparing to a central processing unit (CPU) based approach. These results...
-
Finite element matrix generation on a GPU
Publication: This paper presents an efficient technique for fast generation of sparse systems of linear equations arising in computational electromagnetics in a finite element method using higher order elements. The proposed approach employs a graphics processing unit (GPU) for both numerical integration and matrix assembly. The performance results obtained on a test platform consisting of a Fermi GPU (1x Tesla C2075) and a CPU (2x twelve-core...
-
Ryszard Strzelecki prof. dr hab. inż.
People -
Optimization of Data Assignment for Parallel Processing in a Hybrid Heterogeneous Environment Using Integer Linear Programming
Publication: In the paper we investigate a practical approach to application of integer linear programming for optimization of data assignment to compute units in a multi-level heterogeneous environment with various compute devices, including CPUs, GPUs and Intel Xeon Phis. The model considers an application that processes a large number of data chunks in parallel on various compute units and takes into account computations, communication including...
-
Acceleration of the DGF-FDTD method on GPU using the CUDA technology
Publication: We present a parallel implementation of the discrete Green's function formulation of the finite-difference time-domain (DGF-FDTD) method on a graphics processing unit (GPU). The compute unified device architecture (CUDA) parallel computing platform is applied in the developed implementation. For the sake of example, arrays of Yagi-Uda antennas were simulated with the use of DGF-FDTD on GPU. The efficiency of parallel computations...
-
Parallel implementation of the DGF-FDTD method on GPU Using the CUDA technology
Publication: The discrete Green's function (DGF) formulation of the finite-difference time-domain method (FDTD) is accelerated on a graphics processing unit (GPU) by means of the Compute Unified Device Architecture (CUDA) technology. In the developed implementation of the DGF-FDTD method, a new analytic expression for dyadic DGF derived based on scalar DGF is employed in computations. The DGF-FDTD method on GPU returns solutions that are compatible...
-
Jacobi and gauss-seidel preconditioned complex conjugate gradient method with GPU acceleration for finite element method
Publication: In this paper two implementations of iterative solvers for solving complex symmetric and sparse systems resulting from finite element method applied to wave equation are discussed. The problem under investigation is a dielectric resonator antenna (DRA) discretized by FEM with vector elements of the second order (LT/QN). The solvers use the preconditioned conjugate gradient (pcg) method implemented on Graphics Processing Unit (GPU)...
-
A GPU Solver for Sparse Generalized Eigenvalue Problems with Symmetric Complex-Valued Matrices Obtained Using Higher-Order FEM
Publication: The paper discusses a fast implementation of the stabilized locally optimal block preconditioned conjugate gradient (sLOBPCG) method, using a hierarchical multilevel preconditioner to solve non-Hermitian sparse generalized eigenvalue problems with large symmetric complex-valued matrices obtained using the higher-order finite-element method (FEM), applied to the analysis of a microwave resonator. The resonant frequencies of the low-order...
-
Implementation of FDTD-compatible Green's function on heterogeneous CPU-GPU parallel processing system
Publication: This paper presents an implementation of the FDTD-compatible Green's function on a heterogeneous parallel processing system. The developed implementation simultaneously utilizes computational power of the central processing unit (CPU) and the graphics processing unit (GPU) to the computational tasks best suited to each architecture. Recently, closed-form expression for this discrete Green's function (DGF) was derived, which facilitates...
-
Tuning matrix-vector multiplication on GPU
Publication: A matrix times vector multiplication (matvec) is a cornerstone operation in iterative methods of solving large sparse systems of equations such as the conjugate gradients method (cg), the minimal residual method (minres), the generalized residual method (gmres) and exerts an influence on overall performance of those methods. An implementation of matvec is particularly demanding when one executes computations on a GPU (Graphics...
-
Performance evaluation of parallel background subtraction on GPU platforms
Publication: Implementation of the background subtraction algorithm on parallel GPUs is presented. The algorithm processes video streams and extracts foreground pixels. The work focuses on optimizing parallel algorithm implementation by taking into account specific features of the GPU architecture, such as memory access, data transfers and work group organization. The algorithm is implemented in both OpenCL and CUDA. Various optimizations of...
-
Efficient parallel implementation of crowd simulation using a hybrid CPU+GPU high performance computing system
Publication: In the paper we present a modern efficient parallel OpenMP+CUDA implementation of crowd simulation for hybrid CPU+GPU systems and demonstrate its higher performance over CPU-only and GPU-only implementations for several problem sizes including 10 000, 50 000, 100 000, 500 000 and 1 000 000 agents. We show how performance varies for various tile sizes and what CPU–GPU load balancing settings shall be preferred for various domain...
-
Single and Dual-GPU Generalized Sparse Eigenvalue Solvers for Finding a Few Low-Order Resonances of a Microwave Cavity Using the Finite-Element Method
Publication: This paper presents two fast generalized eigenvalue solvers for sparse symmetric matrices that arise when electromagnetic cavity resonances are investigated using the higher-order finite element method (FEM). To find a few low-order resonances, the locally optimal block preconditioned conjugate gradient (LOBPCG) algorithm with null-space deflation is applied. The computations are expedited by using one or two graphical processing...
-
GPU based implementation of Temperature-Vegetation Dryness Index for AVHRR3 Satellite Data
Publication: Paper presents an implementation of TVDI (Temperature-Vegetation-Dryness Index) algorithm on GPU (Graphics Processing Unit). Calculation of this index is based on LST (Land Surface Temperature) and NDVI (Normalized Difference Vegetation Index). Discussed results are based on multi-spectral imagery retrieved from AVHRR3 sensors for area of Poland. All phases of TVDI implementation on GPU are modified in respect to CUDA platform....
-
A multithreaded CUDA and OpenMP based power‐aware programming framework for multi‐node GPU systems
Publication: In the paper, we have proposed a framework that allows programming a parallel application for a multi-node system, with one or more GPUs per node, using an OpenMP+extended CUDA API. OpenMP is used for launching threads responsible for management of particular GPUs and extended CUDA calls allow to manage CUDA objects, data and launch kernels. The framework hides inter-node MPI communication from the programmer who can benefit from...
-
Parallel Background Subtraction in Video Streams Using OpenCL on GPU Platforms
Publication: Implementation of the background subtraction algorithm using OpenCL platform is presented. The algorithm processes live stream of video frames from the surveillance camera in on-line mode. Processing is performed using a host machine and a parallel computing device. The work focuses on optimizing an OpenCL algorithm implementation for GPU devices by taking into account specific features of the GPU architecture, such as memory access,...
-
GPU-accelerated finite element method
Publication: In this paper the results of the acceleration of computations involved in analysing electromagnetic problems by means of the finite element method (FEM), obtained with graphics processors (GPU), are presented. A 4.7-fold acceleration was achieved thanks to the massive parallelization of the most time-consuming steps of FEM, namely finite-element matrix-generation and the solution of a sparse system of linear equations with the...
-
Investigation of Parallel Data Processing Using Hybrid High Performance CPU + GPU Systems and CUDA Streams
Publication: The paper investigates parallel data processing in a hybrid CPU+GPU(s) system using multiple CUDA streams for overlapping communication and computations. This is crucial for efficient processing of data, in particular incoming data stream processing that would naturally be forwarded using multiple CUDA streams to GPUs. Performance is evaluated for various compute time to host-device communication time ratios, numbers of CUDA streams,...
-
Energy-Aware Scheduling for High-Performance Computing Systems: A Survey
Publication: High-performance computing (HPC), according to its name, is traditionally oriented toward performance, especially the execution time and scalability of the computations. However, due to the high cost and environmental issues, energy consumption has already become a very important factor that needs to be considered. The paper presents a survey of energy-aware scheduling methods used in a modern HPC environment, starting with the...
-
Performance Evaluation of Selected Parallel Object Detection and Tracking Algorithms on an Embedded GPU Platform
Publication: Performance evaluation of selected complex video processing algorithms, implemented on a parallel, embedded GPU platform Tegra X1, is presented. Three algorithms were chosen for evaluation: a GMM-based object detection algorithm, a particle filter tracking algorithm and an optical flow based algorithm devoted to people counting in a crowd flow. The choice of these algorithms was based on their computational complexity and parallel...
-
Block Conjugate Gradient Method with Multilevel Preconditioning and GPU Acceleration for FEM Problems in Electromagnetics
Publication: In this paper a GPU-accelerated block conjugate gradient solver with multilevel preconditioning is presented for solving large system of sparse equations with multiple right hand-sides (RHSs) which arise in the finite-element analysis of electromagnetic problems. We demonstrate that blocking reduces the time to solution significantly and allows for better utilization of the computing power of GPUs, especially when the system matrix...
-
Auto-tuning methodology for configuration and application parameters of hybrid CPU + GPU parallel systems based on expert knowledge
Publication: Auto-tuning of configuration and application parameters allows to achieve significant performance gains in many contemporary compute-intensive applications. Feasible search spaces of parameters tend to become too big to allow for exhaustive search in the auto-tuning process. Expert knowledge about the utilized computing systems becomes useful to prune the search space and new methodologies are needed in the face of emerging heterogeneous...
-
Tuning a Hybrid GPU-CPU V-Cycle Multilevel Preconditioner for Solving Large Real and Complex Systems of FEM Equations
Publication: This letter presents techniques for tuning an accelerated preconditioned conjugate gradient solver with a multilevel preconditioner. The solver is optimized for a fast solution of sparse systems of equations arising in computational electromagnetics in a finite element method using higher-order elements. The goal of the tuning is to increase the throughput while at the same time reducing the memory requirements in order to allow...
-
Optymalizacja efektywności hamowania odzyskowego w transporcie szynowym przez sterowanie czasem przyjazdu na stację [Optimization of regenerative braking efficiency in rail transport by controlling the arrival time at the station]
Publication: The article builds on the authors' previous work, which presented a traffic organization model for cooperating trains that included optimization of the use of energy returned to the catenary. In the present work, the model was modified by changing the main control variable affecting efficient energy use from the departure time to the arrival time of the train at a station or stop. The optimization is performed by...
-
Optymalizacja rozkładu jazdy na kolei z uwzględnieniem efektywności hamowania odzyskowego. [Railway timetable optimization taking into account the efficiency of regenerative braking.]
Publication: The article first introduces the reader to what a timetable on a railway network is and what its optimization involves, and refers to the literature describing the process of its construction. It then presents timetable optimization criteria and proposes an approach based on the efficiency of using energy from recuperative braking, realized by direct recovery to the traction network....
-
Towards an efficient multi-stage Riemann solver for nuclear physics simulations
Publication: Relativistic numerical hydrodynamics is an important tool in high energy nuclear science. However, such simulations are extremely demanding in terms of computing power. This paper focuses on improving the speed of solving the Riemann problem with the MUSTA-FORCE algorithm by employing the CUDA parallel programming model. We also propose a new approach to 3D finite difference algorithms, which employ a GPU that uses surface memory....
-
Performance evaluation of the parallel object tracking algorithm employing the particle filter
Publication: An algorithm based on particle filters is employed to track moving objects in video streams from fixed and non-fixed cameras. Particle weighting is based on color histograms computed in the iHLS color space. Particle computations are parallelized with CUDA framework. The algorithm was tested on various GPU devices: a desktop GPU card, a mobile chipset and two embedded GPU platforms. The processing speed depending on the number...
-
Implementation of TVDI calculation for coastal zone
Publication
This paper presents an implementation of the TVDI (Temperature-Vegetation-Dryness Index) algorithm on a GPU (Graphics Processing Unit). Calculation of this index is based on LST (Land Surface Temperature) and NDVI (Normalized Difference Vegetation Index). The results discussed are based on multi-spectral imagery retrieved from AVHRR3 sensors for the area of Poland, especially the Gdańsk coastal zone region. All phases of TVDI implementation...
-
How to render FDTD computations more effective using a graphics accelerator.
Publication
Graphics processing units (GPUs) have for years been dedicated mostly to real-time rendering. Recently, leading GPU manufacturers have extended their research area and decided to also support graphics computing. In this paper, we describe the impact of new GPU features on the development process of an efficient finite difference time domain (FDTD) implementation.
-
A memory efficient and fast sparse matrix vector product on a GPU
Publication
This paper proposes a new sparse matrix storage format which allows an efficient implementation of a sparse matrix vector product on a Fermi Graphics Processing Unit (GPU). Unlike previous formats, it has both a low memory footprint and good throughput. The new format, which we call Sliced ELLR-T, has been designed specifically for accelerating the iterative solution of a large sparse and complex-valued system of linear equations arising...
-
Modelling and simulation of GPU processing in the MERPSYS environment
Publication
In this work, we evaluate an analytical GPU performance model based on Little's law, which expresses the kernel execution time in terms of latency bound, throughput bound, and achieved occupancy. We then combine it with the results of several research papers, introduce equations for data transfer time estimation, and finally incorporate it into the MERPSYS framework, which is a general-purpose simulator for parallel and distributed...
-
GPU-Accelerated LOBPCG Method with Inexact Null-Space Filtering for Solving Generalized Eigenvalue Problems in Computational Electromagnetics Analysis with Higher-Order FEM
Publication
This paper presents a GPU-accelerated implementation of the Locally Optimal Block Preconditioned Conjugate Gradient (LOBPCG) method with an inexact null-space filtering approach to find eigenvalues in electromagnetics analysis with higher-order FEM. The performance of the proposed approach is verified using the Kepler (Tesla K40c) graphics accelerator, and is compared to the performance of the implementation based on functions from...
-
Advanced Potential Energy Surfaces for Molecular Simulation
Publication
Advanced potential energy surfaces are defined as theoretical models that explicitly include many-body effects that transcend the standard fixed-charge, pairwise-additive paradigm typically used in molecular simulation. However, several factors relating to their software implementation have precluded their widespread use in condensed-phase simulations: the computational cost of the theoretical models, a paucity of approximate models...
-
A Task-Scheduling Approach for Efficient Sparse Symmetric Matrix-Vector Multiplication on a GPU
Publication
In this paper, a task-scheduling approach to efficiently calculating sparse symmetric matrix-vector products, designed to run on Graphics Processing Units (GPUs), is presented. The main premise is that, for many sparse symmetric matrices occurring in common applications, it is possible to obtain significant reductions in memory usage and improvements in performance when the matrix is prepared in certain ways prior to computation....
-
Możliwości ograniczenia zużycia energii napędowej urządzeń przez optymalizację doboru wymienników ciepła, właściwą konfigurację i kontrolę przepływu płynów roboczych [Possibilities for reducing the drive energy consumption of equipment by optimizing heat exchanger selection, proper configuration, and control of working fluid flow]
Publication
The article addresses the possibility of reducing the drive energy consumption of equipment by optimizing heat exchanger selection, proper configuration, and control of working fluid flow. It presents an assessment of energy consumption in the most energy-intensive energy systems under Polish conditions. Attention is drawn to the possibility of using high-efficiency heat exchangers to reduce the transfer resistance...
-
Characterizing the Scalability of Graph Convolutional Networks on Intel® PIUMA
Publication
Large-scale Graph Convolutional Network (GCN) inference on traditional CPU/GPU systems is challenging due to a large memory footprint, sparse computational patterns, and irregular memory accesses with poor locality. Intel’s Programmable Integrated Unified Memory Architecture (PIUMA) is designed to address these challenges for graph analytics. In this paper, a detailed characterization of GCNs is presented using the Open-Graph Benchmark...
-
STEROWNIK MIKROSIECI ELEKTROENERGETYCZNEJ [Power microgrid controller]
Publication
The article considers the design of a controller for an electric power microgrid. The controller manages electrical energy resources to cover the demand of local households, taking economic considerations into account. The control structure is presented, the optimization task is defined, and simulation studies are carried out for an example microgrid with diverse means of generation and storage. The article proposes...
-
Implementation of FDTD-Compatible Green's Function on Graphics Processing Unit
Publication
In this letter, the implementation of the finite-difference time domain (FDTD)-compatible Green's function on a graphics processing unit (GPU) is presented. Recently, a closed-form expression for this discrete Green's function (DGF) was derived, which facilitates its application in FDTD simulations of radiation and scattering problems. Unfortunately, implementation of the new DGF formula in software requires a multiple precision...
-
Model organizacji ruchu na sieci kolejowej z uwzględnieniem rekuperacji energii [Traffic organization model for a railway network accounting for energy recuperation]
Publication
The work first analyzes the current state of knowledge on methods of using recuperated energy and on existing models that optimize their efficiency. On this basis, the main aim of the work was set as developing a method for modifying the railway timetable that increases the efficiency of using recuperated energy. Accordingly, the thesis was put forward that it is possible to increase the efficiency...
-
Generation of large finite-element matrices on multiple graphics processors
Publication
This paper presents techniques for generating very large finite-element matrices on a multicore workstation equipped with several graphics processing units (GPUs). To overcome the low memory size limitation of the GPUs, and at the same time to accelerate the generation process, we propose to generate the large sparse linear systems arising in finite-element analysis in an iterative manner on several GPUs and to use the graphics...
-
ZASTOSOWANIA DRONÓW I SENSORÓW WIZYJNYCH I AKUSTYCZNYCH DO ZDALNEJ DETEKCJI I LOKALIZACJI OBIEKTÓW I ZDARZEŃ [Applications of drones and vision and acoustic sensors for remote detection and localization of objects and events]
Publication
The paper presents selected acoustic and vision sensors and proposals for their use in detecting and localizing objects and events from on board a drone. The stream analysis algorithms used are briefly described, and test results are presented for the prototypes and methods developed, implemented on high-performance GPUs.
-
Multi-core and Multiprocessor Implementation of Numerical Integration in Finite Element Method
Publication
The paper presents techniques for accelerating the numerical integration process that appears in the Finite Element Method. The acceleration is achieved by taking advantage of multi-core and multiprocessor devices. It is shown that a multi-core implementation with OpenMP and GPU acceleration using the CUDA architecture achieve speedups by factors of 5 and 10 on a CPU and GPUs, respectively.
-
Wybrane zagadnienia optymalizacji organizacji ruchu kolejowego w celu minimalizacji kosztów energii elektrycznej [Selected issues in optimizing railway traffic organization to minimize electricity costs]
Publication
The article presents a breakdown of costs in rail transport, including the internal costs of the enterprise, which comprise, among others, infrastructure access costs and energy costs. It finds that with appropriate organization of train traffic on the railway network, without additional outlays on infrastructure or specialized equipment, energy consumption can be significantly reduced, and consequently...
-
Using GPUs for Parallel Stencil Computations in Relativistic Hydrodynamic Simulation
Publication
This paper explores the possibilities of using a GPU for complex 3D finite difference computation. We propose a new approach to this topic using surface memory and compare it with 3D stencil computations carried out via shared memory, which is currently considered to be the best approach. The case study was performed for the extensive computation of collisions between heavy nuclei in terms of relativistic hydrodynamics.
-
Modelowanie reorganizacji ruchu w transporcie szynowym zwiększające efektywne wykorzystanie energii z hamowania odzyskowego [Modeling traffic reorganization in rail transport to increase the effective use of energy from regenerative braking]
Publication
The introduction presents methods of using electrical energy recovered during electrodynamic braking. Particular attention is paid to the method of returning recovered energy to the catenary for use by other rail vehicles, whose effective application often requires traffic reorganization. The work analyzes a traffic organization model for rail transport described in the literature, which takes into account...
-
ENERGY
Journals
-
Nowoczesne koncepcje integracji usług w systemie BeesyCluster [Modern concepts of service integration in the BeesyCluster system]
Publication
The functions of the current version of the BeesyCluster system as a middleware layer for accessing distributed resources are described, together with its subsystems for service integration, service selection, and service execution. Extensions of the service integration subsystem oriented toward green computing are presented. The problems of intelligent service discovery, GPU use, cooperation with mobile devices, and processing in smart spaces are discussed. Additionally...
-
Optymalizacja wydajności obliczeniowej metody elementów skończonych w architekturze CUDA [Optimization of the computational performance of the finite element method on the CUDA architecture]
Publication
The aim of this dissertation, and of the fellowship completed within the project, was to develop a numerically efficient algorithmic and hardware solution that accelerates the analysis of electromagnetic problems using the finite element method (FEM) with higher-order basis functions. The frequency-domain finite element method is an efficient and versatile tool for analyzing microwave circuits (Fig....
-
Preconditioners with Low Memory Requirements for Higher-Order Finite-Element Method Applied to Solving Maxwell’s Equations on Multicore CPUs and GPUs
Publication
This paper discusses two fast implementations of the conjugate gradient iterative method using a hierarchical multilevel preconditioner to solve the complex-valued, sparse systems obtained using the higher order finite-element method applied to the solution of the time-harmonic Maxwell equations. In the first implementation, denoted PCG-V, a classical V-cycle is applied and the system of equations on the lowest level is solved...
-
What entrepreneurs think about tax optimization?
Research Data
The study, conducted on a group of 259 entrepreneurs, concerned business owners' attitudes toward and opinions on tax optimization. The study shows, among other things, how entrepreneurs define tax optimization, what their attitude toward it is, and what optimization actions they have taken so far.
-
Grzegorz Boczkaj dr hab. inż.
People
-
ENGINEERING OPTIMIZATION
Journals
-
The assessment of renewable energy in Poland on the background of the world renewable energy sector
Publication
The article addresses the development of the renewable energy source (RES) sector worldwide and in Poland. The subject is undoubtedly connected with the energy transformation currently taking place in most countries. Energy transformation processes are mainly associated with an increase in the share of energy production from RES and increased awareness of energy use by end consumers. This...
-
A Regular Expression Matching Application with Configurable Data Intensity for Testing Heterogeneous HPC Systems
Publication
Modern High Performance Computing (HPC) systems are becoming increasingly heterogeneous in terms of the hardware used, as well as software solutions. The problems that we wish to solve efficiently using those systems differ in complexity, not only in magnitude but also in type: computation, data, or communication intensity. Developing new mechanisms for dealing with those complexities or choosing an...
-
The regional energy transformation in the context of renewable energy sources potential
Publication
The topics addressed in the article concern the problem of exploiting the potential of renewable energy sources (RES) at the regional level and the problem of the course of regional energy transition processes. Throughout the world, energy transition proceeds in a specific way for each country, due to the different potential of the selected RES and political, institutional and socio-economic conditions. Energy transition processes...
-
Food Classification from Images Using a Neural Network Based Approach with NVIDIA Volta and Pascal GPUs
Publication
In the paper we investigate the problem of food classification from images, for the Food-101 dataset extended with 31 additional food classes from Polish cuisine. We adopted transfer learning and first measured training times for models such as MobileNet, MobileNetV2, ResNet50, ResNet50V2, ResNet101, ResNet101V2, InceptionV3, InceptionResNetV2, Xception, NasNetMobile and DenseNet, for systems with NVIDIA Tesla V100 (Volta) and...
-
Józef Woźniak prof. dr hab. inż.
People
Prof. dr hab. inż. Józef Woźniak, full professor at Politechnika Gdańska, graduated from the Faculty of Electronics of Politechnika Gdańska in 1971. In 1976 he received the degree of doctor of technical sciences, and in 1991 the degree of doktor habilitowany in the discipline of telecommunications, specializing in teleinformatics. In January 2002 he received the title of professor of technical sciences. In 1994 he was appointed associate professor (profesor nadzwyczajny) at Politechnika...
-
A hybrid approach to optimization of radial inflow turbine with principal component analysis
Publication
Energy conversion efficiency is one of the most important features of power systems, as it greatly influences the economic balance. The efficiency can be increased in many ways. One of them is to optimize individual components of the power plant. In most Organic Rankine Cycle (ORC) systems the power is generated in the turbine, and these systems can benefit from effective turbine optimization. The paper presents the use of two kinds...
-
Optimization of Energetic Train Cooperation
Publication
The article presents possible ways of using energy recovered during regenerative braking of trains. It points out that returning recovered electricity directly to the catenary and using it in the energy cooperation of vehicles can be a no-cost method (requiring no additional infrastructure). The method of energy cooperation between trains, which uses the law of conservation of energy, and its main assumptions are described...