Filtry
wszystkich: 3109
-
Katalog
- Publikacje 1642 wyników po odfiltrowaniu
- Czasopisma 9 wyników po odfiltrowaniu
- Konferencje 10 wyników po odfiltrowaniu
- Osoby 48 wyników po odfiltrowaniu
- Projekty 1 wyników po odfiltrowaniu
- Aparatura Badawcza 2 wyników po odfiltrowaniu
- Kursy Online 12 wyników po odfiltrowaniu
- Wydarzenia 1 wyników po odfiltrowaniu
- Dane Badawcze 1384 wyników po odfiltrowaniu
wyświetlamy 1000 najlepszych wyników Pomoc
Wyniki wyszukiwania dla: MULTI-CORE CPU
-
Acceleration of Electromagnetic Simulations on Reconfigurable FPGA Card
PublikacjaIn this contribution, the hardware acceleration of electromagnetic simulations on the reconfigurable field-programmable-gate-array (FPGA) card is presented. In the developed implementation of scientific computations, the matrix-assembly phase of the method of moments (MoM) is accelerated on the Xilinx Alveo U200 card. The computational method involves discretization of the frequency-domain mixed potential integral equation using...
-
Tuning matrix-vector multiplication on GPU
PublikacjaA matrix times vector multiplication (matvec) is a cornerstone operation in iterative methods of solving large sparse systems of equations such as the conjugate gradients method (cg), the minimal residual method (minres), the generalized residual method (gmres) and exerts an influence on overall performance of those methods. An implementation of matvec is particularly demanding when one executes computations on a GPU (Graphics...
-
Benchmarking Performance of a Hybrid Intel Xeon/Xeon Phi System for Parallel Computation of Similarity Measures Between Large Vectors
PublikacjaThe paper deals with parallelization of computing similarity measures between large vectors. Such computations are important components within many applications and consequently are of high importance. Rather than focusing on optimization of the algorithm itself, assuming specific measures, the paper assumes a general scheme for finding similarity measures for all pairs of vectors and investigates optimizations for scalability...
-
A memory efficient and fast sparse matrix vector product on a Gpu
PublikacjaThis paper proposes a new sparse matrix storage format which allows an efficient implementation of a sparse matrix vector product on a Fermi Graphics Processing Unit (GPU). Unlike previous formats it has both low memory footprint and good throughput. The new format, which we call Sliced ELLR-T has been designed specifically for accelerating the iterative solution of a large sparse and complex-valued system of linear equations arising...
-
Implementation of Addition and Subtraction Operations in Multiple Precision Arithmetic
PublikacjaIn this paper, we present a digital circuit of arithmetic unit implementing addition and subtraction operations in multiple-precision arithmetic (MPA). This adder-subtractor unit is a part of MPA coprocessor supporting and offloading the central processing unit (CPU) in computations requiring precision higher than 32/64 bits. Although addition and subtraction operations of two n-digit numbers require O(n) operations, the efficient...
-
Evaluation the effectiveness of virtual machine integrated with CPU
PublikacjaIn the paper effectiveness of example CPU with integrated virtual machine is presented. The idea and implementation of virtual machine is shown. In next sections reference CPU and sample virtual machine is described. Finally optimality of the translation process is analysed.
-
Auto-tuning methodology for configuration and application parameters of hybrid CPU + GPU parallel systems based on expert knowledge
PublikacjaAuto-tuning of configuration and application param- eters allows to achieve significant performance gains in many contemporary compute-intensive applications. Feasible search spaces of parameters tend to become too big to allow for exhaustive search in the auto-tuning process. Expert knowledge about the utilized computing systems becomes useful to prune the search space and new methodologies are needed in the face of emerging heterogeneous...
-
Wielopoziomowy przekształtnik trakcyjny SiC z izolacją od sieci 3kV DC realizowaną za pomocą transformatorów 30kHz do napędów EZT
PublikacjaW referacie przedstawiono wielopoziomowy izolowany kaskadowy przekształtnik DC-AC z tranzystorami SiC MOSFET 1,2kV, przeznaczony do napędów elektrycznych zespołów trakcyjnych (EZT). Zaproponowana konstrukcja przekształtnika, przeznaczonego do pracy przy zasilaniu z sieci trakcyjnej 3kV DC, spełnia założenia energoelektronicznego transformatora trakcyjnego (z ang. Power Electronic Traction Transformer). Budowa modułowa z niskonapięciowych...
-
Parallelization of large vector similarity computations in a hybrid CPU+GPU environment
PublikacjaThe paper presents design, implementation and tuning of a hybrid parallel OpenMP+CUDA code for computation of similarity between pairs of a large number of multidimensional vectors. The problem has a wide range of applications, and consequently its optimization is of high importance, especially on currently widespread hybrid CPU+GPU systems targeted in the paper. The following are presented and tested for computation of all vector...
-
Core Evidence
Czasopisma -
Implementation of FDTD-compatible Green's function on heterogeneous CPU-GPU parallel processing system
PublikacjaThis paper presents an implementation of the FDTD-compatible Green's function on a heterogeneous parallel processing system. The developed implementation simultaneously utilizes computational power of the central processing unit (CPU) and the graphics processing unit (GPU) to the computational tasks best suited to each architecture. Recently, closed-form expression for this discrete Green's function (DGF) was derived, which facilitates...
-
Efficient parallel implementation of crowd simulation using a hybrid CPU+GPU high performance computing system
PublikacjaIn the paper we present a modern efficient parallel OpenMP+CUDA implementation of crowd simulation for hybrid CPU+GPU systems and demonstrate its higher performance over CPU-only and GPU-only implementations for several problem sizes including 10 000, 50 000, 100 000, 500 000 and 1 000 000 agents. We show how performance varies for various tile sizes and what CPU–GPU load balancing settings shall be preferred for various domain...
-
Nodal models of Pressurized Water Reactor core for control purposes – A comparison study
PublikacjaThe paper focuses on the presentation and comparison of basic nodal and expanded multi-nodal models of the Pressurized Water Reactor (PWR) core, which includes neutron kinetics, heat transfer between fuel and coolant, and internal and external reactivity feedback processes. In the expanded multi-nodal model, the authors introduce a novel approach to the implementation of thermal power distribution phenomena into the multi-node...
-
On Wrinkling in Sandwich Panels with an Orthotropic Core
PublikacjaThis paper deals with the local loss of stability (wrinkling) problem of a thin facing of a sandwich panel. Classical solutions to the problem of facing instability resting on a homogeneous and isotropic substructure (a core) are compared. The relations between strain energy components associated with different forms of core deformations are discussed. Next, a new solution for the orthotropic core is presented in detail, which...
-
Study on CPU and RAM Resource Consumption of Mobile Devices using Streaming Services
PublikacjaStreaming multimedia services have become very popular in recent years, due to the development of wireless networks. With the growing number of mobile devices worldwide, service providers offer dedicated applications that allow to deliver on-demand audio and video content anytime and everywhere. The aim of this study was to compare different streaming services and investigate their impact on the CPU and RAM resources, with respect...
-
SciPost Physics Core
Czasopisma -
Investigation of Parallel Data Processing Using Hybrid High Performance CPU + GPU Systems and CUDA Streams
PublikacjaThe paper investigates parallel data processing in a hybrid CPU+GPU(s) system using multiple CUDA streams for overlapping communication and computations. This is crucial for efficient processing of data, in particular incoming data stream processing that would naturally be forwarded using multiple CUDA streams to GPUs. Performance is evaluated for various compute time to host-device communication time ratios, numbers of CUDA streams,...
-
Wiktoria Wojnicz dr hab. inż.
OsobyDSc in Mechanics (in the field of Biomechanics) - Lodz Univeristy of Technology, 2019 PhD in Mechanics (in the field of Biomechanics) - Lodz Univeristy of Technology, 2009 (with distinction) Publikacje z listy MNiSW (2009 - ) Wojnicz W., Wittbrodt E., Analysis of muscles' behaviour. Part I. The computational model of muscle. Acta of Bioengineering and Biomechanics, Vol. 11, No.4, 2009, p. 15-21 Wojnicz W., Wittbrodt E., Analysis...
-
Method of determining the residual fluxes in transformer core
PublikacjaThe article presents the method of calculating the residual induction in transformer columns. The method is based on measurement of the magnetic induction in selected points around the transformer core. The values of residual induction are calculated as linear combination of the results of measurement.
-
Bioactive core material for porous load-bearing implants
PublikacjaSo far state of knowledge on biodegradable materials is reviewed. Among a variety of investigated materials, those composed of polymers and ceramics may be considered as only candidates for a core material in porous titanium alloy. The collagen and chitosan among natural polymers, polyhydroxy acids among synthetic polymers, and hydroxyapatite and tricalcium phosphate among ceramics are proposed for further research. Three essential...