Search results for: CUDA
-
Acceleration of the DGF-FDTD method on GPU using the CUDA technology
Publication: We present a parallel implementation of the discrete Green's function formulation of the finite-difference time-domain (DGF-FDTD) method on a graphics processing unit (GPU). The compute unified device architecture (CUDA) parallel computing platform is applied in the developed implementation. For the sake of example, arrays of Yagi-Uda antennas were simulated with the use of DGF-FDTD on GPU. The efficiency of parallel computations...
-
Optimization of the computational performance of the finite element method on the CUDA architecture (Optymalizacja wydajności obliczeniowej metody elementów skończonych w architekturze CUDA)
Publication: The aim of this dissertation, and of the fellowship completed within the project, was to develop a numerically efficient algorithmic and hardware solution that accelerates the analysis of electromagnetic problems by the finite element method (FEM) with higher-order basis functions. The finite element method in the frequency domain is an efficient and versatile tool for the analysis of microwave circuits (Fig....
-
Parallel implementation of the DGF-FDTD method on GPU Using the CUDA technology
Publication: The discrete Green's function (DGF) formulation of the finite-difference time-domain method (FDTD) is accelerated on a graphics processing unit (GPU) by means of the Compute Unified Device Architecture (CUDA) technology. In the developed implementation of the DGF-FDTD method, a new analytic expression for dyadic DGF derived based on scalar DGF is employed in computations. The DGF-FDTD method on GPU returns solutions that are compatible...
-
Performance evaluation of unified memory and dynamic parallelism for selected parallel CUDA applications
Publication: The aim of this paper is to evaluate the performance of new CUDA mechanisms, unified memory and dynamic parallelism, for real parallel applications compared to standard CUDA API versions. In order to gain insight into the performance of these mechanisms, we decided to implement three applications with control and data flow typical of SPMD, geometric SPMD and divide-and-conquer schemes, which were then used for tests and experiments. Specifically,...
-
Implementation of algebraic procedures on the GPU using CUDA architecture on the example of generalized eigenvalue problem
Publication -
High performance filtering for big datasets from Airborne Laser Scanning with CUDA technology
Publication: There are many studies on the problems of processing big datasets provided by Airborne Laser Scanning (ALS). The processing of point clouds is often executed in stages or on fragments of the measurement set. Therefore, solutions that enable the processing of the entire cloud at once in a simple, fast and efficient way are the subject of much research. In this paper, the authors propose to use General-Purpose computation...
-
A multithreaded CUDA and OpenMP based power‐aware programming framework for multi‐node GPU systems
Publication: In the paper, we propose a framework that allows programming a parallel application for a multi-node system, with one or more GPUs per node, using an OpenMP+extended CUDA API. OpenMP is used for launching threads responsible for management of particular GPUs, and extended CUDA calls make it possible to manage CUDA objects and data and to launch kernels. The framework hides inter-node MPI communication from the programmer who can benefit from...
-
Use of CUDA technology for real-time compression of data from multibeam sonars (Wykorzystanie technologii CUDA do kompresji w czasie rzeczywistym danych pochodzących z sonarów wielowiązkowych)
Publication: The paper presents the design and implementation of a system for compressing data from multibeam sonars using CUDA technology. Lossless data compression methods and parallel processing techniques are discussed and applied. The resulting application was tested for compression speed and compression ratio and compared with other solutions for compressing this type of data.
-
Investigation of Parallel Data Processing Using Hybrid High Performance CPU + GPU Systems and CUDA Streams
Publication: The paper investigates parallel data processing in a hybrid CPU+GPU(s) system using multiple CUDA streams for overlapping communication and computations. This is crucial for efficient processing of data, in particular incoming data stream processing that would naturally be forwarded using multiple CUDA streams to GPUs. Performance is evaluated for various compute time to host-device communication time ratios, numbers of CUDA streams,...
-
Crack monitoring in concrete beams under bending using ultrasonic waves and coda wave interferometry: the effect of excitation frequency on coda
Publication: Concrete is one of the most widely used construction materials in the world. In recent years, various non-destructive testing (NDT) and structural health monitoring (SHM) techniques have been investigated to improve the safety and control of the current condition of concrete structures. This study focuses on micro-crack monitoring in concrete beams. The experimental analysis was carried out on concrete elements subjected to three-point...
-
Performance evaluation of Unified Memory with prefetching and oversubscription for selected parallel CUDA applications on NVIDIA Pascal and Volta GPUs
Publication: The paper presents an assessment of Unified Memory performance with data prefetching and memory oversubscription. Several versions of code are used with: standard memory management, standard Unified Memory and optimized Unified Memory with programmer-assisted data prefetching. Evaluation of execution times is provided for four applications: Sobel and image rotation filters, stream image processing and computational fluid dynamic simulation,...
-
Coda wave interferometry in monitoring the fracture process of concrete beams under bending test
Publication: Early detection of damage is necessary for the safe and reliable use of civil engineering structures made of concrete. Recently, the identification of micro-cracks in concrete has become an area of growing interest, especially using wave-based techniques. In this paper, a non-destructive testing approach for the characterization of the fracture process was presented. Experimental tests were made on concrete beams subjected to mechanical...
-
Computationally Efficient Multi-Objective Optimization of and Experimental Validation of Yagi-Uda Antenna
Publication: In this paper, computationally efficient multi-objective optimization of antenna structures is discussed. As a design case, we consider a multi-parameter planar Yagi-Uda antenna structure, featuring a driven element, three directors, and a feeding structure. Direct optimization of the high-fidelity electromagnetic (EM) antenna model is prohibitive in computational terms. Instead, our design methodology exploits response surface...
-
Low-Cost Multi-Objective Optimization Yagi-Uda Antenna in Multi-Dimensional Parameter Space
Publication: A surrogate-based technique for fast multi-objective optimization of a multi-parameter planar Yagi-Uda antenna structure is presented. The proposed method utilizes response surface approximation (RSA) models constructed using training samples obtained from evaluation of the low-fidelity antenna model. Utilization of the RSA models allows for fast determination of the best possible trade-offs between conflicting objectives in multi-objective...
-
Weakly non-local theories of damage and plasticity based on balance of dissipative material forces
Publication: It is shown that assuming material forces and their corresponding balance equations, together with the classical laws of dynamics in physical space, made it possible to formulate the theoretical foundations of weakly non-local gradient models of damage and plasticity. The relevant balance equations in physical and material space, as well as the first and second laws of thermodynamics, are written in integral form. Next,...
-
Monitoring the fracture process of concrete during splitting using integrated ultrasonic coda wave interferometry, digital image correlation and X-ray micro-computed tomography
Publication: The paper deals with the continuous-time monitoring of mechanical degradation in concrete cubes under splitting. A series of experiments performed with integrated coda wave interferometry (CWI) and digital image correlation (DIC), supported with X-ray micro-computed tomography (micro-CT), is reported. DIC and micro-CT techniques were used to characterize the fracture process in detail. The CWI method proved to be effective in the...
-
Efficient parallel implementation of crowd simulation using a hybrid CPU+GPU high performance computing system
Publication: In the paper we present a modern efficient parallel OpenMP+CUDA implementation of crowd simulation for hybrid CPU+GPU systems and demonstrate its higher performance over CPU-only and GPU-only implementations for several problem sizes including 10 000, 50 000, 100 000, 500 000 and 1 000 000 agents. We show how performance varies for various tile sizes and what CPU–GPU load balancing settings should be preferred for various domain...
-
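Flying Science Café (Latająca Kawiarenka Naukowa)
Publication: The article describes a meeting of the Flying Science Café, which aims to popularize science in the field of structural mechanics and bridges. The café session, entitled „Mosty: cuda architektury i techniki” ("Bridges: wonders of architecture and engineering"), was organized by the Polish Young Academy of the Polish Academy of Sciences (Akademia Młodych Uczonych PAN) and the KoMBo Structural Mechanics Student Research Group (Koło Naukowe Mechaniki Budowli KoMBo).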
-
Multi-core and Multiprocessor Implementation of Numerical Integration in Finite Element Method
Publication: The paper presents techniques for accelerating the numerical integration process that appears in the Finite Element Method. The acceleration is achieved by taking advantage of multi-core and multiprocessor devices. It is shown that using a multi-core implementation with OpenMP and GPU acceleration using the CUDA architecture allows one to achieve speedups by a factor of 5 and 10 on a CPU and GPUs, respectively.
-
Krylov Space Iterative Solvers on Graphics Processing Units
Publication: CUDA architecture was introduced by Nvidia three years ago and since then there have been many promising publications demonstrating a huge potential of Graphics Processing Units (GPUs) in scientific computations. In this paper, we investigate the performance of iterative methods such as cg, minres, gmres, bicg that may be used to solve large sparse real and complex systems of equations arising in computational electromagnetics.
-
What is important for you makes you think about the pandemic differently: moral foundations, pandemic-related fears and convictions (Lo que cada uno consideramos importante nos hace reflexionar sobre la pandemia de forma distinta: fundamentos morales, temores y convicciones sobre la pandemia)
Publication: Building on the Moral Foundations Theory and findings regarding the linkage of values, convictions and beliefs, the aim of the study was to compare people displaying various constellations of moral foundations regarding their tolerance of ambiguity, fear of COVID-19 (FCV), endorsement of COVID-19 conspiracy theories and the extent to which they believed in the effectiveness of five COVID-19 preventive measures. This study was...
-
Towards an efficient multi-stage Riemann solver for nuclear physics simulations
Publication: Relativistic numerical hydrodynamics is an important tool in high energy nuclear science. However, such simulations are extremely demanding in terms of computing power. This paper focuses on improving the speed of solving the Riemann problem with the MUSTA-FORCE algorithm by employing the CUDA parallel programming model. We also propose a new approach to 3D finite difference algorithms, which employ a GPU that uses surface memory....
-
Novel luminescent calixarene-based lanthanide materials: From synthesis and characterization to the selective detection of Fe3+
Publication: Calix[n]arene-based coordination networks are an emerging class of materials with intriguing properties resulting from the presence of the cavity-like structure of the macrocycle and metallic nodes. In this work, four novel luminescent materials based on calix[4]arene-carboxylate and lanthanides (Eu3+ and Tb3+) were prepared by two synthetic approaches, solvothermal (CDA-Eu-ST) and slow diffusion (CDA-Eu-RT, CDA-Tb-RT, CTA-Tb-complex)...
-
GPU based implementation of Temperature-Vegetation Dryness Index for AVHRR3 Satellite Data
Publication: The paper presents an implementation of the TVDI (Temperature-Vegetation-Dryness Index) algorithm on a GPU (Graphics Processing Unit). Calculation of this index is based on LST (Land Surface Temperature) and NDVI (Normalized Difference Vegetation Index). The results discussed are based on multi-spectral imagery retrieved from AVHRR3 sensors for the area of Poland. All phases of the TVDI implementation on GPU are modified with respect to the CUDA platform....
-
Performance evaluation of parallel background subtraction on GPU platforms
Publication: Implementation of the background subtraction algorithm on parallel GPUs is presented. The algorithm processes video streams and extracts foreground pixels. The work focuses on optimizing parallel algorithm implementation by taking into account specific features of the GPU architecture, such as memory access, data transfers and work group organization. The algorithm is implemented in both OpenCL and CUDA. Various optimizations of...
-
Optimizing the computation of a parallel 3D finite difference algorithm for graphics processing units
Publication: This paper explores the possibilities of using a graphics processing unit for complex 3D finite difference computation via MUSTA‐FORCE and WENO algorithms. We propose a novel algorithm based on the new properties of CUDA surface memory optimized for 2D spatial locality and compare it with 3D stencil computations carried out via shared memory, which is currently considered to be the best approach. A case study was performed for...
-
Performance evaluation of the parallel object tracking algorithm employing the particle filter
Publication: An algorithm based on particle filters is employed to track moving objects in video streams from fixed and non-fixed cameras. Particle weighting is based on color histograms computed in the iHLS color space. Particle computations are parallelized with the CUDA framework. The algorithm was tested on various GPU devices: a desktop GPU card, a mobile chipset and two embedded GPU platforms. The processing speed depending on the number...
-
Use of ICT infrastructure for teaching HPC
Publication: In this paper we look at modern ICT infrastructure as well as the curriculum used for conducting a contemporary course on high performance computing taught over several years at the Faculty of Electronics, Telecommunications and Informatics, Gdansk University of Technology, Poland. We describe the infrastructure in the context of teaching parallel programming at the cluster level using MPI, node level using OpenMP and CUDA. We present...
-
Dynamic GPU power capping with online performance tracing for energy efficient GPU computing using DEPO tool
Publication: GPU accelerators have become essential to the recent advance in computational power of high-performance computing (HPC) systems. Current HPC systems' power demand of approximately 20–30 megawatts has resulted in increasing CO2 emissions and energy costs and necessitates increasingly complex cooling systems. This is a very real challenge. To address this, new mechanisms of software power control could be employed. In this...
-
Education for sustainable development in a systemic perspective
Publication -
Competences of academic tutors – research among participants of the project “Masters of Didactics”
Publication -
Access to Higher Education: the Adult Learners' Perspective
Publication -
Determinants of the effectiveness of education in upper secondary schools for adults (Uwarunkowania efektywności kształcenia w liceum dla dorosłych)
Publication: The subject of adult schooling in Poland seems to have been forgotten for more than 20 years. Although a few studies have shown the supportive educational role that schools for adults play for their learners, these institutions are not held in high regard by society. The aim of this monograph is to verify that assessment on the basis of a study of learning outcomes in schools for adults...
-
Relation between benchmark displacement velocity and seismic activity caused by underground longwall exploitation
Publication -
Possibility to apply unified methodology in vibration analysis for long lasting and impulse sources, in terms of influence on people in buildings
Publication -
Building the Learning Environment for Sustainable Development: a Co-creation approach
Publication: Education for sustainable development supports the improvement of knowledge, skills, attitudes and behaviors related to global challenges such as climate change, global warming and environmental degradation, among others. It is increasingly taking place through projects based on information and communication technologies. The effectiveness of the actions taken depends not only on the quality of the project activities or the...
-
Exploring perceptions of pro environmental educational mobile applications based on semantic field analysis
Publication: The paper aims to identify multidimensional perceptions of mobile apps by their users. Special attention has been paid to pro-environmental educational apps. Semantic field analysis and measurement of emotional temperatures were performed to achieve this goal. Transcripts from seven focus group interviews were used as research material. The results indicate that functionality based on a reward or benefit system reinforces environmentally...
-
Implementation of FDTD-Compatible Green's Function on Graphics Processing Unit
Publication: In this letter, an implementation of the finite-difference time-domain (FDTD)-compatible Green's function on a graphics processing unit (GPU) is presented. Recently, a closed-form expression for this discrete Green's function (DGF) was derived, which facilitates its applications in the FDTD simulations of radiation and scattering problems. Unfortunately, implementation of the new DGF formula in software requires a multiple precision...
-
Management of dredging works and marine structures (Zarządzanie w zakresie robót pogłębiarskich oraz konstrukcji morskich)
Publication: Risk management issues in projects involving dredging and marine structures. Results of the ICE, IADC and CDA conference. Specific problems of contract management.
-
Why is it necessary to build nuclear power plants in Poland? (Dlaczego istnieje w Polsce konieczność budowy elektrowni jądrowych?)
Publication: The need to build nuclear power plants is presented in economic, energy and environmental terms, in view of the growing demand for electricity in Poland.
-
Why is it necessary to build nuclear power plants in Poland? (Dlaczego istnieje w Polsce konieczność budowy elektrowni jądrowych?)
Publication: The benefits of introducing nuclear power generation in Poland are presented and justified. The development of nuclear power is presented in economic, energy and environmental terms.
-
Why is it necessary to build nuclear power plants in Poland? (Dlaczego istnieje w Polsce konieczność budowy elektrowni jądrowych?)
Publication: The benefits of introducing nuclear power generation in Poland are presented and justified. The development of nuclear power is presented in economic, energy and environmental terms.
-
Welding technologies in shipbuilding (Technologie spawalnicze w okrętownictwie)
Publication: Briefly discussed are classical thermal welding technologies (cutting and MMA, SAW, CAW, GMAW, MIG-MAG welding), developments in plasma, laser and hybrid welding, and friction stir welding (FSW).
-
Energy, economic and environmental aspects of the development of nuclear power plants (Aspekty energetyczne, ekonomiczne i ekologiczne rozwoju elektrowni jądrowych)
Publication: An abridged version of a paper published at the International Scientific and Technical Conference ''Elektrownie jądrowe dla Polski- NPPP 2006'' (Nuclear Power Plants for Poland), Warsaw, 1-2 June 2006, is presented. The need to build nuclear power plants in Poland is justified.
-
Katamaran ''Energa Solar'' zasilany energią słoneczną = Solar energy ''Energa Solar'' catamaran
Publication: The paper presents ENERGA Solar, a unique solar-powered vessel.
-
The catamarans George and Energa Solar
Publication: The paper presents two unique vessels: George, a human-powered catamaran, and Energa Solar, a solar-powered one.
-
Tackling Air Pollution in Cities with Modelling and Simulation: Remote Group Model Building as an Educational Tool Supporting System Dynamics Modelling
Publication -
Digital competence learning in secondary adult education in Finland and Poland
Publication -
Implementation of FDTD-compatible Green's function on heterogeneous CPU-GPU parallel processing system
Publication: This paper presents an implementation of the FDTD-compatible Green's function on a heterogeneous parallel processing system. The developed implementation simultaneously applies the computational power of the central processing unit (CPU) and the graphics processing unit (GPU) to the computational tasks best suited to each architecture. Recently, a closed-form expression for this discrete Green's function (DGF) was derived, which facilitates...
-
Urban Food Self-Production in the Perspective of Social Learning Theory: Empowering Self-Sustainability
Publication: Urban food production is becoming an increasingly significant topic in the context of climate change and food security. Conducting research on this subject is becoming an essential element of urban development, deepening knowledge regarding the benefits, challenges, and potential for the development of urban agriculture as an alternative form of food production. Responding to this need, this monograph presents the results of...