Search results for: PARALLEL-PREFIX ADDER

Praca opisuje zagadnienia modelowania i napędzania manipulatorów równoległych. Cechą charakterystyczną manipulatorów równoległych jest występowanie jednego lub kilku łańcuchów kinematycznych zamkniętych (gałęzi równoległych). Standardowo, konstrukcje takie są napędzane jedynie silnikami montowanymi w parach kinematycznych łączących łańcuchy kinematyczne z podstawą. Niekiedy konstrukcje takie są układami napędzanymi nadmiarowo (liczba...

Generation of conformance test suites for parallel and distributed languages and APIS.

Publication

Ł. Garstecki

- Year 2003

Artykuł zarysowuje nową metodologię systematycznego tworzenia Zestawów Testów Zgodności. Testowanie zgodności ma na celu sprawdzenie, czy implementacja jest zgodna ze swoją specyfikacją, co jest szczególnie ważne w środowiskach równoległych i rozproszonych, gdzie musi ze sobą współpracować wiele różnych pakietów. Autor rozpoczął swoje badania w dziedzinie testowania zgodności dla języka równoległego sterowanego danymi Athapascan,...

Using GPUs for Parallel Stencil Computations in Relativistic Hydrodynamic Simulation

Publication

S. Cygert
D. Kikoła
J. Porter-Sobieraj
J. Sikorski
M. Słodkowski

- Year 2014

This paper explores the possibilities of using a GPU for complex 3D finite difference computation. We propose a new approach to this topic using surface memory and compare it with 3D stencil computations carried out via shared memory, which is currently considered to be the best approach. The case study was performed for the extensive computation of collisions between heavy nuclei in terms of relativistic hydrodynamics.

Full text to download in external service

Investigation of Mechanical and Microstructural Properties of Welded Specimens of AA6061-T6 Alloy with Friction Stir Welding and Parallel Friction Stir Welding Methods

Publication

A. Ghiasvand
M. M. Yavari
J. Tomków
J. W. Grimaldo Guerrero
H. Kheradmandan
A. Dorofeev
S. Memon
H. A. Derazkola

- Materials - Year 2021

The present study investigates the effect of two parameters of process type and tool offset on tensile, microhardness, and microstructure properties of AA6061-T6 aluminum alloy joints. Three methods of Friction Stir Welding (FSW), Advancing Parallel-Friction Stir Welding (AP-FSW), and Retreating Parallel-Friction Stir Welding (RP-FSW) were used. In addition, four modes of 0.5, 1, 1.5, and 2 mm of tool offset were used in two welding...

Full text available to download

Massively parallel linear-scaling Hartree–Fock exchange and hybrid exchange–correlation functionals with plane wave basis set accuracy

Publication

J. Dziedzic
J. C. Womack
R. Ali
C. Skylaris

- JOURNAL OF CHEMICAL PHYSICS - Year 2021

We extend our linear-scaling approach for the calculation of Hartree–Fock exchange energy using localized in situ optimized orbitals [Dziedzic et al., J. Chem. Phys. 139, 214103 (2013)] to leverage massive parallelism. Our approach has been implemented in the ONETEP (Order-N Electronic Total Energy Package) density functional theory framework, which employs a basis of non-orthogonal generalized Wannier functions (NGWFs) to achieve...

Full text available to download

Performance evaluation of the parallel object tracking algorithm employing the particle filter

Publication

G. Szwoch

- Year 2016

Full text to download in external service

Molecular Diffusion Simulation on ARUZ – Massively-parallel FPGA-based Machine

Publication

R. Kielbik
K. Halagan
K. Rudnicki
P. Polanowski
G. Jablonski
J. Jung

- Year 2021

Full text to download in external service

Scheduling with precedence constraints: mixed graph coloring in series-parallel graphs.

Publication

H. Furmańczyk
A. Kosowski
P. Żyliński

- Year 2008

W pracy rozważono problem kolorowania grafów mieszanych, opisujący zagadnienie szeregowania zadań, w którym zależności czasowe zadań mają charakter częściowego porządku lub wzajemnego wykluczania. Dla przypadku, w którym graf zależności jest szeregowo-równoległy, podano algorytm rozwiązujący problem optymalnie w czasie $O(n^3.376 * log n)$.

Full text to download in external service

Parallel implementation of the DGF-FDTD method on GPU Using the CUDA technology

Publication

- Year 2016

The discrete Green's function (DGF) formulation of the finite-difference time-domain method (FDTD) is accelerated on a graphics processing unit (GPU) by means of the Compute Unified Device Architecture (CUDA) technology. In the developed implementation of the DGF-FDTD method, a new analytic expression for dyadic DGF derived based on scalar DGF is employed in computations. The DGF-FDTD method on GPU returns solutions that are compatible...

Full text to download in external service

Effective methods for functional confermance testing of parallel and distributed programming libraries.

Publication

Ł. Garstecki

- Year 2004

Rozprawa przedstawia kompletna metodykę tworzenia Zestawów Testów Zgodności dla języków programowania, bibliotek i API, ze szczególnym uwzględnieniem języków i bibliotek programowania równoleglego i rozproszonego. Autor rozpoczął badania w dziedzinie testowania zgodności dla bibliotek programowania równoleglego i rozproszonego, ale Metodyka Kolejnych zawężeń (ang. Consecutive Confinenments Method -CoCoM, stworzona przez Autora,...

Towards Efficient Parallel Image Processing on Cluster Grids Using GIMP.

Publication

- Year 2004

Ze względu na fakt, iż niewielu użytkowników posiada wiedzę niezbędną do wykorzystania niskopoziomowych bibliotek programowania równoległego w celu przyspieszenia działania programów operujących na obrazach, proponujemy plugin do znanej aplikacji GIMP, który umożliwia potokowe wykonanie szeregu filtrów na obrazach załadowanych przez plugin. Prezentujemy szczegóły implementacyjne, scenariusze testowe i wyniki na klastrach, potencjalnie...

Redundantly Actuated 3RRR Parallel Planar Manipulator - Numerical Analyses of its Dynamics Sensitivity on Modifications of its Platform’s Inertia Parameters

Publication

K. Lipiński

- Solid State Phenomena - Year 2013

In the paper, numerical analyses, as well as dynamics of a complex mechanism, are presented. Two objectives are crucial for the paper: inverse dynamic model is needed (dedicated to be use in the model predictive controller); an identification method is searched (some trajectory parameters are controlled, when specific trajectory is tracked under an open-loop model-based control), as selected parameters must be identified for the...

From the Dynamic Lattice Liquid Algorithm to the Dedicated Parallel Computer – mDLL Machine

Publication

J. Jung
R. Kiełbik
K. Rudnicki
K. Hałagan
P. Polanowski
A. Sikorski

- Computational Methods in Science and Technology - Year 2018

Full text to download in external service

Parallel in vitro and in silico investigations into anti-inflammatory effects of non-prenylated stilbenoids

Publication

V. Leláková
K. Šmejkal
K. Jakubczyk
O. Veselý
P. Landa
J. Václavík
P. Bobáľ
H. Pížová
V. Temml
T. Steinacher... and 4 others

- Food Chemistry - Year 2019

Full text to download in external service

Makespan minimization of multi-slot just-in-time scheduling on single and parallel machines

Publication

D. Dereniowski
W. Kubiak

- JOURNAL OF SCHEDULING - Year 2010

Artykuł podejmuje problem szeregowania zadań przy założeniu podziału czasu na sloty jednakowej długości, gdzie każde z zadań ma ustaloną długość oraz czas jego zakończenia, który jest relatywny do końca slotu. Problem znalezienia uszeregowania polega na dokonaniu przydziału zadań do poszczególnych slotów, przy czym w ogólności długość zadania może wymuszać sytuację, w której zadańie jest realizowane nie tylko w slocie, w którym...

Full text to download in external service

Optimizing the computation of a parallel 3D finite difference algorithm for graphics processing units

Publication

J. Porter-Sobieraj
S. Cygert
K. Daniel
J. Sikorski
M. Słodkowski

- CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE - Year 2015

This paper explores the possibilities of using a graphics processing unit for complex 3D finite difference computation via MUSTA‐FORCE and WENO algorithms. We propose a novel algorithm based on the new properties of CUDA surface memory optimized for 2D spatial locality and compare it with 3D stencil computations carried out via shared memory, which is currently considered to be the best approach. A case study was performed for...

Full text to download in external service

Generating reliable conformance test suites for parallel and distributed languages, libraries, and APIs.

Publication

Ł. Garstecki

- Year 2004

Artykuł nakreśla nową metodykę dla tworzenia Zestawów Testów Zgodności (ZTG) dla języków, bibliotek i API programowania równoległego i rozproszonego. Autor rozpoczął swoje badania w zakresie testowania zgodności dla języka równoległego sterowanego danymi Athapascan, opracował metodykę dla projektowania i analizowania ZTG nazwaną Metodą Kolejnych Zawężeń (ang. Consecutive Confinements Methods - CoCoM), stworzył narzędzie CTS Designer,...

New user-guided and ckpt-based checkpointing libraries for parallel MPI applications

Publication

- Year 2005

Praca prezentuje szczególy projektowe i implementacyjne jak również wyniki wydajnościowe dwóch nowych bibliotek checkpointingu opracowanych przez autorów dla równoległych aplikacji MPI. Pierwsz biblioteka, tzw. user-guided wymaga od programisty dostarczenia funkcji pakujących i rozpakowujących stan procesu, ale dostarcza łatwego w użyciu API z wykorzystaniem stałych MPI. Wykorzystuje funkcje I/O MPI-2 lub dedykowany proces master...

Multi-source-supplied parallel hybrid propulsion of the inland passenger ship STA.H. Research work on energy efficiency of a hybrid propulsion system operating in the electric motor drive mode

Publication

- Polish Maritime Research - Year 2013

In the Faculty of Ocean Engineering and Ship Technology, Gdansk University of Technology, design has recently been developed of a small inland ship with hybrid propulsion and supply system. The ship will be propelled by a specially designed so called parallel hybrid propulsion system. The work was aimed at carrying out the energy efficiency analysis of a hybrid propulsion system operating in the electric motor drive mode and at...

Full text available to download

High power, zero ripples active filtering system with power modules operating in parallel

Publication

D. Wojciechowski
R. Strzelecki

- Year 2010

Full text to download in external service

ARUZ — Large-scale, massively parallel FPGA-based analyzer of real complex systems

Publication

R. Kiełbik
K. Hałagan
W. Zatorski
J. Jung
J. Ulański
A. Napieralski
K. Rudnicki
P. Amrozik
G. Jabłoński
D. Stożek... and 4 others

- COMPUTER PHYSICS COMMUNICATIONS - Year 2018

Full text to download in external service

Efficient parallel algorithms in global optimization of potential energy functions for peptides, proteins, and crystals

Publication

J. Lee
J. Pillardy
C. Czaplewski
Y. Arnautova
D. Ripoll
A. Liwo
K. Gibson
R. Wawak
H. Scheraga

- COMPUTER PHYSICS COMMUNICATIONS - Year 2000

Full text to download in external service

Parallel simulations of electrophysiological phenomena in myocardium on large 32 and 64-bit Linux clusters.

Publication

- Year 2004

W pracy podjęto badania i przeprowadzono symulacje zjawisk elektrofizjologicznych w mięśniu sercowym z wykorzystaniem wytworzonego w tym celu oprogramowania równoległego opartego na MPI. Zaimplementowano i zbadano ulepszenia kodu prowadzące do uzyskania dobrej skalowalności oraz przeprowadzono testy wydajności na najnowszych 32 i 64-bitowych klastrach linuksowych. Praca stanowi próbę równoległej implementacji znanego podejścia...

Portable parallel simulator using MPI for 2D and 3D domains: design and performance testing

Publication

- Year 2005

W artykule prezentujemy szczegóły projektowo-implementacyjne naszego modularnego kodu symulacyjnego z wykorzystaniem MPI, w tym nakładaniem obliczeń i komunikacji. Podkreślamy modularność naszej implementacji pozwalającą na łatwą adaptację kodu dla innych zasotosowań. Prezentujemy związek pomiędzy przyspieszeniem obliczeń, rozmiarem i kształtami trójwymiarowych domen z różnymi stosunkami liczby węzłów aktualizowanych przez procesor...

Modelling of First- and Second-order Chemical Reactions on ARUZ – Massively-parallel FPGA-based Machine

Publication

P. Amrozik
K. Halagan
K. Rudnicki

- Year 2021

Full text to download in external service

Carbonized Lanthanum-Based Metal-Organic Framework with Parallel Arranged Channels for Azo-Dye Adsorption

Publication

K. Cendrowski
K. Opała
E. Mijowska

- Nanomaterials - Year 2020

Full text to download in external service

Construction of highly stable parallel two-step Runge-Kutta methods for delay differential equations

Publication

Z. Bartoszewski
Z. Jackiewicz

- JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS - Year 2008

W pracy pokazano, że każda A-stabilna dwukrokowa metoda Rungego-Kutty dla równań różniczkowych zwyczajnych rzędu p1 i rzędu etapowego q=p1 może być uogólniona do P-stabilnej metody dla równań różniczkowych z opóźnieniem zbieżnej jednostajnie z rzędem p=p1.

Full text to download in external service

Feedline Alterations for Optimization-Based Design of Compact Super-Wideband MIMO Antennas in Parallel Configuration

Publication

M. ul Haq
S. Kozieł

- IEEE Antennas and Wireless Propagation Letters - Year 2019

This letter presents a technique for size reduction of wideband multiple-input-multiple-output (MIMO) antennas. Our approach is a two-stage procedure. At the first stage, the antenna structure is modified to improve its impedance matching. This is achieved through incorporation of an n-section tapered feedline, followed by reoptimization of geometry parameters. Reducing the maximum in-band reflection well beyond the acceptance...

Full text to download in external service

Mechanism of recognition of parallel G-quadruplexes by DEAH/RHAU helicase DHX36 explored by molecular dynamics simulations

Publication

K. Amirul
M. Jurkowski
J. Czub
M. Kogut
K. A. Hossain

- Computational and Structural Biotechnology Journal - Year 2021

Full text to download in external service

Taking advantage of the shared explicit cache system based critical sections in the shared memory parallel architectures

Publication

T. Madajczak

- Year 2006

Artykuł prezentuje nową metodę implementacji sekcji krytycznych w równoległych architekturach z pamięcią współdzieloną, takich jak systemy zintegrowane wielowątkowe wieloprocesorowe. Metoda stanowi modyfikację i rozbudowanie metody zwanej Folding, dostępnej w procesorach sieciowych oraz jest w założeniach podobna do techniki zwanej cache-based locking. W porównaniu do dostępnych metod, nowa metoda usuwa problemy skalowalności i...

Molecular Simulations Using Boltzmann’s Thermally Activated Diffusion - Implementation on ARUZ – Massively-parallel FPGA-based Machine

Publication

G. Jablonski
P. Amrozik
K. Halagan

- Year 2021

Full text to download in external service

Performance evaluation of Unified Memory with prefetching and oversubscription for selected parallel CUDA applications on NVIDIA Pascal and Volta GPUs

Publication

- JOURNAL OF SUPERCOMPUTING - Year 2019

The paper presents assessment of Unified Memory performance with data prefetching and memory oversubscription. Several versions of code are used with: standard memory management, standard Unified Memory and optimized Unified Memory with programmer-assisted data prefetching. Evaluation of execution times is provided for four applications: Sobel and image rotation filters, stream image processing and computational fluid dynamic simulation,...

Full text available to download

Executing Multiple Simulations in the MERPSYS Environment

Publication

P. Rościszewski

- Year 2016

The chapter investigates the steps necessary to perform a simulation instance in the MERPSYS environment and discusses potential limitations in case when vast numbers of simulations are required. An extended architecture is proposed which includes a JMS-based simulation queue and multiple distributed simulators, overcoming the potential bottlenecks. The chapter introduces also methods for preparing suites of multiple simulations...

Full text to download in external service

Air Pollution Research Based on Spider Web and Parallel Continuous Particulate Monitoring—A Comparison Study Coupled with Identification of Sources

Publication

A. Stojanowska
T. Mach
T. Olszowski
J. Bihałowicz
M. Górka
J. Rybak
M. Rajfur
P. Świsłowski

- Minerals - Year 2021

Full text to download in external service

Improved conformational space annealing method to treat β-structure with the UNRES force-field and to enhance scalability of parallel implementation

Publication

C. Czaplewski
A. Liwo
J. Pillardy
S. Ołdziej
H. Scheraga

- POLYMER - Year 2004

Full text to download in external service

Checkpointing of Parallel MPI Applications using MPI One-sided API with Support for Byte-addressable Non-volatile RAM

Publication

P. Dorożyński
P. Czarnul
A. Malinowski
K. Czuryło
Ł. Dorau
M. Maciejewski
P. Skowron

- Year 2016

The increasing size of computational clusters results in an increasing probability of failures, which in turn requires application checkpointing in order to survive those failures. Traditional checkpointing requires data to be copied from application memory into persistent storage medium, which increases application execution time as it is usually done in a separate step. In this paper we propose to use emerging byte-addressable...

Full text to download in external service

Grid Implementation of a Parallel Multiobjective Genetic Algorithm for Optimized Allocation of Chlorination Stations in Drinking Water Distribution Systems: Chojnice Case Study

Publication

- IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS - Year 2008

Solving multiobjective optimization problems requires suitable algorithms to find a satisfactory approximation of a globally optimal Pareto front. Furthermore, it is a computationally demanding task. In this paper, the grid implementation of a distributed multiobjective genetic algorithm is presented. The distributed version of the algorithm is based on the island algorithm with forgetting island elitism used instead of a genetic...

Full text to download in external service

Multiprocessor implementation of Parallel Multiobjective Genetic Algorithm for Optimized Allocation of Chlorination Stations in Drinking Water Distribution System - a new water quality model approach

Publication

G. Ewald
T. Zubowicz
M. Brdys

- IFAC Proceedings Volumes - Year 2013

Full text to download in external service

Multiprocessor Implementation of Parallel Multiobjective Genetic Algorithm for Optimized Allocation of Chlorination Stations in Drinking Water Distribution System a New Water Quality Model Approach

Publication

- Year 2013

The Critical Infrastructure Systems (CISs) have received in recent years a considerable attention due to their heavy impact on sustainable development of modern societies. Most CISs may be classied as large scale complex systems of network structure, in uenced by strong interactions form the surrounding environment, internal and external interconnections. The later is a result of inter-CIS dependencies. The control, monitoring...

Full text to download in external service

Repair Augmentation of Unstable, Complete Vertical Meniscal Tears With Bone Marrow Venting Procedure: A Prospective, Randomized, Double-Blind, Parallel-Group, Placebo-Controlled Study

Publication

R. Kaminski
K. Kulinski
K. Kozar-Kaminska
M. Wasko
M. Langner
S. Pomianowski

- ARTHROSCOPY-THE JOURNAL OF ARTHROSCOPIC AND RELATED SURGERY - Year 2019

Full text to download in external service

Implementation of multi-operand addition in FPGA using high-level synthesis

Publication

- Przegląd Elektrotechniczny - Year 2018

The paper presents the results of high-level synthesis (HLS) of multi-operand adders in FPGA using the Vivado Xilinx environment. The aim was to estimate the hardware amount and latency of adders described in C-code. The main task of the presented experiments was to compare the implementations of the carry-save adder (CSA) type multi-operand adders obtained as the effect of the HLS synthesis and those based on the basic component...

Full text available to download

Implementation of Molecular Dynamics and Its Extensions with the Coarse-Grained UNRES Force Field on Massively Parallel Systems: Toward Millisecond-Scale Simulations of Protein Structure, Dynamics, and Thermodynamics

Publication

A. Liwo
S. Ołdziej
C. Czaplewski
D. Kleinerman
P. Blood
H. Scheraga

- Journal of Chemical Theory and Computation - Year 2010

Full text to download in external service

Implementation of Addition and Subtraction Operations in Multiple Precision Arithmetic

Publication

K. Rudnicki
T. Stefański

- Year 2019

In this paper, we present a digital circuit of arithmetic unit implementing addition and subtraction operations in multiple-precision arithmetic (MPA). This adder-subtractor unit is a part of MPA coprocessor supporting and offloading the central processing unit (CPU) in computations requiring precision higher than 32/64 bits. Although addition and subtraction operations of two n-digit numbers require O(n) operations, the efficient...

Full text to download in external service

The chapter analyses the K-Means algorithm in its parallel setting. We provide detailed description of the algorithm as well as the way we paralellize the computations. We identiﬁed complexity of the particular steps of the algorithm that allows us to build the algorithm model in MERPSYS system. The simulations with the MERPSYS have been performed for diﬀerent size of the data as well as for diﬀerent number of the processors used for the computations. The results we got using the model have been compared to the results obtained from real computational environment.

Publication

J. Szymański

- Year 2016

The chapter analyses the K-Means algorithm in its parallel setting. We provide detailed description of the algorithm as well as the way we paralellize the computations. We identiﬁed complexity of the particular steps of the algorithm that allows us to build the algorithm model in MERPSYS system. The simulations with the MERPSYS have been performed for diﬀerent size of the data as well as for diﬀerent number of the processors used...

High-Speed Binary-to-Residue Converter Design Using 2-Bit Segmentation of the Input Word

Publication

- Scientific Journal of Gdynia Maritime University - Year 2022

In this paper a new approach to the design of the high-speed binary-to-residue converter is proposed that allows the attaining of high pipelining rates by eliminating memories used in modulo m generators. The converter algorithm uses segmentation of the input binary word into 2-bit segments. The use and effects of the input word segmentation for the synthesis of converters for five-bit moduli are presented. For the number represented...

Full text available to download

Author Reply to “Regarding ‘Repair Augmentation of Unstable, Complete Vertical Meniscal Tears With Bone Marrow Venting Procedure: A Prospective, Randomized, Double-Blind, Parallel-Group, Placebo-Controlled Study’”

Publication

E. Trams
K. Kulinski
S. Pomianowski
R. Kaminski
K. Kozar-Kaminska

- ARTHROSCOPY-THE JOURNAL OF ARTHROSCOPIC AND RELATED SURGERY - Year 2022

Full text to download in external service

Short-Term Outcomes of Percutaneous Trephination with a Platelet Rich Plasma Intrameniscal Injection for the Repair of Degenerative Meniscal Lesions. A Prospective, Randomized, Double-Blind, Parallel-Group, Placebo-Controlled Study

Publication

R. Kaminski
M. Maksymowicz-Wleklik
K. Kulinski
K. Kozar-Kaminska
A. Dabrowska-Thing
S. Pomianowski

- International Journal of Molecular Sciences - Year 2019

Full text to download in external service

Search

Filters

Catalog

Category

Year

Options

Search results for: PARALLEL-PREFIX ADDER