Search results for: PARALLEL-PREFIX ADDER


Total results: 761


Algorytmy równoległe i rozproszone / Parallel and distributed algorithms
Online Courses
- J. Cychnerski
- M. Matuszek
- P. Kaczmarek
- P. Czarnul
Optimization of Execution Time under Power Consumption Constraints in a Heterogeneous Parallel System with GPUs and CPUs
Publication
- P. Czarnul
- P. Rościszewski
- Year 2014
The paper proposes an approach for parallelization of computations across a collection of clusters with heterogeneous nodes with both GPUs and CPUs. The proposed system partitions input data into chunks and assigns them to particular devices for processing using OpenCL kernels defined by the user. The system is able to minimize the execution time of the application while maintaining the power consumption of the utilized GPUs and...

Full text available for download from an external service
A Parallel Corpus-Based Approach to the Crime Event Extraction for Low-Resource Languages
Publication
- N. Khairova
- O. Mamyrbayev
- N. Rizun
- M. Razno
- G. Ybytayeva
- IEEE Access - Year 2023
These days, a lot of crime-related events take place all over the world. Most of them are reported in news portals and social media. Crime-related event extraction from the published texts can allow monitoring, analysis, and comparison of police or criminal activities in different countries or regions. Existing approaches to event extraction mainly suggest processing texts in English, French, Chinese, and some other resource-rich...

Full text available for download in the portal
Dynamic Data Management Among Multiple Databases for Optimization of Parallel Computations in Heterogeneous HPC Systems
Publication
- P. Rościszewski
- Year 2014
Rapid development of diverse computer architectures and hardware accelerators means that designing parallel systems faces new problems resulting from their heterogeneity. Our implementation of a parallel system called KernelHive allows applications to be run efficiently in a heterogeneous environment consisting of multiple collections of nodes with different types of computing devices. The execution engine of the system is open for...

Full text available for download from an external service
ACM Transactions on Parallel Computing

Journals

ISSN: 2329-4949, eISSN: 2329-4957
Planning optimised multi-tasking operations under the capability for parallel machining
Publication
- M. Siemiątkowski
- M. Deja
- JOURNAL OF MANUFACTURING SYSTEMS - Year 2021
The advent of advanced multi-tasking machines (MTMs) in the metalworking industry has provided the opportunity for more efficient parallel machining as compared to traditional sequential processing. It entailed the need for developing appropriate reasoning schemes for efficient process planning to take advantage of machining capabilities inherent in these machines. This paper addresses an adequate methodical approach for a non-linear...

Full text available for download in the portal
Benchmarking Performance of a Hybrid Intel Xeon/Xeon Phi System for Parallel Computation of Similarity Measures Between Large Vectors
Publication
- P. Czarnul
- INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING - Year 2016
The paper deals with parallelization of computing similarity measures between large vectors. Such computations are important components within many applications and consequently are of high importance. Rather than focusing on optimization of the algorithm itself, assuming specific measures, the paper assumes a general scheme for finding similarity measures for all pairs of vectors and investigates optimizations for scalability...

Full text available for download in the portal
Effective configuration of a double triad planar parallel manipulator for precise positioning of heavy details during their assembling process
Publication
- K. Lipiński
- Year 2019
In the paper, dynamics analysis of a parallel manipulator is presented. It is an atypical manipulator, designed to help in assembling heavy industrial constructions. A few atypical properties are required: small workspace; slow velocities; high loads. Initially, a short discussion of the definition of parallel manipulators is presented, as well as a sketch of the proposed structure. In parallel, some definitions, assumptions...

Full text available for download in the portal
Experimental Research on the Energy Efficiency of a Parallel Hybrid Drive for an Inland Ship
Publication
- ENERGIES - Year 2019
The growing requirements for limiting the negative impact of all modes of transport on the natural environment mean that clean technologies are becoming more and more important. The global trend of e-mobility also applies to sea and inland water transport. This article presents the results of experimental tests carried out on a life-size, parallel diesel-electric hybrid propulsion system. The efficiency of the propulsion system was...

Full text available for download in the portal
Benchmarking Parallel Chess Search in Stockfish on Intel Xeon and Intel Xeon Phi Processors
Publication
- P. Czarnul
- Year 2018
The paper presents results from benchmarking the parallel multithreaded Stockfish chess engine on selected multi- and many-core processors. It is shown how the strength of play for an n-thread version compares to 1-thread version on both Intel Xeon and latest Intel Xeon Phi x200 processors. Results such as the number of wins, losses and draws are presented and how these change for growing numbers of threads. Impact of using particular...

Full text available for download from an external service
Coordination in serial-parallel image processing
Publication
- W. Wójcik
- V. Dubovoi
- M. Duda
- R. Romaniuk
- L. Yesmakhanova
- A. Kozbakova
- R. S. Romaniuk
- Year 2015
Full text available for download from an external service
The parallel environment for endoscopic image analysis
Publication
- H. Krawczyk
- A. Neyman
- M. Nowikowski
- J. Saif
- Year 2002
The jPVM-oriented environment to support high performance computing required for the Endoscopy Recommender System (ERS) is defined. The SPMD model of image matching is considered and two implementations of it are proposed: Lexicographical Searching Algorithm (LSA) and Gradient Searching Algorithm (GSA). Three classes of experiments are considered and the relative degree of similarity and execution time of each algorithm are analysed....

Full text available for download from an external service
Scheduling with Complete Multipartite Incompatibility Graph on Parallel Machines: Complexity and Algorithms
Publication
- T. Pikies
- K. Turowski
- M. Kubale
- ARTIFICIAL INTELLIGENCE - Year 2022
In this paper, the problem of scheduling on parallel machines with a presence of incompatibilities between jobs is considered. The incompatibility relation can be modeled as a complete multipartite graph in which each edge denotes a pair of jobs that cannot be scheduled on the same machine. The paper provides several results concerning schedules, optimal or approximate with respect to the two most popular criteria of optimality:...

Full text available for download from an external service
Low-Power Receivers for Wireless Capacitive Coupling Transmission in 3-D-Integrated Massively Parallel CMOS Imager
Publication
- IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS - Year 2020
The paper presents pixel receivers for massively parallel transmission of video signal between capacitive coupled integrated circuits (ICs). The receivers meet the key requirements for massively parallel transmission, namely low-power consumption below a single μW, small area of less than 205 μm², high sensitivity better than 160 mV, and good immunity to crosstalk. The receivers were implemented and measured in a 3-D IC (two face-to-face...

Full text available for download in the portal
Performance/energy aware optimization of parallel applications on GPUs under power capping
Publication
- A. Krzywaniak
- P. Czarnul
- Year 2020
In the paper we present an approach and results from application of the modern power capping mechanism available for NVIDIA GPUs to benchmarks such as NAS Parallel Benchmarks BT, SP and LU as well as cublasgemm-benchmark, which are widely used for assessment of high performance computing systems’ performance. Specifically, depending on the benchmarks, various power cap configurations are best for desired trade-off of performance...

Full text available for download in the portal
Mechanism of recognition of parallel G-quadruplexes by DEAH/RHAU helicase DHX36 explored by molecular dynamics simulations
Publication
- K. A. Hossain
- M. Jurkowski
- J. Czub
- M. Kogut
- Computational and Structural Biotechnology Journal - Year 2021
Because of high stability and slow unfolding rates of G-quadruplexes (G4), cells have evolved specialized helicases that disrupt these non-canonical DNA and RNA structures in an ATP-dependent manner. One example is DHX36, a DEAH-box helicase, which participates in gene expression and replication by recognizing and unwinding parallel G4s. Here, we studied the molecular basis for the high affinity and specificity of DHX36 for parallel-type...

Full text available for download in the portal
A CMOS Pixel With Embedded ADC, Digital CDS and Gain Correction Capability for Massively Parallel Imaging Array
Publication
- IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS - Year 2017
In the paper, a CMOS pixel has been proposed for imaging arrays with massively parallel image acquisition and simultaneous compensation of dark signal nonuniformity (DSNU) as well as photoresponse nonuniformity (PRNU). In our solution the pixel contains all necessary functional blocks: a photosensor and an analog-to-digital converter (ADC) with built-in correlated double sampling (CDS) integrated together. It is implemented in...

Full text available for download in the portal
Efficient parallel implementation of crowd simulation using a hybrid CPU+GPU high performance computing system
Publication
- J. Skrzypczak
- P. Czarnul
- SIMULATION MODELLING PRACTICE AND THEORY - Year 2023
In the paper we present a modern efficient parallel OpenMP+CUDA implementation of crowd simulation for hybrid CPU+GPU systems and demonstrate its higher performance over CPU-only and GPU-only implementations for several problem sizes including 10 000, 50 000, 100 000, 500 000 and 1 000 000 agents. We show how performance varies for various tile sizes and what CPU–GPU load balancing settings shall be preferred for various domain...

Full text available for download from an external service
Performance evaluation of unified memory and dynamic parallelism for selected parallel CUDA applications
Publication
- Ł. Jarząbek
- P. Czarnul
- JOURNAL OF SUPERCOMPUTING - Year 2017
The aim of this paper is to evaluate performance of new CUDA mechanisms—unified memory and dynamic parallelism for real parallel applications compared to standard CUDA API versions. In order to gain insight into performance of these mechanisms, we decided to implement three applications with control and data flow typical of SPMD, geometric SPMD and divide-and-conquer schemes, which were then used for tests and experiments. Specifically,...

Full text available for download in the portal
Modeling and Simulation for Exploring Power/Time Trade-off of Parallel Deep Neural Network Training
Publication
- P. Rościszewski
- Procedia Computer Science - Year 2017
In the paper we tackle bi-objective execution time and power consumption optimization problem concerning execution of parallel applications. We propose using a discrete-event simulation environment for exploring this power/time trade-off in the form of a Pareto front. The solution is verified by a case study based on a real deep neural network training application for automatic speech recognition. A simulation lasting over 2 hours...

Full text available for download in the portal
Adaptive system for recognition of sounds indicating threats to security of people and property employing parallel processing of audio data streams
Publication
- K. Łopatka
- Year 2015
A system for recognition of threatening acoustic events employing parallel processing on a supercomputing cluster is featured. The methods for detection, parameterization and classification of acoustic events are introduced. The recognition engine is based on threshold-based detection with adaptive threshold and Support Vector Machine classification. Spectral, temporal and mel-frequency descriptors are used as signal features. The...
Parallel query processing and edge ranking of graphs
Publication
- D. Dereniowski
- M. Kubale
- Year 2006
The article addresses the problem of finding a spanning tree with the minimum ordered chromatic index. One application is the search for optimal schedules in parallel query processing in relational databases. We give a new bound on the performance ratio of the approximation algorithm by Makino, Uno and Ibaraki, together with the results of computational tests carried out on random graphs.

Full text available for download from an external service
Parallel processing subsystems with redundancy in a distributed environment
Publication
- A. Kosowski
- M. Małafiejski
- P. Żyliński
- Year 2006
The paper considers the problem of partitioning a distributed system into connected subsystems of at least three units each, which allow single errors to be detected and corrected. It is shown that the problem of maximizing the number of such units is NP-hard even for biconnected cubic network topologies. New approximation algorithms are also given.

Full text available for download from an external service
Parallel tabu search for graph coloring problem
Publication
- J. Dąbrowski
- M. Kubale
- Year 2006
Tabu search is a simple, yet powerful meta-heuristic based on local search that has often been used to solve combinatorial optimization problems like the graph coloring problem. This paper presents the current taxonomy of parallel tabu search algorithms and compares three parallelization techniques applied to Tabucol, a sequential TS algorithm for graph coloring. The experimental results are based on graphs available from the DIMACS...
Parallel computations in the volunteer based Comcute system
Publication
- Year 2014
The paper presents Comcute, which is a novel multi-level implementation of the volunteer based computing paradigm. Comcute was designed to let users donate the computing power of their PCs in a simplified manner, requiring only pointing their web browser at a specific web address and clicking a mouse. The server side appoints several servers to be in charge of execution of particular tasks. Thanks to that the system can survive...

Full text available for download from an external service
Decentralized control of a UPS systems operating in parallel
Publication
- R. Strzelecki
- D. Vinnikov
- Year 2008
Full text available for download from an external service
Efficient parallel query processing by graph ranking
Publication
- D. Dereniowski
- M. Kubale
- FUNDAMENTA INFORMATICAE - Year 2006
In the article we analyze an approximation algorithm for the problem of finding a spanning tree with the minimum ordered chromatic index, which has applications in parallel query processing in relational databases. We give a new bound on the ordered chromatic index of a tree, which leads to a better performance ratio for this algorithm.
Cholesky factorization of matrices in parallel and ranking of graphs.
Publication
- D. Dereniowski
- M. Kubale
- Year 2004
Ordered coloring is applied in parallel Cholesky factorization of matrices. The paper describes this application. It also gives algorithms for optimal ordered edge coloring of certain classes of graphs: complete bipartite graphs and graphs obtained from complete bipartite graphs by removing O(log n) edges.
Scattering by parallel cylindrical posts with conducting strips.
Publication
- Year 2004
A theory of electromagnetic wave scattering by metallized cylindrical objects in open and closed regions was developed using a modified iterative procedure and the mode-matching method. The analysis yielded the scattered field for open structures and the frequency responses of the reflection and transmission coefficients in rectangular waveguide junctions. Rotation and displacement described...
Fabrication method of parallel mesoporous carbon nanotubes
Publication
- X. Chen
- K. Cendrowski
- J. Srenscek-Nazzal
- M. Rümmeli
- R. Kalenczuk
- H. Chen
- P. Chu
- E. Borowiak-Palen
- E. Mijowska
- Colloids and Surfaces A: Physicochemical and Engineering Aspects - Year 2011
Full text available for download from an external service
Merging Images from Parallel Depth Cameras
Publication
- Zeszyty Naukowe Wydziału ETI Politechniki Gdańskiej. Technologie Informacyjne - Year 2014
In this paper a problem of simultaneous information acquisition from multiple depth cameras is investigated, aiming at obtaining a single overall picture containing information from all cameras. The experiments are carried out on Microsoft Kinect devices. A methodology for merging images from multiple cameras positioned in a line is proposed. The method is based on the concept of simulating a view of an imaginary camera covering...
Merging Images From Parallel Depth Cameras
Publication
- A. Brzeski
- P. Dorożyński
- K. Dziubich
- T. Dziubich
- Year 2012
In this paper a problem of simultaneous information acquisition from multiple depth cameras is investigated, aiming at obtaining a single overall picture containing information from all cameras. The experiments are carried out on Microsoft Kinect devices. A methodology for merging images from multiple cameras positioned in a line is proposed. The method is based on the concept of simulating a view of an imaginary camera covering...
Parallel frequency tracking with built-in performance evaluation
Publication
- M. Meller
- M. Niedźwiecki
- DIGITAL SIGNAL PROCESSING - Year 2013
The problem of estimation of instantaneous frequency of a nonstationary complex sinusoid (cisoid) buried in wideband noise is considered. The proposed approach employs a bank of adaptive notch filters, extended with a nontrivial performance assessment mechanism which automatically chooses the best performing filter in the bank. Additionally, a computationally attractive method of implementing the bank is proposed. The new structure...

Full text available for download from an external service
Auto-tuning methodology for configuration and application parameters of hybrid CPU + GPU parallel systems based on expert knowledge
Publication
- P. Czarnul
- P. Rościszewski
- Year 2020
Auto-tuning of configuration and application parameters makes it possible to achieve significant performance gains in many contemporary compute-intensive applications. Feasible search spaces of parameters tend to become too big to allow for exhaustive search in the auto-tuning process. Expert knowledge about the utilized computing systems becomes useful to prune the search space and new methodologies are needed in the face of emerging heterogeneous...

Full text available for download in the portal
Optimization of parallel implementation of UNRES package for coarse‐grained simulations to treat large proteins
Publication
- A. Sieradzan
- J. Sans‐Duñó
- E. Lubecka
- C. Czaplewski
- A. Lipska
- H. Leszczyński
- K. Ocetkiewicz
- J. Proficz
- P. Czarnul
- H. Krawczyk
- A. Liwo
- JOURNAL OF COMPUTATIONAL CHEMISTRY - Year 2023
We report major algorithmic improvements of the UNRES package for physics-based coarse-grained simulations of proteins. These include (i) introduction of interaction lists to optimize computations, (ii) transforming the inertia matrix to a pentadiagonal form to reduce computing and memory requirements, (iii) removing explicit angles and dihedral angles from energy expressions and recoding the most time-consuming energy/force terms...

Full text available for download in the portal
Optimization of hybrid parallel application execution in heterogeneous high performance computing systems considering execution time and power consumption
Publication
- P. Rościszewski
- Year 2018
Many important computational problems require utilization of high performance computing (HPC) systems that consist of multi-level structures combining higher and higher numbers of devices with various characteristics. Utilizing full power of such systems requires programming parallel applications that are hybrid in two meanings: they can utilize parallelism on multiple levels at the same time and combine together programming interfaces...

Full text available for download from an external service
Multi-agent large-scale parallel crowd simulation with NVRAM-based distributed cache
Publication
- A. Malinowski
- P. Czarnul
- Journal of Computational Science - Year 2019
This paper presents the architecture, main components and performance results for a parallel and modular agent-based environment aimed at crowd simulation. The environment makes it possible to simulate thousands or more agents on maps of square kilometers or more, features a modular design and incorporates non-volatile RAM (NVRAM) with a fail-safe mode that can be activated to allow computations to continue from a recently analyzed state in...

Full text available for download from an external service
Infrared techniques for natural convection investigations in channels between two vertical, parallel, isothermal and symmetrically heated plates
Publication
- INTERNATIONAL JOURNAL OF HEAT AND MASS TRANSFER - Year 2017
The effect of the gap width between two symmetrically heated vertical, parallel, isothermal plates on intensity of natural convective heat transfer in a gas (Pr = 0.71) was experimentally studied using the balance and gradient methods. In the former method heat fluxes were determined based on measurements of the voltage and electric current supplying the heaters placed inside the walls. In the latter, heat fluxes were calculated...

Full text available for download in the portal
Numerical Study on Mitigation of Flow Maldistribution in Parallel Microchannel Heat Sink: Channels Variable Width Versus Variable Height Approach
Publication
- R. Kumar
- G. Singh
- D. Mikielewicz
- JOURNAL OF ELECTRONIC PACKAGING - Year 2019
A microchannel heat sink on the one hand benefits from heat transfer performance intensified several fold, but on the other hand suffers from an aggravated form of trifling limitations associated with imperfect hydrodynamics and heat transfer behavior. Flow maldistribution is one such limitation; it exaggerates temperature nonuniformity across parallel microchannels, leading to an increase in maximum base temperature. Recently, variable...

Full text available for download from an external service
A Fail-Safe NVRAM Based Mechanism for Efficient Creation and Recovery of Data Copies in Parallel MPI Applications
Publication
- A. Malinowski
- P. Czarnul
- M. Maciejewski
- P. Skowron
- Year 2016
The paper presents a fail-safe NVRAM based mechanism for creation and recovery of data copies during parallel MPI application runtime. Specifically, we target a cluster environment in which each node has an NVRAM installed in it. Our previously developed extension to the MPI I/O API can take advantage of NVRAM regions in order to provide an NVRAM based cache like mechanism to significantly speed up I/O operations and allow to preload...

Full text available for download from an external service
International Journal of Parallel and Distributed Systems

Journals

ISSN: 2277-1638
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS

Journals

ISSN: 1045-9219, eISSN: 1558-2183
Highly parallel distributed computing systems with optical interconnections
Publication
- J. Just
- R. Romaniuk
- R. S. Romaniuk
- Microprocessing and Microprogramming - Year 1989
Full text available for download from an external service
Highly Parallel Distributed Computing System With Optical Interconnections
Publication
- J. Just
- R. Romaniuk
- R. S. Romaniuk
- Year 1990
Full text available for download from an external service
Review of parallel computing methods and tools for FPGA technology
Publication
- R. Cieszewski
- M. Linczuk
- K. Pozniak
- R. Romaniuk
- R. S. Romaniuk
- Year 2013
Full text available for download from an external service
A probe for immittance spectroscopy based on the parallel electrode technique.
Publication
- J. Wtorek
- A. Bujnowski
- A. Poliński
- L. Józefiak
- B. Truyen
- Year 2004
The paper presents the design of a probe for immittance spectroscopy measurements. The probe constant was calculated and the value was verified experimentally. Example results of in vivo measurements are shown.
Fashion and Tourism: Parallel Stories of Two "Dream Marvels".
Publication
- M. Gravari-Barbas
- N. Sabatini
- Year 2023
Fashion and tourism are two social, cultural, and economic phenomena that have both numerous connections and surprising similarities. These are not new: they have been built and developed since the beginnings of tourism as a modern social phenomenon, emerged in Europe in the context of the industrial revolution. They consolidated in the first decades of the 21st century, in a context where both phenomena have completed their “mass”...

Full text available for download from an external service
Simulation of Parallel Applications on Large-scale Distributed Systems
Publication
- P. Rościszewski
- P. Sidorczak
- Year 2014
This chapter takes the form of a review article in the field of simulating High-Performance Computing systems. We justify the need for a new versatile simulator considering heterogeneity, energy efficiency and reliability of HPC systems. We sketch the problems that need to be solved by such a simulator and give the rationale for using discrete-event simulation for this purpose. Based on a review of existing discrete-event HPC simulation solutions...
Validation of atmospheric aerosols parallel sampling in a multifold device
Publication
- C. M. Oliveira
- M. Camoes
- P. Bigus
- A. Fachado
- R. Silva
- ENVIRONMENTAL MONITORING AND ASSESSMENT - Year 2015
In this work, particulate matter was collected using an active sampling system consisting of a PM10 (<10 μm) inlet coupled to a multifold device containing six channels, connected to a vacuum pump. Each channel was equipped with a filter holder fitted with adequately chosen filters. The system was fixed on a metallic structure, which was placed on the roof of the laboratory building, at the Faculty of Sciences, in Lisbon. Sampling...

Full text available for download from an external service
Optimization of Data Assignment for Parallel Processing in a Hybrid Heterogeneous Environment Using Integer Linear Programming
Publication
- T. M. Boiński
- P. Czarnul
- COMPUTER JOURNAL - Year 2021
In the paper we investigate a practical approach to application of integer linear programming for optimization of data assignment to compute units in a multi-level heterogeneous environment with various compute devices, including CPUs, GPUs and Intel Xeon Phis. The model considers an application that processes a large number of data chunks in parallel on various compute units and takes into account computations, communication including...

Full text available for download in the portal
