Wyniki wyszukiwania dla: PARALLEL PERFORMANCE

Parallel frequency tracking with built-in performance evaluation

Publikacja

- DIGITAL SIGNAL PROCESSING - Rok 2013

The problem of estimation of instantaneous frequency of a nonstationary complex sinusoid (cisoid) buried in wideband noise is considered. The proposed approach employs a bank of adaptive notch filters, extended with a nontrivial performance assessment mechanism which automatically chooses the best performing filter in the bank. Additionally, a computationally attractive method of implementing the bank is proposed. The new structure...

Pełny tekst do pobrania w serwisie zewnętrznym

Parallel Programming for Modern High Performance Computing Systems

Publikacja

P. Czarnul

- Rok 2018

In view of the growing presence and popularity of multicore and manycore processors, accelerators, and coprocessors, as well as clusters using such computing devices, the development of efficient parallel applications has become a key challenge to be able to exploit the performance of such systems. This book covers the scope of parallel programming for modern high performance computing systems. It first discusses selected and...

Pełny tekst do pobrania w serwisie zewnętrznym

Performance evaluation of parallel background subtraction on GPU platforms

Publikacja

G. Szwoch

- Elektronika : konstrukcje, technologie, zastosowania - Rok 2015

Implementation of the background subtraction algorithm on parallel GPUs is presented. The algorithm processes video streams and extracts foreground pixels. The work focuses on optimizing parallel algorithm implementation by taking into account specific features of the GPU architecture, such as memory access, data transfers and work group organization. The algorithm is implemented in both OpenCL and CUDA. Various optimizations of...

Pełny tekst do pobrania w serwisie zewnętrznym

Performance Evaluation of the Parallel Codebook Algorithm for Background Subtraction in Video Stream

Publikacja

G. Szwoch

- Communications in Computer and Information Science - Rok 2011

A background subtraction algorithm based on the codebook approach was implemented on a multi-core processor in a parallel form, using the OpenMP system. The aim of the experiments was to evaluate performance of the multithreaded algorithm in processing video streams recorded from monitoring cameras, depending on a number of computer cores used, method of task scheduling, image resolution and degree of image content variability....

Pełny tekst do pobrania w serwisie zewnętrznym

Performance evaluation of the parallel object tracking algorithm employing the particle filter

Publikacja

G. Szwoch

- Rok 2016

An algorithm based on particle filters is employed to track moving objects in video streams from fixed and non-fixed cameras. Particle weighting is based on color histograms computed in the iHLS color space. Particle computations are parallelized with CUDA framework. The algorithm was tested on various GPU devices: a desktop GPU card, a mobile chipset and two embedded GPU platforms. The processing speed depending on the number...

Performance evaluation of the parallel object tracking algorithm employing the particle filter

Publikacja

G. Szwoch

- Rok 2016

Pełny tekst do pobrania w serwisie zewnętrznym

Performance/energy aware optimization of parallel applications on GPUs under power capping

Publikacja

- Rok 2020

In the paper we present an approach and results from application of the modern power capping mechanism available for NVIDIA GPUs to the bench- marks such as NAS Parallel Benchmarks BT, SP and LU as well as cublasgemm- benchmark which are widely used for assessment of high performance computing systems’ performance. Specifically, depending on the benchmarks, various power cap configurations are best for desired trade-off of performance...

Pełny tekst do pobrania w portalu

Performance evaluation of unified memory and dynamic parallelism for selected parallel CUDA applications

Publikacja

- JOURNAL OF SUPERCOMPUTING - Rok 2017

The aim of this paper is to evaluate performance of new CUDA mechanisms—unified memory and dynamic parallelism for real parallel applications compared to standard CUDA API versions. In order to gain insight into performance of these mechanisms, we decided to implement three applications with control and data flow typical of SPMD, geometric SPMD and divide-and-conquer schemes, which were then used for tests and experiments. Specifically,...

Pełny tekst do pobrania w portalu

Survey of Methodologies, Approaches, and Challenges in Parallel Programming Using High-Performance Computing Systems

Publikacja

- Scientific Programming - Rok 2020

This paper provides a review of contemporary methodologies and APIs for parallel programming, with representative technologies selected in terms of target system type (shared memory, distributed, and hybrid), communication patterns (one-sided and two-sided), and programming abstraction level. We analyze representatives in terms of many aspects including programming model, languages, supported platforms, license, optimization goals,...

Pełny tekst do pobrania w portalu

Performance Evaluation of Selected Parallel Object Detection and Tracking Algorithms on an Embedded GPU Platform

Publikacja

- Rok 2017

Performance evaluation of selected complex video processing algorithms, implemented on a parallel, embedded GPU platform Tegra X1, is presented. Three algorithms were chosen for evaluation: a GMM-based object detection algorithm, a particle filter tracking algorithm and an optical flow based algorithm devoted to people counting in a crowd flow. The choice of these algorithms was based on their computational complexity and parallel...

Pełny tekst do pobrania w serwisie zewnętrznym

Portable parallel simulator using MPI for 2D and 3D domains: design and performance testing

Publikacja

- Rok 2005

W artykule prezentujemy szczegóły projektowo-implementacyjne naszego modularnego kodu symulacyjnego z wykorzystaniem MPI, w tym nakładaniem obliczeń i komunikacji. Podkreślamy modularność naszej implementacji pozwalającą na łatwą adaptację kodu dla innych zasotosowań. Prezentujemy związek pomiędzy przyspieszeniem obliczeń, rozmiarem i kształtami trójwymiarowych domen z różnymi stosunkami liczby węzłów aktualizowanych przez procesor...

Performance Assessment of Using Docker for Selected MPI Applications in a Parallel Environment Based on Commodity Hardware

Publikacja

- Applied Sciences-Basel - Rok 2022

In the paper, we perform detailed performance analysis of three parallel MPI applications run in a parallel environment based on commodity hardware, using Docker and bare-metal configurations. The testbed applications are representative of the most typical parallel processing paradigms: master–slave, geometric Single Program Multiple Data (SPMD) as well as divide-and-conquer and feature characteristic computational and communication...

Pełny tekst do pobrania w portalu

Efficient parallel implementation of crowd simulation using a hybrid CPU+GPU high performance computing system

Publikacja

- SIMULATION MODELLING PRACTICE AND THEORY - Rok 2023

In the paper we present a modern efficient parallel OpenMP+CUDA implementation of crowd simulation for hybrid CPU+GPU systems and demonstrate its higher performance over CPU-only and GPU-only implementations for several problem sizes including 10 000, 50 000, 100 000, 500 000 and 1 000 000 agents. We show how performance varies for various tile sizes and what CPU–GPU load balancing settings shall be preferred for various domain...

Pełny tekst do pobrania w serwisie zewnętrznym

Investigation of Parallel Data Processing Using Hybrid High Performance CPU + GPU Systems and CUDA Streams

Publikacja

P. Czarnul

- COMPUTING AND INFORMATICS - Rok 2020

The paper investigates parallel data processing in a hybrid CPU+GPU(s) system using multiple CUDA streams for overlapping communication and computations. This is crucial for efficient processing of data, in particular incoming data stream processing that would naturally be forwarded using multiple CUDA streams to GPUs. Performance is evaluated for various compute time to host-device communication time ratios, numbers of CUDA streams,...

Pełny tekst do pobrania w portalu

Analyzing energy/performance trade-offs with power capping for parallel applications on modern multi and many core processors

Publikacja

- Annals of Computer Science and Information Systems - Rok 2018

In the paper we present extensive results from analyzing energy/performance trade-offs with power capping observed on four different modern CPUs, for three different parallel applications such as 2D heat distribution, numerical integration and Fast Fourier Transform. The CPU tested represent both multi-core type CPUs such as Intel⃝R Xeon⃝R E5, desktop and mobile i7 as well as many-core Intel⃝R Xeon PhiTM x200 but also server, desktop...

Pełny tekst do pobrania w portalu

Performance evaluation of Unified Memory with prefetching and oversubscription for selected parallel CUDA applications on NVIDIA Pascal and Volta GPUs

Publikacja

- JOURNAL OF SUPERCOMPUTING - Rok 2019

The paper presents assessment of Unified Memory performance with data prefetching and memory oversubscription. Several versions of code are used with: standard memory management, standard Unified Memory and optimized Unified Memory with programmer-assisted data prefetching. Evaluation of execution times is provided for four applications: Sobel and image rotation filters, stream image processing and computational fluid dynamic simulation,...

Pełny tekst do pobrania w portalu

Optimization of hybrid parallel application execution in heterogeneous high performance computing systems considering execution time and power consumption

Publikacja

P. Rościszewski

- Rok 2018

Many important computational problems require utilization of high performance computing (HPC) systems that consist of multi-level structures combining higher and higher numbers of devices with various characteristics. Utilizing full power of such systems requires programming parallel applications that are hybrid in two meanings: they can utilize parallelism on multiple levels at the same time and combine together programming interfaces...

Pełny tekst do pobrania w serwisie zewnętrznym

Benchmarking Performance of a Hybrid Intel Xeon/Xeon Phi System for Parallel Computation of Similarity Measures Between Large Vectors

Publikacja

P. Czarnul

- INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING - Rok 2016

The paper deals with parallelization of computing similarity measures between large vectors. Such computations are important components within many applications and consequently are of high importance. Rather than focusing on optimization of the algorithm itself, assuming specific measures, the paper assumes a general scheme for finding similarity measures for all pairs of vectors and investigates optimizations for scalability...

Pełny tekst do pobrania w portalu

Parallel immune system for graph coloring

Publikacja

J. Dąbrowski

- Rok 2008

This paper presents a parallel artificial immune system designed forgraph coloring. The algorithm is based on the clonal selection principle. Each processor operates on its own pool of antibodies and amigration mechanism is used to allow processors to exchange information. Experimental results show that migration improves the performance of the algorithm. The experiments were performed using a high performance cluster on a set...

Pełny tekst do pobrania w serwisie zewnętrznym

Modeling energy consumption of parallel applications

Publikacja

- Annals of Computer Science and Information Systems - Rok 2016

The paper presents modeling and simulation of energy consumption of two types of parallel applications: geometric Single Program Multiple Data (SPMD) and divide-and-conquer (DAC). Simulation is performed in a new MERPSYS environment. Model of an application uses the Java language with extension representing message exchange between processes working in parallel. Simulation is performed by running threads representing distinct process...

Pełny tekst do pobrania w portalu

NVRAM as Main Storage of Parallel File System

Publikacja

A. Malinowski

- Journal of Computer Science and Control Systems - Rok 2016

Modern cluster environments' main trouble used to be lack of computational power provided by CPUs and GPUs, but recently they suffer more and more from insufficient performance of input and output operations. Apart from better network infrastructure and more sophisticated processing algorithms, a lot of solutions base on emerging memory technologies. This paper presents evaluation of using non-volatile random-access memory as a...

Pełny tekst do pobrania w serwisie zewnętrznym

The parallel environment for endoscopic image analysis

Publikacja

- Rok 2002

The jPVM-oriented environment to support high performance computing required for the Endoscopy Recommender System (ERS) is defined. SPMD model of image matching is considered and its two implementations are proposed: Lexicographical Searching Algorithm (LSA) and Gradient Serching Algorithm (GSA). Three classes of experiments are considered and the relative degree of similarity and execution time of each algorithm are analysed....

Pełny tekst do pobrania w serwisie zewnętrznym

A Workflow Application for Parallel Processing of Big Data from an Internet Portal

Publikacja

P. Czarnul

- Rok 2014

The paper presents a workflow application for efficient parallel processing of data downloaded from an Internet portal. The workflow partitions input files into subdirectories which are further split for parallel processing by services installed on distinct computer nodes. This way, analysis of the first ready subdirectories can start fast and is handled by services implemented as parallel multithreaded applications using multiple...

Pełny tekst do pobrania w serwisie zewnętrznym

Block-based Representation of Application Execution on Modern Parallel Systems

Publikacja

P. Czarnul

- Rok 2013

The chapter presents how to model execution of a parallel computational application that is to be executed in a large-scale parallel or distributed environment with potentially thousands to millions of execution units. The representation uses pre- viously attributes and factors representative of modern high performance systems including multicore CPUs, GPUs, dedicated accelerators such as Intel Phi.

Multi-agent large-scale parallel crowd simulation

Publikacja

A. Malinowski
P. Czarnul
K. Czuryƚo
M. Maciejewski
P. Skowron

- Rok 2017

This paper presents design, implementation and performance results of a new modular, parallel, agent-based and large scale crowd simulation environment. A parallel application, implemented with C and MPI, was implemented and run in this parallel environment for simulation and visualization of an evacuation scenario at Gdansk University of Technology, Poland and further in the area of districts of Gdansk. The application uses a...

Pełny tekst do pobrania w serwisie zewnętrznym

Parallel implementation of a Sailing Assistance Application in a Cloud Environment

Publikacja

- IEEE Access - Rok 2023

Sailboat weather routing is a highly complex problem in terms of both the computational time and memory. The reason for this is a large search resulting in a multitude of possible routes and a variety of user preferences. Analysing all possible routes is only feasible for small sailing regions, low-resolution maps, or sailboat movements on a grid. Therefore, various heuristic approaches are often applied, which can find solutions...

Pełny tekst do pobrania w portalu

Simulation of Parallel Applications on Large-scale Distributed Systems

Publikacja

- Rok 2014

This chapter has a form of a review article in the field of simulating High-Performance Computing systems. We justify the need for a new versatile simulator considering heterogeneity, energy efficiency and reliability of HPC systems. We sketch the problems that need to be solved by such simulator and rationalize using discrete-event simulation for this purpose. Based on a review of existing discrete-event HPC simulation solutions...

Computer experiments with a parallel clonal selection algorithm for the graph coloring problem

Publikacja

- Rok 2008

Artificial immune systems (AIS) are algorithms that are based on the structure and mechanisms of the vertebrate immune system. Clonal selection is a process that allows lymphocytes to launch a quick response to known pathogens and to adapt to new, previously unencountered ones. This paper presents a parallel island model algorithm based on the clonal selection principles for solving the Graph Coloring Problem. The performance of...

Pełny tekst do pobrania w serwisie zewnętrznym

A Parallel Genetic Algorithm for Creating Virtual Portraits of Historical Figures

Publikacja

- TASK Quarterly - Rok 2012

In this paper we present a genetic algorithm (GA) for creating hypothetical virtual portraits of historical figures and other individuals whose facial appearance is unknown. Our algorithm uses existing portraits of random people from specific historical period and social background to evolve a set of face images potentially resembling the person whose image is to be found. We then use portraits of the person's relatives to judge...

Pełny tekst do pobrania w portalu

Assessment of OpenMP Master–Slave Implementations for Selected Irregular Parallel Applications

Publikacja

P. Czarnul

- Electronics - Rok 2021

The paper investigates various implementations of a master–slave paradigm using the popular OpenMP API and relative performance of the former using modern multi-core workstation CPUs. It is assumed that a master partitions available input into a batch of predefined number of data chunks which are then processed in parallel by a set of slaves and the procedure is repeated until all input data has been processed. The paper experimentally...

Pełny tekst do pobrania w portalu

Modern Platform for Parallel Algorithms Testing: Java on Intel Xeon Phi

Publikacja

A. Malinowski

- International Journal of Information Technology and Computer Science - Rok 2015

Parallel algorithms are popular method of increasing system performance. Apart from showing their properties using asymptotic analysis, proof-of-concept implementation and practical experiments are often required. In order to speed up the development and provide simple and easily accessible testing environment that enables execution of reliable experiments, the paper proposes a platform with multi-core computational accelerator:...

Pełny tekst do pobrania w serwisie zewnętrznym

Parallel multithread computing for spectroscopic analysis in optical coherence tomography

Publikacja

- Rok 2014

Spectroscopic Optical Coherence Tomography (SOCT) is an extension of Optical Coherence Tomography (OCT). It allows gathering spectroscopic information from individual scattering points inside the sample. It is based on time-frequency analysis of interferometric signals. Such analysis requires calculating hundreds of Fourier transforms while performing a single A-scan. Additionally, further processing of acquired spectroscopic information...

Pełny tekst do pobrania w serwisie zewnętrznym

A Solution to Image Processing with Parallel MPI I/O and Distributed NVRAM Cache

Publikacja

- Scalable Computing: Practice and Experience - Rok 2018

The paper presents a new approach to parallel image processing using byte addressable, non-volatile memory (NVRAM). We show that our custom built MPI I/O implementation of selected functions that use a distributed cache that incorporates NVRAMs located in cluster nodes can be used for efficient processing of large images. We demonstrate performance benefits of such a solution compared to a traditional implementation without NVRAM...

Pełny tekst do pobrania w portalu

Optimization of parallel implementation of UNRES package for coarse‐grained simulations to treat large proteins

Publikacja

A. Sieradzan
J. Sans‐Duñó
E. Lubecka
C. Czaplewski
A. Lipska
H. Leszczyński
K. Ocetkiewicz
J. Proficz
P. Czarnul
H. Krawczyk
A. Liwo

- JOURNAL OF COMPUTATIONAL CHEMISTRY - Rok 2023

We report major algorithmic improvements of the UNRES package for physics-based coarse-grained simulations of proteins. These include (i) introduction of interaction lists to optimize computations, (ii) transforming the inertia matrix to a pentadiagonal form to reduce computing and memory requirements, (iii) removing explicit angles and dihedral angles from energy expressions and recoding the most time-consuming energy/force terms...

Pełny tekst do pobrania w portalu

Parallel implementation of background subtraction algorithms for real-time video processing on a supercomputer platform

Publikacja

- Journal of Real-Time Image Processing - Rok 2016

Results of evaluation of the background subtraction algorithms implemented on a supercomputer platform in a parallel manner are presented in the paper. The aim of the work is to chose an algorithm, a number of threads and a task scheduling method, that together provide satisfactory accuracy and efficiency of a real-time processing of high resolution camera images, maintaining the cost of resources usage at a reasonable level. Two...

Pełny tekst do pobrania w portalu

Optimization of Execution Time under Power Consumption Constraints in a Heterogeneous Parallel System with GPUs and CPUs

Publikacja

- Rok 2014

The paper proposes an approach for parallelization of computations across a collection of clusters with heterogeneous nodes with both GPUs and CPUs. The proposed system partitions input data into chunks and assigns to par- ticular devices for processing using OpenCL kernels defined by the user. The sys- tem is able to minimize the execution time of the application while maintaining the power consumption of the utilized GPUs and...

Pełny tekst do pobrania w serwisie zewnętrznym

Modeling and Simulation for Exploring Power/Time Trade-off of Parallel Deep Neural Network Training

Publikacja

P. Rościszewski

- Procedia Computer Science - Rok 2017

In the paper we tackle bi-objective execution time and power consumption optimization problem concerning execution of parallel applications. We propose using a discrete-event simulation environment for exploring this power/time trade-off in the form of a Pareto front. The solution is verified by a case study based on a real deep neural network training application for automatic speech recognition. A simulation lasting over 2 hours...

Pełny tekst do pobrania w portalu

Multi-agent large-scale parallel crowd simulation with NVRAM-based distributed cache

Publikacja

- Journal of Computational Science - Rok 2019

This paper presents the architecture, main components and performance results for a parallel and modu-lar agent-based environment aimed at crowd simulation. The environment allows to simulate thousandsor more agents on maps of square kilometers or more, features a modular design and incorporates non-volatile RAM (NVRAM) with a fail-safe mode that can be activated to allow to continue computationsfrom a recently analyzed state in...

Pełny tekst do pobrania w serwisie zewnętrznym

DL_MG: A Parallel Multigrid Poisson and Poisson–Boltzmann Solver for Electronic Structure Calculations in Vacuum and Solution

Publikacja

J. Womack
L. Anton
J. Dziedzic
P. Hasnip
M. Probert
C. Skylaris

- Journal of Chemical Theory and Computation - Rok 2018

The solution of the Poisson equation is a crucial step in electronic structure calculations, yielding the electrostatic potential -- a key component of the quantum mechanical Hamiltonian. In recent decades, theoretical advances and increases in computer performance have made it possible to simulate the electronic structure of extended systems in complex environments. This requires the solution of more complicated variants of the...

Pełny tekst do pobrania w portalu

Optimization of Data Assignment for Parallel Processing in a Hybrid Heterogeneous Environment Using Integer Linear Programming

Publikacja

- COMPUTER JOURNAL - Rok 2021

In the paper we investigate a practical approach to application of integer linear programming for optimization of data assignment to compute units in a multi-level heterogeneous environment with various compute devices, including CPUs, GPUs and Intel Xeon Phis. The model considers an application that processes a large number of data chunks in parallel on various compute units and takes into account computations, communication including...

Pełny tekst do pobrania w portalu

Minimizing Distribution and Data Loading Overheads in Parallel Training of DNN Acoustic Models with Frequent Parameter Averaging

Publikacja

- Rok 2017

In the paper we investigate the performance of parallel deep neural network training with parameter averaging for acoustic modeling in Kaldi, a popular automatic speech recognition toolkit. We describe experiments based on training a recurrent neural network with 4 layers of 800 LSTM hidden states on a 100-hour corpora of annotated Polish speech data. We propose a MPI-based modiﬁcation of the training program which minimizes the...

Pełny tekst do pobrania w serwisie zewnętrznym

Auto-tuning methodology for configuration and application parameters of hybrid CPU + GPU parallel systems based on expert knowledge

Publikacja

- Rok 2020

Auto-tuning of configuration and application param- eters allows to achieve significant performance gains in many contemporary compute-intensive applications. Feasible search spaces of parameters tend to become too big to allow for exhaustive search in the auto-tuning process. Expert knowledge about the utilized computing systems becomes useful to prune the search space and new methodologies are needed in the face of emerging heterogeneous...

Pełny tekst do pobrania w portalu

Feedline Alterations for Optimization-Based Design of Compact Super-Wideband MIMO Antennas in Parallel Configuration

Publikacja

M. ul Haq
S. Kozieł

- IEEE Antennas and Wireless Propagation Letters - Rok 2019

This letter presents a technique for size reduction of wideband multiple-input-multiple-output (MIMO) antennas. Our approach is a two-stage procedure. At the first stage, the antenna structure is modified to improve its impedance matching. This is achieved through incorporation of an n-section tapered feedline, followed by reoptimization of geometry parameters. Reducing the maximum in-band reflection well beyond the acceptance...

Pełny tekst do pobrania w serwisie zewnętrznym

Numerical Study on Mitigation of Flow Maldistribution in Parallel Microchannel Heat Sink: Channels Variable Width Versus Variable Height Approach

Publikacja

R. Kumar
G. Singh
D. Mikielewicz

- JOURNAL OF ELECTRONIC PACKAGING - Rok 2019

Microchannel heat sink on one hand enjoys benefits of intensified several folds heat transfer performance but on the other hand has to suffer aggravated form of trifling limitations associated with imperfect hydrodynamics and heat transfer behavior. Flow maldistribution is one of such limitation that exaggerates temperature nonuniformity across parallel microchannels leading to increase in maximum base temperature. Recently, variable...

Pełny tekst do pobrania w serwisie zewnętrznym

Adaptive system for recognition of sounds indicating threats to security of people and property employing parallel processing of audio data streams

Publikacja

K. Łopatka

- Rok 2015

A system for recognition of threatening acoustic events employing parallel processing on a supercomputing cluster is featured. The methods for detection, parameterization and classication of acoustic events are introduced. The recognition engine is based onthreshold-based detection with adaptive threshold and Support Vector Machine classifcation. Spectral, temporal and mel-frequency descriptors are used as signal features. The...

Massively parallel linear-scaling Hartree–Fock exchange and hybrid exchange–correlation functionals with plane wave basis set accuracy

Publikacja

J. Dziedzic
J. C. Womack
R. Ali
C. Skylaris

- JOURNAL OF CHEMICAL PHYSICS - Rok 2021

We extend our linear-scaling approach for the calculation of Hartree–Fock exchange energy using localized in situ optimized orbitals [Dziedzic et al., J. Chem. Phys. 139, 214103 (2013)] to leverage massive parallelism. Our approach has been implemented in the ONETEP (Order-N Electronic Total Energy Package) density functional theory framework, which employs a basis of non-orthogonal generalized Wannier functions (NGWFs) to achieve...

Pełny tekst do pobrania w portalu

Parallel Cooperating A-Teams

Publikacja

D. Barbucha
I. Czarnowski
P. Jędrzejowicz
E. Ratajczak-Ropel
I. Wierzbowska

- Rok 2011

Pełny tekst do pobrania w serwisie zewnętrznym

Modeling Parallel Applications in the MERPSYS Environment

Publikacja

P. Czarnul

- Rok 2016

The chapter presents how to model parallel computational applications for which simulation of execution in a large-scale parallel or distributed environment is performed within the MERPSYS environment. Specifically, it is shown what approaches can be adopted to model key paradigms often used for parallel applications: master-slave, geometric parallelism (single program multiple data), pipelined and divide-and-conquer applications....

Recognition of hazardous acoustic events employing parallel processing on a supercomputing cluster . Rozpoznawanie niebezpiecznych zdarzeń dźwiękowych z wykorzystaniem równoległego przetwarzania na klastrze superkomputerowym

Publikacja

- Rok 2015

A method for automatic recognition of hazardous acoustic events operating on a super computing cluster is introduced. The methods employed for detecting and classifying the acoustic events are outlined. The evaluation of the recognition engine is provided: both on the training set and using real-life signals. The algorithms yield sufficient performance in practical conditions to be employed in security surveillance systems. The...

Parallel processing of multimedia streams

Publikacja

- Rok 2010

W artykule zaprezentowana jest nowa biblioteka wspierającą tworzenie zadań obliczeniowych, część platformy KASKADA.Przedstawiony został projekt biblioteki, uwzględniający diagram głównych klas oraz diagram sekwencji. Drugi z diagramów ukazuje współpracę głównych klas w procesie przetwarzania strumieni multimedialnych. W dalszej częsci omówione zostały szczegły mechanizmu komunikacji międzyzadawniowej oraz przedstawiony został graf...

Wyszukiwarka

Filtry

Katalog

Wyniki wyszukiwania dla: PARALLEL PERFORMANCE