Search results for: parallel processing

Parallel multithread computing for spectroscopic analysis in optical coherence tomography

Publication

- Year 2014

Spectroscopic Optical Coherence Tomography (SOCT) is an extension of Optical Coherence Tomography (OCT). It allows gathering spectroscopic information from individual scattering points inside the sample. It is based on time-frequency analysis of interferometric signals. Such analysis requires calculating hundreds of Fourier transforms while performing a single A-scan. Additionally, further processing of acquired spectroscopic information...

Full text to download in external service

Image Processing Techniques for Distributed Grid Applications

Publication

P. Brudło

- Year 2012

Parallel approaches to 2D and 3D convolution processing of series of images have been presented. A distributed, practically oriented, 2D spatial convolution scheme has been elaborated and extended into the temporal domain. Complexity of the scheme has been determined and analysed with respect to coefficients in convolution kernels. Possibilities of parallelisation of the convolution operations have been analysed and the results...

Performance evaluation of parallel background subtraction on GPU platforms

Publication

G. Szwoch

- Elektronika : konstrukcje, technologie, zastosowania - Year 2015

Implementation of the background subtraction algorithm on parallel GPUs is presented. The algorithm processes video streams and extracts foreground pixels. The work focuses on optimizing parallel algorithm implementation by taking into account specific features of the GPU architecture, such as memory access, data transfers and work group organization. The algorithm is implemented in both OpenCL and CUDA. Various optimizations of...

Full text to download in external service

Performance Evaluation of Selected Parallel Object Detection and Tracking Algorithms on an Embedded GPU Platform

Publication

- Year 2017

Performance evaluation of selected complex video processing algorithms, implemented on a parallel, embedded GPU platform Tegra X1, is presented. Three algorithms were chosen for evaluation: a GMM-based object detection algorithm, a particle filter tracking algorithm and an optical flow based algorithm devoted to people counting in a crowd flow. The choice of these algorithms was based on their computational complexity and parallel...

Full text to download in external service

Parallel Background Subtraction in Video Streams Using OpenCL on GPU Platforms

Publication

G. Szwoch

- Year 2014

Implementation of the background subtraction algorithm using the OpenCL platform is presented. The algorithm processes a live stream of video frames from the surveillance camera in on-line mode. Processing is performed using a host machine and a parallel computing device. The work focuses on optimizing an OpenCL algorithm implementation for GPU devices by taking into account specific features of the GPU architecture, such as memory access,...

Full text to download in external service

Acceleration of the DGF-FDTD method on GPU using the CUDA technology

Publication

- Year 2015

We present a parallel implementation of the discrete Green's function formulation of the finite-difference time-domain (DGF-FDTD) method on a graphics processing unit (GPU). The compute unified device architecture (CUDA) parallel computing platform is applied in the developed implementation. For the sake of example, arrays of Yagi-Uda antennas were simulated with the use of DGF-FDTD on GPU. The efficiency of parallel computations...

Full text to download in external service

NVRAM as Main Storage of Parallel File System

Publication

A. Malinowski

- Journal of Computer Science and Control Systems - Year 2016

The main problem of modern cluster environments used to be a lack of computational power provided by CPUs and GPUs, but recently they have suffered more and more from insufficient performance of input and output operations. Apart from better network infrastructure and more sophisticated processing algorithms, many solutions are based on emerging memory technologies. This paper presents an evaluation of using non-volatile random-access memory as a...

Full text to download in external service

Performance evaluation of the parallel object tracking algorithm employing the particle filter

Publication

G. Szwoch

- Year 2016

An algorithm based on particle filters is employed to track moving objects in video streams from fixed and non-fixed cameras. Particle weighting is based on color histograms computed in the iHLS color space. Particle computations are parallelized with CUDA framework. The algorithm was tested on various GPU devices: a desktop GPU card, a mobile chipset and two embedded GPU platforms. The processing speed depending on the number...

From Sequential to Parallel Implementation of NLP Using the Actor Model

Publication

- Advances in Intelligent Systems and Computing - Year 2018

The article focuses on presenting methods allowing easy parallelization of an existing, sequential Natural Language Processing (NLP) application within a multi-core system. The actor-based solution implemented with the Akka framework has been applied and compared to an application based on Task Parallel Library (TPL) and to the original sequential application. Architectures, data and control flows are described along with execution...

Full text available to download

OpenGL accelerated method of the material matrix generation for FDTD simulations

Publication

- Year 2014

This paper presents the accelerated technique of the material matrix generation from CAD models utilized by the finite-difference time-domain (FDTD) simulators. To achieve high performance of these computations, the parallel-processing power of a graphics processing unit was employed with the use of the OpenGL library. The method was integrated with the developed FDTD solver, providing approximately five-fold speedup of the material...

Full text to download in external service

A distributed system for conducting chess games in parallel

Publication

- Procedia Computer Science - Year 2017

This paper proposes a distributed and scalable cloud based system designed to play chess games in parallel. Games can be played between chess engines alone or between clusters created by combined chess engines. The system has a built-in mechanism that compares engines, based on Elo ranking which finally presents the strength of each tested approach. If an approach needs more computational power, the design of the system allows...

Full text available to download

Performance Evaluation of the Parallel Codebook Algorithm for Background Subtraction in Video Stream

Publication

G. Szwoch

- Communications in Computer and Information Science - Year 2011

A background subtraction algorithm based on the codebook approach was implemented on a multi-core processor in a parallel form, using the OpenMP system. The aim of the experiments was to evaluate performance of the multithreaded algorithm in processing video streams recorded from monitoring cameras, depending on a number of computer cores used, method of task scheduling, image resolution and degree of image content variability....

Full text to download in external service

Optimization of Execution Time under Power Consumption Constraints in a Heterogeneous Parallel System with GPUs and CPUs

Publication

- Year 2014

The paper proposes an approach for parallelization of computations across a collection of clusters with heterogeneous nodes with both GPUs and CPUs. The proposed system partitions input data into chunks and assigns them to particular devices for processing using OpenCL kernels defined by the user. The system is able to minimize the execution time of the application while maintaining the power consumption of the utilized GPUs and...

Full text to download in external service

Optimization of hybrid parallel application execution in heterogeneous high performance computing systems considering execution time and power consumption

Publication

P. Rościszewski

- Year 2018

Many important computational problems require utilization of high performance computing (HPC) systems that consist of multi-level structures combining higher and higher numbers of devices with various characteristics. Utilizing full power of such systems requires programming parallel applications that are hybrid in two meanings: they can utilize parallelism on multiple levels at the same time and combine together programming interfaces...

Full text to download in external service

Processing of Satellite Data in the Cloud

Publication

- TASK Quarterly - Year 2017

The dynamic development of digital technologies, especially those dedicated to devices generating large data streams, such as all kinds of measurement equipment (temperature and humidity sensors, cameras, radio-telescopes and satellites – Internet of Things) enables more in-depth analysis of the surrounding reality, including better understanding of various natural phenomena, starting from atomic level reactions, through macroscopic...

Full text available to download

Parallelization of Compute Intensive Applications into Workflows based on Services in BeesyCluster

Publication

P. Czarnul

- Year 2011

The paper presents an approach for modeling, optimization and execution of workflow applications based on services that incorporates both service selection and partitioning of input data for parallel processing by parallel workflow paths. A compute-intensive workflow application for parallel integration is presented. An impact of the input data partitioning on the scalability is presented. The paper shows a comparison of the theoretical...

Full text to download in external service

Performance Assessment of Using Docker for Selected MPI Applications in a Parallel Environment Based on Commodity Hardware

Publication

- Applied Sciences-Basel - Year 2022

In the paper, we perform detailed performance analysis of three parallel MPI applications run in a parallel environment based on commodity hardware, using Docker and bare-metal configurations. The testbed applications are representative of the most typical parallel processing paradigms: master–slave, geometric Single Program Multiple Data (SPMD) as well as divide-and-conquer and feature characteristic computational and communication...

Full text available to download

Multi-core processing system for real-time image processing in embedded computer vision applications

Publication

- Year 2008

The paper describes the architecture of a multi-core programmable system for real-time image processing. Image data are processed simultaneously by all processors. The system enables low-level image processing, e.g. background subtraction, detection of moving objects, geometric transformations, indexing of detected objects, evaluation of their shape, and basic analysis of motion trajectories. English: This paper...

Parallelization of video stream algorithms in Kaskada platform

Publication

A. Brzeski

- Year 2011

The purpose of this work is to present different techniques of video stream algorithms parallelization provided by the Kaskada platform - a novel system working in a supercomputer environment designated for multimedia streams processing. Considered parallelization methods include frame-level concurrency, multithreading and pipeline processing. Execution performance was measured on four time-consuming image recognition algorithms,...

Modelling and simulation of GPU processing in the MERPSYS environment

Publication

- Scalable Computing: Practice and Experience - Year 2018

In this work, we evaluate an analytical GPU performance model based on Little's law, that expresses the kernel execution time in terms of latency bound, throughput bound, and achieved occupancy. We then combine it with the results of several research papers, introduce equations for data transfer time estimation, and finally incorporate it into the MERPSYS framework, which is a general-purpose simulator for parallel and distributed...

Full text available to download
