Search results for: PARALLEL DATA PROCESSING - MOST Wiedzy

  • Network-aware Data Prefetching Optimization of Computations in a Heterogeneous HPC Framework

    The rapid development of diverse computer architectures and hardware accelerators means that designers of parallel systems face new problems resulting from heterogeneity. Our implementation of a parallel system called KernelHive makes it possible to run applications efficiently in a heterogeneous environment consisting of multiple collections of nodes with different types of computing devices. The execution engine of the system is open for...

    Full text available for download in the portal

  • Acceleration of the DGF-FDTD method on GPU using the CUDA technology

    We present a parallel implementation of the discrete Green's function formulation of the finite-difference time-domain (DGF-FDTD) method on a graphics processing unit (GPU). The compute unified device architecture (CUDA) parallel computing platform is applied in the developed implementation. For the sake of example, arrays of Yagi-Uda antennas were simulated with the use of DGF-FDTD on GPU. The efficiency of parallel computations...

    Full text available for download from an external service

  • Performance evaluation of the parallel object tracking algorithm employing the particle filter

    Publication

    - Year 2016

    An algorithm based on particle filters is employed to track moving objects in video streams from fixed and non-fixed cameras. Particle weighting is based on color histograms computed in the iHLS color space. Particle computations are parallelized with the CUDA framework. The algorithm was tested on various GPU devices: a desktop GPU card, a mobile chipset and two embedded GPU platforms. The processing speed depending on the number...

  • Assessment of OpenMP Master–Slave Implementations for Selected Irregular Parallel Applications

    Publication

    - Electronics - Year 2021

    The paper investigates various implementations of a master–slave paradigm using the popular OpenMP API and relative performance of the former using modern multi-core workstation CPUs. It is assumed that a master partitions available input into a batch of a predefined number of data chunks, which are then processed in parallel by a set of slaves, and the procedure is repeated until all input data has been processed. The paper experimentally...

    Full text available for download in the portal

  • Parallel Programming for Modern High Performance Computing Systems

    Publication

    - Year 2018

    In view of the growing presence and popularity of multicore and manycore processors, accelerators, and coprocessors, as well as clusters using such computing devices, the development of efficient parallel applications has become a key challenge to be able to exploit the performance of such systems. This book covers the scope of parallel programming for modern high performance computing systems. It first discusses selected and...

    Full text available for download from an external service

  • OpenGL accelerated method of the material matrix generation for FDTD simulations

    Publication

    This paper presents an accelerated technique for generating the material matrix from CAD models, as used by finite-difference time-domain (FDTD) simulators. To achieve high performance in these computations, the parallel-processing power of a graphics processing unit was employed through the OpenGL library. The method was integrated with the developed FDTD solver, providing an approximately five-fold speedup of the material...

    Full text available for download from an external service

  • MERPSYS: An environment for simulation of parallel application execution on large scale HPC systems

    In this paper we present a new environment called MERPSYS that allows simulation of parallel application execution time on cluster-based systems. The environment offers a modeling application using the Java language extended with methods representing message passing type communication routines. It also offers a graphical interface for building a system model that incorporates various hardware components such as CPUs, GPUs, interconnects...

    Full text available for download in the portal

  • Block-based Representation of Application Execution on Modern Parallel Systems

    Publication

    - Year 2013

    The chapter presents how to model execution of a parallel computational application that is to be executed in a large-scale parallel or distributed environment with potentially thousands to millions of execution units. The representation uses previously attributes and factors representative of modern high performance systems including multicore CPUs, GPUs, dedicated accelerators such as Intel Phi.

  • Parallelization of video stream algorithms in kaskada platform

    Publication

    - Year 2011

    The purpose of this work is to present different techniques of video stream algorithms parallelization provided by the Kaskada platform - a novel system working in a supercomputer environment designated for multimedia streams processing. Considered parallelization methods include frame-level concurrency, multithreading and pipeline processing. Execution performance was measured on four time-consuming image recognition algorithms,...

  • A Parallel Corpus-Based Approach to the Crime Event Extraction for Low-Resource Languages

    Publication
    • N. Khairova
    • O. Mamyrbayev
    • N. Rizun
    • M. Razno
    • G. Ybytayeva

    - IEEE Access - Year 2023

    These days, a lot of crime-related events take place all over the world. Most of them are reported in news portals and social media. Crime-related event extraction from the published texts can allow monitoring, analysis, and comparison of police or criminal activities in different countries or regions. Existing approaches to event extraction mainly suggest processing texts in English, French, Chinese, and some other resource-rich...

    Full text available for download in the portal

  • Modeling SPMD Application Execution Time

    Publication

    - Year 2016

    Parallel applications in a Single Process Multiple Data paradigm assume splitting huge amounts of data to multiple processors working in parallel on small data packets. As the individual data packets are not independent, the processors must interact with each other to exchange results of the calculations with their adjacent partners and take these results into account in their own computations. An example of SPMD is geometric parallelism...

  • A Parallel MPI I/O Solution Supported by Byte-addressable Non-volatile RAM Distributed Cache

    Publication

    - Annals of Computer Science and Information Systems - Year 2016

    While many scientific, large-scale applications are data-intensive, fast and efficient I/O operations have become of key importance for HPC environments. We propose an MPI I/O extension based on in-system distributed cache with data located in Non-volatile Random Access Memory (NVRAM) available in each cluster node. The presented architecture makes effective use of NVRAM properties such as persistence and byte-level access behind...

    Full text available for download in the portal

  • Scalable Measurement System for Multiple Impedance Gas Sensors

    The author proposes a scalable architecture for a measurement system for gas sensors whose impedance depends on the gas concentration. The main part of the system is a single-board impedance analyser. The number of analysers working in parallel can be configured according to the specific application. The system is controlled by a single computer which organises the measurement cycle and stores the acquired measurement data. The system is...

    Full text available for download in the portal

  • Modeling and Simulation for Exploring Power/Time Trade-off of Parallel Deep Neural Network Training

    In the paper we tackle the bi-objective execution time and power consumption optimization problem concerning the execution of parallel applications. We propose using a discrete-event simulation environment for exploring this power/time trade-off in the form of a Pareto front. The solution is verified by a case study based on a real deep neural network training application for automatic speech recognition. A simulation lasting over 2 hours...

    Full text available for download in the portal

  • A Regular Expression Matching Application with Configurable Data Intensity for Testing Heterogeneous HPC Systems

    Publication

    Modern High Performance Computing (HPC) systems are becoming increasingly heterogeneous in terms of utilized hardware, as well as software solutions. The problems that we wish to solve efficiently using those systems differ in complexity, not only in magnitude but also in type: computation, data or communication intensity. Developing new mechanisms for dealing with those complexities or choosing an...

  • Modeling of Passive and Forced Convection Heat Transfer in Channels with Rib Turbulators

    Publication

    - ENERGIES - Year 2021

    The main goal of the research presented in this paper was the experimental and numerical analysis of heat enhancement and aerodynamic phenomena during air flow in a channel equipped with flow turbulators in the form of properly configured ribs. The use of ribs intensifies the heat transfer and at the same time increases not only the flow resistance but also the energy costs. Therefore, designing modern heat exchangers with optimal...

    Full text available for download in the portal

  • Andrzej Stateczny prof. dr hab. inż.

    People

    Prof. dr hab. inż. Andrzej Stateczny is a professor at Gdańsk University of Technology and president of Marine Technology Ltd. His research interests focus mainly on navigation, hydrography, and geoinformatics. His current research covers radar navigation, comparative navigation, hydrography, and artificial intelligence methods for image processing and multi-sensor data fusion. He has been the head or principal...

  • Planning optimised multi-tasking operations under the capability for parallel machining

    The advent of advanced multi-tasking machines (MTMs) in the metalworking industry has provided the opportunity for more efficient parallel machining as compared to traditional sequential processing. It entailed the need for developing appropriate reasoning schemes for efficient process planning to take advantage of machining capabilities inherent in these machines. This paper addresses an adequate methodical approach for a non-linear...

    Full text available for download in the portal

  • Computer experiments with a parallel clonal selection algorithm for the graph coloring problem

    Publication

    Artificial immune systems (AIS) are algorithms that are based on the structure and mechanisms of the vertebrate immune system. Clonal selection is a process that allows lymphocytes to launch a quick response to known pathogens and to adapt to new, previously unencountered ones. This paper presents a parallel island model algorithm based on the clonal selection principles for solving the Graph Coloring Problem. The performance of...

    Full text available for download from an external service

  • Anna Wałek dr

    People

    Dr Anna Wałek, President of the International Association of University Libraries (IATUL), director of the Gdańsk University of Technology Library, expert in open access to scientific resources (Open Science, Open Access, Open Research Data) and in the organization and management of academic libraries. Since February 2023, expert and Coordinator of the East Hub within the project Focusing on Open, Collaboration and Useful Science (EOSC Focus)...

  • Patryk Ziółkowski dr inż.

    Graduate of the Faculty of Civil and Environmental Engineering at Gdańsk University of Technology, specializing in Building and Engineering Structures. He works as an assistant professor in the Department of Engineering Structures. He has taken part in international projects, including projects for the Alabama Department of Transportation (2015), and is also a recipient of a Kosciuszko Foundation grant for conducting research in the USA, which he carried out in 2018. Co-author...

  • Testing for conformance of parallel programming pattern languages

    This paper reports on the project being run by TUG and IMAG, aimed at reducing the volume of tests required to exercise parallel programming language compilers and libraries. The idea is to use the ISO STEP standard scheme for conformance testing of software products. A detailed example illustrating the ongoing work is presented.

  • Bounds on the Cover Time of Parallel Rotor Walks

    Publication

    - Year 2014

    The rotor-router mechanism was introduced as a deterministic alternative to the random walk in undirected graphs. In this model, a set of k identical walkers is deployed in parallel, starting from a chosen subset of nodes, and moving around the graph in synchronous steps. During the process, each node maintains a cyclic ordering of its outgoing arcs, and successively propagates walkers which visit it along its outgoing arcs in...

    Full text available for download from an external service

  • Modern Platform for Parallel Algorithms Testing: Java on Intel Xeon Phi

    Parallel algorithms are a popular method of increasing system performance. Apart from showing their properties using asymptotic analysis, proof-of-concept implementations and practical experiments are often required. In order to speed up development and provide a simple and easily accessible testing environment that enables execution of reliable experiments, the paper proposes a platform with a multi-core computational accelerator:...

    Full text available for download from an external service

  • Krzysztof Kutt dr inż.

    People

    Computer scientist and psychologist trying to combine expertise from both disciplines into something cool. My research activity focuses on the development of affective HCI/BCI interfaces (based on multimodal fusion of signals and contextual data), methods for processing sensory data (including semantization of such data) and the development of knowledge-based systems (in particular knowledge graphs and semantic web systems).

  • Effective configuration of a double triad planar parallel manipulator for precise positioning of heavy details during their assembling process

    Publication

    - Year 2019

    In the paper, a dynamics analysis of a parallel manipulator is presented. It is an atypical manipulator, intended to help in assembling heavy industrial constructions. A few atypical properties are required: small workspace, slow velocities, and high loads. Initially, a short discussion of the definition of parallel manipulators is presented, as well as a sketch of the proposed structure. In parallel, some definitions, assumptions...

    Full text available for download in the portal

  • Parallel immune system for graph coloring

    Publication

    - Year 2008

    This paper presents a parallel artificial immune system designed for graph coloring. The algorithm is based on the clonal selection principle. Each processor operates on its own pool of antibodies and a migration mechanism is used to allow processors to exchange information. Experimental results show that migration improves the performance of the algorithm. The experiments were performed using a high performance cluster on a set...

    Full text available for download from an external service

  • In-ADC, Rank-Order Filter for Digital Pixel Sensors

    This paper presents a new implementation of the rank-order filter, which is established on a parallel-operated array of single-slope (SS) analog-to-digital converters (ADCs). The SS ADCs use an “on-the-ramp processing” technique, i.e., filtration is performed along with analog-to-digital conversion, so the final states of the converters represent a filtered image. A proof-of-concept 64 × 64 array of SS ADCs, integrated with MOS...

    Full text available for download in the portal

  • Mariusz Figurski prof. dr hab. inż.

    Director of the Centre for Meteorological Modelling at the Institute of Meteorology and Water Management - National Research Institute. Born on 27 April 1964 in Łasin. He passed his secondary school leaving examination in 1983 after graduating from II Liceum Ogólnokształcące im. Jana III Sobieskiego in Grudziądz. He completed his higher education, in an individual study programme, in 1989 (10 July 1989) at the Faculties of Electromechanics and of Civil Engineering and Geodesy of Wojskowa Akademia...

  • Runtime Visualization of Application Progress and Monitoring of a GPU-enabled Parallel Environment

    Publication

    The paper presents design, implementation and real life uses of a visualization subsystem for a distributed framework for parallelization of workflow-based computations among clusters with nodes that feature both CPUs and GPUs. Firstly, the proposed system presents a graphical view of the infrastructure with clusters, nodes and compute devices along with parameters and runtime graphs of load, memory available, fan speeds etc. Secondly,...

    Full text available for download from an external service

  • A New Approach for the Mitigating of Flow Maldistribution in Parallel Microchannel Heat Sink

    Publication

    The problem of flow maldistribution is critical in microchannel heat sinks (MCHS). It induces temperature nonuniformity, which may ultimately lead to the breakdown of the associated system. In the present communication, a novel approach for mitigating the flow maldistribution problem in parallel MCHS is proposed using variable-width microchannels. Numerical simulation of a copper parallel MCHS consisting of 25 channels...

    Full text available for download from an external service

  • Low-Power Receivers for Wireless Capacitive Coupling Transmission in 3-D-Integrated Massively Parallel CMOS Imager

    The paper presents pixel receivers for massively parallel transmission of video signals between capacitively coupled integrated circuits (ICs). The receivers meet the key requirements for massively parallel transmission, namely low power consumption below a single μW, small area of less than 205 μm², high sensitivity better than 160 mV, and good immunity to crosstalk. The receivers were implemented and measured in a 3-D IC (two face-to-face...

    Full text available for download in the portal

  • Parallelization of Selected Algorithms on Multi-core CPUs, a Cluster and in a Hybrid CPU+Xeon Phi Environment

    In the paper we present parallel implementations as well as execution times and speed-ups of three different algorithms run in various environments such as on a workstation with multi-core CPUs and a cluster. The parallel codes, implementing the master-slave model in C+MPI, differ in computation to communication ratios. The considered problems include: a genetic algorithm with various ratios of master processing time to communication...

    Full text available for download in the portal

  • FPGA Acceleration of Matrix-Assembly Phase of RWG-Based MoM

    Publication

    In this letter, the field-programmable-gate-array accelerated implementation of matrix-assembly phase of the method of moments (MoM) is presented. The solution is based on a discretization of the frequency-domain mixed potential integral equation using the Rao-Wilton-Glisson basis functions and their extension to wire-to-surface junctions. To take advantage of the given hardware resources (i.e., Xilinx Alveo U200 accelerator card),...

    Full text available for download from an external service

  • Controlled grafting of vinylic monomers on polyolefins: a robust mathematical modeling approach

    Publication
    • M. Saeb
    • B. Rezaee
    • A. Shadman
    • K. Formela
    • Z. Ahmadi
    • F. Hemmati
    • T. Kermaniyan
    • Y. Mohammadi

    - DESIGNED MONOMERS AND POLYMERS - Year 2017

    Experimental and mathematical modeling analyses were used for controlling melt free-radical grafting of vinylic monomers on polyolefins and, thereby, reducing the disturbance of undesired cross-linking of polyolefins. Response surface, desirability function, and artificial intelligence methodologies were blended to modeling/optimization of grafting reaction in terms of vinylic monomer content, peroxide initiator concentration,...

    Full text available for download in the portal

  • Survey of Methodologies, Approaches, and Challenges in Parallel Programming Using High-Performance Computing Systems

    This paper provides a review of contemporary methodologies and APIs for parallel programming, with representative technologies selected in terms of target system type (shared memory, distributed, and hybrid), communication patterns (one-sided and two-sided), and programming abstraction level. We analyze representatives in terms of many aspects including programming model, languages, supported platforms, license, optimization goals,...

    Full text available for download in the portal

  • A self-optimization mechanism for generalized adaptive notch smoother

    Publication

    Tracking of nonstationary narrowband signals is often accomplished using algorithms called adaptive notch filters (ANFs). Generalized adaptive notch smoothers (GANSs) extend the concepts of adaptive notch filtering in two directions. Firstly, they are designed to estimate coefficients of nonstationary quasi-periodic systems, rather than signals. Secondly, they employ noncausal processing, which greatly improves their accuracy and...

    Full text available for download from an external service

  • Performance Analysis of the OpenCL Environment on Mobile Platforms

    Publication

    Today’s smartphones have more and more features that so far were only assigned to personal computers. Every year these devices are composed of better and more efficient components. Everything indicates that modern smartphones are replacing ordinary computers in various activities. High computing power is required for tasks such as image processing, speech recognition and object detection. This paper analyses the performance of...

    Full text available for download from an external service

  • Massively parallel linear-scaling Hartree–Fock exchange and hybrid exchange–correlation functionals with plane wave basis set accuracy

    Publication

    - JOURNAL OF CHEMICAL PHYSICS - Year 2021

    We extend our linear-scaling approach for the calculation of Hartree–Fock exchange energy using localized in situ optimized orbitals [Dziedzic et al., J. Chem. Phys. 139, 214103 (2013)] to leverage massive parallelism. Our approach has been implemented in the ONETEP (Order-N Electronic Total Energy Package) density functional theory framework, which employs a basis of non-orthogonal generalized Wannier functions (NGWFs) to achieve...

    Full text available for download in the portal

  • Platforma KASKADA jako system zapewniania bezpieczeństwa poprzez masową analizę strumieni multimedialnych w czasie rzeczywistym [The KASKADA Platform as a security assurance system based on mass real-time analysis of multimedia streams]

    The article presents the KASKADA Platform, understood as a system for processing digital data and multimedia streams and as an offering of services supporting public safety, the assessment of medical examinations, and the protection of intellectual property. The aim of the work was to create an innovative system enabling efficient, mass analysis of digital documents and multimedia streams in real time...

  • Mechanism of recognition of parallel G-quadruplexes by DEAH/RHAU helicase DHX36 explored by molecular dynamics simulations

    Because of high stability and slow unfolding rates of G-quadruplexes (G4), cells have evolved specialized helicases that disrupt these non-canonical DNA and RNA structures in an ATP-dependent manner. One example is DHX36, a DEAH-box helicase, which participates in gene expression and replication by recognizing and unwinding parallel G4s. Here, we studied the molecular basis for the high affinity and specificity of DHX36 for parallel-type...

    Full text available for download in the portal

  • Performance evaluation of unified memory and dynamic parallelism for selected parallel CUDA applications

    The aim of this paper is to evaluate performance of new CUDA mechanisms—unified memory and dynamic parallelism for real parallel applications compared to standard CUDA API versions. In order to gain insight into performance of these mechanisms, we decided to implement three applications with control and data flow typical of SPMD, geometric SPMD and divide-and-conquer schemes, which were then used for tests and experiments. Specifically,...

    Full text available for download in the portal

  • Influence of laser processing of the low alloy medium carbon structural steel on the development of the fatigue crack

    Publication

    The paper contains the results of the structural analysis, hardness tests and fatigue tests conducted for the medium carbon structural steel with low content of Cr and Ni after its processing with CO2 laser beam. Pre-cracks were made in the round compact tension (RCT) specimen used for fatigue test. Next, four paths, parallel to each other, were melted on both sides of the samples using a laser beam. The paths were perpendicular...

    Full text available for download in the portal

  • Sensorless predictive control of three-phase parallel active filter

    Publication

    The paper presents the control system of parallel active power filter (APF) with predictive reference current calculation and model based predictive current control. The novel estimator and predictor of grid emf is proposed for AC voltage sensorless operation of APF, regardless of distortion of this voltage. Proposed control system provides control of APF current with high precision and dynamics limited only by filter circuit parameters....

    Full text available for download from an external service

  • Magdalena Szuflita-Żurawska

    Magdalena Szuflita-Żurawska is head of the Scientific and Technical Information Section at Gdańsk University of Technology and Leader of the Open Science Competence Center at the Gdańsk University of Technology Library. Her main research interests focus on scholarly communication and open research data, as well as research motivation and productivity. She is responsible, among other things, for running training for staff...

  • Numerical Study on Mitigation of Flow Maldistribution in Parallel Microchannel Heat Sink: Channels Variable Width Versus Variable Height Approach

    Publication

    - JOURNAL OF ELECTRONIC PACKAGING - Year 2019

    A microchannel heat sink, on the one hand, benefits from heat transfer performance intensified several-fold, but on the other hand suffers from an aggravated form of otherwise trifling limitations associated with imperfect hydrodynamics and heat transfer behavior. Flow maldistribution is one such limitation; it exacerbates temperature nonuniformity across parallel microchannels, leading to an increase in maximum base temperature. Recently, variable...

    Full text available for download from an external service

  • Decentralized control of a different rated parallel UPS systems

    Publication

    The paper presents a single-phase uninterruptible power supply (UPS) system with galvanically separated DC-AC-DC-AC converters operating in parallel. A communication system between the converters, based on the CAN physical layer, has been developed and applied; it makes possible decentralized master-slave control that provides a high availability factor for the whole UPS system. The control system of particular converters has been developed...

    Full text available for download from an external service

  • Single and Dual-GPU Generalized Sparse Eigenvalue Solvers for Finding a Few Low-Order Resonances of a Microwave Cavity Using the Finite-Element Method

    Publication

    This paper presents two fast generalized eigenvalue solvers for sparse symmetric matrices that arise when electromagnetic cavity resonances are investigated using the higher-order finite element method (FEM). To find a few low-order resonances, the locally optimal block preconditioned conjugate gradient (LOBPCG) algorithm with null-space deflation is applied. The computations are expedited by using one or two graphical processing...

    Full text available for download in the portal

  • Comparison of EHD devices with parallel and in series spiked electrodes

    Publication

    - Year 2012

    In this paper two electrohydrodynamic (EHD) devices for gas pumping and cleaning are presented. In both cases to induce an airflow in these EHD devices corona discharge was used. The discharge was generated between the spiked electrodes set parallel (the first case) or in series (the second case) and the plate electrodes. An asymmetric electric field and generated discharge result in unidirectional gas flow through the EHD device....

  • Auto-tuning methodology for configuration and application parameters of hybrid CPU + GPU parallel systems based on expert knowledge

    Publication

    Auto-tuning of configuration and application parameters makes it possible to achieve significant performance gains in many contemporary compute-intensive applications. Feasible search spaces of parameters tend to become too big to allow for exhaustive search in the auto-tuning process. Expert knowledge about the utilized computing systems becomes useful to prune the search space and new methodologies are needed in the face of emerging heterogeneous...

    Full text available for download in the portal