Filters
total: 8658
-
Catalog
- Publications 7739 available results
- Journals 17 available results
- Conferences 44 available results
- People 92 available results
- Inventions 1 available results
- Projects 13 available results
- Research Equipment 8 available results
- e-Learning Courses 108 available results
- Events 4 available results
- Open Research Data 632 available results
displaying 1000 best results Help
Search results for: parallel response model
-
Rigid finite elements and multibody modeling in analyses of a robot shaped elastic/plastic deformations of a beam
PublicationDynamics analysis of a system composed of a parallel manipulator and of an elastic beam is presented in the paper. Classic 3RRR parallel manipulator is considered and used to deform the beam. Elasto-plastic deformations are investigated. Rigid-finite-elements technique is employed to deal with dynamics of the beam. A multibody structure is associated with the introduced hybrid system in order to model its dynamics. Idea of the...
-
Implementation of FDTD-compatible Green's function on heterogeneous CPU-GPU parallel processing system
PublicationThis paper presents an implementation of the FDTD-compatible Green's function on a heterogeneous parallel processing system. The developed implementation simultaneously utilizes computational power of the central processing unit (CPU) and the graphics processing unit (GPU) to the computational tasks best suited to each architecture. Recently, closed-form expression for this discrete Green's function (DGF) was derived, which facilitates...
-
Parallel Implementation of the Discrete Green's Function Formulation of the FDTD Method on a Multicore Central Processing Unit
PublicationParallel implementation of the discrete Green's function formulation of the finite-difference time-domain (DGF-FDTD) method was developed on a multicore central processing unit. DGF-FDTD avoids computations of the electromagnetic field in free-space cells and does not require domain termination by absorbing boundary conditions. Computed DGF-FDTD solutions are compatible with the FDTD grid enabling the perfect hybridization of FDTD...
-
Parallel immune system for graph coloring
PublicationThis paper presents a parallel artificial immune system designed forgraph coloring. The algorithm is based on the clonal selection principle. Each processor operates on its own pool of antibodies and amigration mechanism is used to allow processors to exchange information. Experimental results show that migration improves the performance of the algorithm. The experiments were performed using a high performance cluster on a set...
-
Performance Assessment of Using Docker for Selected MPI Applications in a Parallel Environment Based on Commodity Hardware
PublicationIn the paper, we perform detailed performance analysis of three parallel MPI applications run in a parallel environment based on commodity hardware, using Docker and bare-metal configurations. The testbed applications are representative of the most typical parallel processing paradigms: master–slave, geometric Single Program Multiple Data (SPMD) as well as divide-and-conquer and feature characteristic computational and communication...
-
Sensitivity of the Baltic Sea level prediction to spatial model resolution
Publicationhe three-dimensional hydrodynamic model of the Baltic Sea (M3D) and...
-
Parallel multithread computing for spectroscopic analysis in optical coherence tomography
PublicationSpectroscopic Optical Coherence Tomography (SOCT) is an extension of Optical Coherence Tomography (OCT). It allows gathering spectroscopic information from individual scattering points inside the sample. It is based on time-frequency analysis of interferometric signals. Such analysis requires calculating hundreds of Fourier transforms while performing a single A-scan. Additionally, further processing of acquired spectroscopic information...
-
Redundantly Actuated 3RRR Parallel Planar Manipulator - Numerical Analyses of its Dynamics Sensitivity on Modifications of its Platform’s Inertia Parameters
PublicationIn the paper, numerical analyses, as well as dynamics of a complex mechanism, are presented. Two objectives are crucial for the paper: inverse dynamic model is needed (dedicated to be use in the model predictive controller); an identification method is searched (some trajectory parameters are controlled, when specific trajectory is tracked under an open-loop model-based control), as selected parameters must be identified for the...
-
A New Approach for the Mitigating of Flow Maldistribution in Parallel Microchannel Heat Sink
PublicationThe problem of flow maldistribution is very critical in microchannel heat sinks (MCHS). It induces temperature nonuniformity, which may ultimately lead to the breakdown of associated system. In the present communication, a novel approach for the mitigation of flow maldistribution problem in parallel MCHS has been proposed using variable width microchannels. Numerical simulation of copper made parallel MCHS consisting of 25 channels...
-
Runtime Visualization of Application Progress and Monitoring of a GPU-enabled Parallel Environment
PublicationThe paper presents design, implementation and real life uses of a visualization subsystem for a distributed framework for parallelization of workflow-based computations among clusters with nodes that feature both CPUs and GPUs. Firstly, the proposed system presents a graphical view of the infrastructure with clusters, nodes and compute devices along with parameters and runtime graphs of load, memory available, fan speeds etc. Secondly,...
-
Minimizing Distribution and Data Loading Overheads in Parallel Training of DNN Acoustic Models with Frequent Parameter Averaging
PublicationIn the paper we investigate the performance of parallel deep neural network training with parameter averaging for acoustic modeling in Kaldi, a popular automatic speech recognition toolkit. We describe experiments based on training a recurrent neural network with 4 layers of 800 LSTM hidden states on a 100-hour corpora of annotated Polish speech data. We propose a MPI-based modification of the training program which minimizes the...
-
On the preestimation technique and its application to identification of nonstationary systems
PublicationThe problem of noncausal identification of a nonstationary stochastic FIR (finite impulse response) sys- tem is reformulated, and solved, as a problem of smoothing of preestimated parameter trajectories. Three approaches to preestimation are critically analyzed and compared. It is shown that optimization of the smoothing operation can be performed adaptively using the parallel estimation technique. The new approach is computationally...
-
Uniform Model Interface for Assurance Case Integration with System Models
PublicationAssurance cases are developed and maintained in parallel with corresponding system models and therefore need to reference each other. Managing the correctness and consistency of interrelated safety argument and system models is essential for system dependability and is a nontrivial task. The model interface presented in this paper enables a uniform process of establishing and managing assurance case references to various types...
-
Optimization of Execution Time under Power Consumption Constraints in a Heterogeneous Parallel System with GPUs and CPUs
PublicationThe paper proposes an approach for parallelization of computations across a collection of clusters with heterogeneous nodes with both GPUs and CPUs. The proposed system partitions input data into chunks and assigns to par- ticular devices for processing using OpenCL kernels defined by the user. The sys- tem is able to minimize the execution time of the application while maintaining the power consumption of the utilized GPUs and...
-
Low-Power Receivers for Wireless Capacitive Coupling Transmission in 3-D-Integrated Massively Parallel CMOS Imager
PublicationThe paper presents pixel receivers for massively parallel transmission of video signal between capacitive coupled integrated circuits (ICs). The receivers meet the key requirements for massively parallel transmission, namely low-power consumption below a single μW, small area of less than 205 μm2, high sensitivity better than 160 mV, and good immunity to crosstalk. The receivers were implemented and measured in a 3-D IC (two face-to-face...
-
Scheduling of compatible jobs on parallel machines
PublicationThe dissertation discusses the problems of scheduling compatible jobs on parallel machines. Some jobs are incompatible, which is modeled as a binary relation on the set of jobs; the relation is often modeled by an incompatibility graph. We consider two models of machines. The first model, more emphasized in the thesis, is a classical model of scheduling, where each machine does one job at time. The second one is a model of p-batching...
-
A Solution to Image Processing with Parallel MPI I/O and Distributed NVRAM Cache
PublicationThe paper presents a new approach to parallel image processing using byte addressable, non-volatile memory (NVRAM). We show that our custom built MPI I/O implementation of selected functions that use a distributed cache that incorporates NVRAMs located in cluster nodes can be used for efficient processing of large images. We demonstrate performance benefits of such a solution compared to a traditional implementation without NVRAM...
-
Psychometric properties of the Bern illegitimate tasks scale using classical test and item response theories
PublicationCombining a classical test theory and an item response theory (IRT), this study aimed to investigate the psychometric properties of the Bern Illegitimate Tasks Scale (BITS) by measuring two conceptually separate dimensions capturing unnecessary tasks (perceived by employees as pointless) and unreasonable tasks (perceived as unfairly or inappropriately assigned). Data collected among Polish employees in two samples (N= 965 and N=...
-
Genetic Positioning of Fire Stations Utilizing Grid-computing Platform
PublicationA chapter presents a model for determining near-optimal locations of fire stations based on topography of a given area and location of forests, rivers, lakes and other elements of the site. The model is based on principals of genetic algorithms and utilizes the power of the grid to distribute and execute in parallel most performance-demanding computations involved in the algorithm.
-
Dynamic Data Management Among Multiple Databases for Optimization of Parallel Computations in Heterogeneous HPC Systems
PublicationRapid development of diverse computer architectures and hardware accelerators caused that designing parallel systems faces new problems resulting from their heterogeneity. Our implementation of a parallel system called KernelHive allows to efficiently run applications in a heterogeneous environment consisting of multiple collections of nodes with different types of computing devices. The execution engine of the system is open for...
-
Performance Evaluation of Selected Parallel Object Detection and Tracking Algorithms on an Embedded GPU Platform
PublicationPerformance evaluation of selected complex video processing algorithms, implemented on a parallel, embedded GPU platform Tegra X1, is presented. Three algorithms were chosen for evaluation: a GMM-based object detection algorithm, a particle filter tracking algorithm and an optical flow based algorithm devoted to people counting in a crowd flow. The choice of these algorithms was based on their computational complexity and parallel...
-
Adaptive system for recognition of sounds indicating threats to security of people and property employing parallel processing of audio data streams
PublicationA system for recognition of threatening acoustic events employing parallel processing on a supercomputing cluster is featured. The methods for detection, parameterization and classication of acoustic events are introduced. The recognition engine is based onthreshold-based detection with adaptive threshold and Support Vector Machine classifcation. Spectral, temporal and mel-frequency descriptors are used as signal features. The...
-
Channel Blockage and Flow Maldistribution during Unsteady Flow in a Model Microchannel Plate heat Exchanger
PublicationThis paper describes the problem of channel blockage as a result of flow maldistribution between the channels of a model mini channel plate heat exchanger consisting of one pass on each leg. Each leg of the heat exchanger contains 51 parallel and rectangular minichannels of four hydraulic diameters namely 461 μm, 571 μm, 750 μm and 823 μm. In addition, a more complex geometry has been investigated where for the sake of breaking...
-
Optimization of Data Assignment for Parallel Processing in a Hybrid Heterogeneous Environment Using Integer Linear Programming
PublicationIn the paper we investigate a practical approach to application of integer linear programming for optimization of data assignment to compute units in a multi-level heterogeneous environment with various compute devices, including CPUs, GPUs and Intel Xeon Phis. The model considers an application that processes a large number of data chunks in parallel on various compute units and takes into account computations, communication including...
-
Identification of nonstationary multivariate autoregressive processes– Comparison of competitive and collaborative strategies for joint selection of estimation bandwidth and model order
PublicationThe problem of identification of multivariate autoregressive processes (systems or signals) with unknown and possibly time-varying model order and time-varying rate of parameter variation is considered and solved using parallel estimation approach. Under this approach, several local estimation algorithms, with different order and bandwidth settings, are run simultaneously and compared based on their predictive performance. First,...
-
Mechanism of recognition of parallel G-quadruplexes by DEAH/RHAU helicase DHX36 explored by molecular dynamics simulations
PublicationBecause of high stability and slow unfolding rates of G-quadruplexes (G4), cells have evolved specialized helicases that disrupt these non-canonical DNA and RNA structures in an ATP-dependent manner. One example is DHX36, a DEAH-box helicase, which participates in gene expression and replication by recognizing and unwinding parallel G4s. Here, we studied the molecular basis for the high affinity and specificity of DHX36 for parallel-type...
-
Recognition of hazardous acoustic events employing parallel processing on a supercomputing cluster . Rozpoznawanie niebezpiecznych zdarzeń dźwiękowych z wykorzystaniem równoległego przetwarzania na klastrze superkomputerowym
PublicationA method for automatic recognition of hazardous acoustic events operating on a super computing cluster is introduced. The methods employed for detecting and classifying the acoustic events are outlined. The evaluation of the recognition engine is provided: both on the training set and using real-life signals. The algorithms yield sufficient performance in practical conditions to be employed in security surveillance systems. The...
-
Numerical Study on Mitigation of Flow Maldistribution in Parallel Microchannel Heat Sink: Channels Variable Width Versus Variable Height Approach
PublicationMicrochannel heat sink on one hand enjoys benefits of intensified several folds heat transfer performance but on the other hand has to suffer aggravated form of trifling limitations associated with imperfect hydrodynamics and heat transfer behavior. Flow maldistribution is one of such limitation that exaggerates temperature nonuniformity across parallel microchannels leading to increase in maximum base temperature. Recently, variable...
-
Decentralized control of a different rated parallel UPS systems
PublicationThe paper presents the single phase uninterruptible power supply (UPS) system with galvanic separated DC-AC-DC-AC converters operating in parallel. The CAN physical layer based system of communication between converters has been developed and applied, which allow to utilize a decentralized master-slave control providing high availability factor of the whole UPS system. The control system of particular converters has been developed...
-
Wiktoria Wojnicz dr hab. inż.
PeopleDSc in Mechanics (in the field of Biomechanics) - Lodz Univeristy of Technology, 2019 PhD in Mechanics (in the field of Biomechanics) - Lodz Univeristy of Technology, 2009 (with distinction) List of papers (2009 - ) Wojnicz W., Wittbrodt E., Analysis of muscles' behaviour. Part I. The computational model of muscle. Acta of Bioengineering and Biomechanics, Vol. 11, No.4, 2009, p. 15-21 Wojnicz W., Wittbrodt E., Analysis of...
-
Performance and Power-Aware Modeling of MPI Applications for Cluster Computing
PublicationThe paper presents modeling of performance and power consumption when running parallel applications on modern cluster-based systems. The model includes basic so-called blocks representing either computations or communication. The latter includes both point-to-point and collective communication. Real measurements were performed using MPI applications and routines run on three different clusters with both Infiniband and Gigabit Ethernet...
-
Comparison of EHD devices with parallel and in series spiked electrodes
PublicationIn this paper two electrohydrodynamic (EHD) devices for gas pumping and cleaning are presented. In both cases to induce an airflow in these EHD devices corona discharge was used. The discharge was generated between the spiked electrodes set parallel (the first case) or in series (the second case) and the plate electrodes. An asymmetric electric field and generated discharge result in unidirectional gas flow through the EHD device....
-
Assessment of OpenMP Master–Slave Implementations for Selected Irregular Parallel Applications
PublicationThe paper investigates various implementations of a master–slave paradigm using the popular OpenMP API and relative performance of the former using modern multi-core workstation CPUs. It is assumed that a master partitions available input into a batch of predefined number of data chunks which are then processed in parallel by a set of slaves and the procedure is repeated until all input data has been processed. The paper experimentally...
-
Auto-tuning methodology for configuration and application parameters of hybrid CPU + GPU parallel systems based on expert knowledge
PublicationAuto-tuning of configuration and application param- eters allows to achieve significant performance gains in many contemporary compute-intensive applications. Feasible search spaces of parameters tend to become too big to allow for exhaustive search in the auto-tuning process. Expert knowledge about the utilized computing systems becomes useful to prune the search space and new methodologies are needed in the face of emerging heterogeneous...
-
Numeryczna analiza dynamiczna wieszaków w łukowym wiadukcie kolejowym. Analiza przypadku
PublicationW artykule przedstawiono analizę dynamiczną wiaduktu kolejowego w ciągu Centralnej Magistrali Kolejowej zlokalizowanego koło Huty Zawadzkiej. W analizie zwrócono szczególną uwagę na lokalną odpowiedź wieszaków na obciążenie przejeżdżającym pociągiem oraz wiatrem. Obliczenia przeprowadzono na podstawie wykonanego modelu MES. Zweryfikowano poprawność modelu poprzez porównanie uzyskanych wyników z wartościami pomierzonymi. Przeprowadzono...
-
Efficient parallel implementation of crowd simulation using a hybrid CPU+GPU high performance computing system
PublicationIn the paper we present a modern efficient parallel OpenMP+CUDA implementation of crowd simulation for hybrid CPU+GPU systems and demonstrate its higher performance over CPU-only and GPU-only implementations for several problem sizes including 10 000, 50 000, 100 000, 500 000 and 1 000 000 agents. We show how performance varies for various tile sizes and what CPU–GPU load balancing settings shall be preferred for various domain...
-
A CMOS Pixel With Embedded ADC, Digital CDS and Gain Correction Capability for Massively Parallel Imaging Array
PublicationIn the paper, a CMOS pixel has been proposed for imaging arrays with massively parallel image acquisition and simultaneous compensation of dark signal nonuniformity (DSNU) as well as photoresponse nonuniformity (PRNU). In our solution the pixel contains all necessary functional blocks: a photosensor and an analog-to-digital converter (ADC) with built-in correlated double sampling (CDS) integrated together. It is implemented in...
-
Modeling the effect of parasitic capacitances on the dead-time distortion in multilevel NPC inverters
PublicationA simple model is derived and verified for evaluating the effect of parasitic capacitances on the dead-time related voltage distortion in multilevel NPC voltage source inverters. The model permits well-defined and precise compensation of dead-time distortion, exhibiting meaningful improvement on compensation methods neglecting the effects of parasitic capacitances. A simple formula is given for evaluating the capacitances as serial/parallel...
-
Performance Evaluation of the Parallel Codebook Algorithm for Background Subtraction in Video Stream
PublicationA background subtraction algorithm based on the codebook approach was implemented on a multi-core processor in a parallel form, using the OpenMP system. The aim of the experiments was to evaluate performance of the multithreaded algorithm in processing video streams recorded from monitoring cameras, depending on a number of computer cores used, method of task scheduling, image resolution and degree of image content variability....
-
A Parallel MPI I/O Solution Supported by Byte-addressable Non-volatile RAM Distributed Cache
PublicationWhile many scientific, large-scale applications are data-intensive, fast and efficient I/O operations have become of key importance for HPC environments. We propose an MPI I/O extension based on in-system distributed cache with data located in Non-volatile Random Access Memory (NVRAM) available in each cluster node. The presented architecture makes effective use of NVRAM properties such as persistence and byte-level access behind...
-
Modeling and control of a redundantly actuated variable mass 3RRR planar manipulator controlled by a model-based feedforward and a model-based-proportional-derivative feedforward–feedback controller
PublicationIn the paper, dynamics of a complex mechatronics system is considered. A redundantly actuated planar manipulator is the base of the mechanical part of it. It is a 3RRR 1 platform based parallel manipulator. To control its trajectory, a model-based feedforward controller is employed. Three aspects are fundamental in the presented investigations. The first focus is on development of an accurate numerical model used to solve the inverse...
-
Benchmarking Parallel Chess Search in Stockfish on Intel Xeon and Intel Xeon Phi Processors
PublicationThe paper presents results from benchmarking the parallel multithreaded Stockfish chess engine on selected multi- and many-core processors. It is shown how the strength of play for an n-thread version compares to 1-thread version on both Intel Xeon and latest Intel Xeon Phi x200 processors. Results such as the number of wins, losses and draws are presented and how these change for growing numbers of threads. Impact of using particular...
-
Executing Multiple Simulations in the MERPSYS Environment
PublicationThe chapter investigates the steps necessary to perform a simulation instance in the MERPSYS environment and discusses potential limitations in case when vast numbers of simulations are required. An extended architecture is proposed which includes a JMS-based simulation queue and multiple distributed simulators, overcoming the potential bottlenecks. The chapter introduces also methods for preparing suites of multiple simulations...
-
Fixed Pattern Noise Reduction and Linearity Improvement in Time-Mode CMOS Image Sensors
PublicationIn the paper, a digital clock stopping technique for gain and offset correction in time-mode analog-to-digital converters (ADCs) has been proposed. The technique is dedicated to imagers with massively parallel image acquisition working in the time mode where compensation of dark signal non-uniformity (DSNU) as well as photo-response non-uniformity (PRNU) is critical. Fixed pattern noise (FPN) reduction has been experimentally validated...
-
Comment on "On accurate capacitance characterization of organic photovoltaic cells"
PublicationIn the 100th volume of Applied Physics Letters Carr and Chaudhary have presented a work on capacitance characterization of organic photovoltaic cells. The work concerns small signal measurements of various organic photovoltaic structures. The authors however limit their considerations to one part of small signal response, namely to capacitance measured either in parallel mode or in series mode. This attitude generally does not...
-
Infrared techniques for natural convection investigations in channels between two vertical, parallel, isothermal and symmetrically heated plates
PublicationThe effect of the gap width between two symmetrically heated vertical, parallel, isothermal plates on intensity of natural convective heat transfer in a gas (Pr = 0.71) was experimentally studied using the balance and gradient methods. In the former method heat fluxes were determined based on measurements of the voltage and electric current supplying the heaters placed inside the walls. In the latter, heat fluxes were calculated...
-
On Anti-Plane Surface Waves Considering Highly Anisotropic Surface Elasticity Constitutive Relations
PublicationWithin the framework of highly anisotropic surface elasticity model we discuss the propagation of new type of surface waves that are anti-plane surface waves. By the highly anisotropic surface elasticity model we mean the model with a surface strain energy density which depends on incomplete set of second derivatives of displacements. From the physical point of view this model corresponds to a coating made of a family of parallel...
-
A Parallel Corpus-Based Approach to the Crime Event Extraction for Low-Resource Languages
PublicationThese days, a lot of crime-related events take place all over the world. Most of them are reported in news portals and social media. Crime-related event extraction from the published texts can allow monitoring, analysis, and comparison of police or criminal activities in different countries or regions. Existing approaches to event extraction mainly suggest processing texts in English, French, Chinese, and some other resource-rich...
-
Performance evaluation of the parallel object tracking algorithm employing the particle filter
PublicationAn algorithm based on particle filters is employed to track moving objects in video streams from fixed and non-fixed cameras. Particle weighting is based on color histograms computed in the iHLS color space. Particle computations are parallelized with CUDA framework. The algorithm was tested on various GPU devices: a desktop GPU card, a mobile chipset and two embedded GPU platforms. The processing speed depending on the number...
-
Parallel implementation of a Sailing Assistance Application in a Cloud Environment
PublicationSailboat weather routing is a highly complex problem in terms of both the computational time and memory. The reason for this is a large search resulting in a multitude of possible routes and a variety of user preferences. Analysing all possible routes is only feasible for small sailing regions, low-resolution maps, or sailboat movements on a grid. Therefore, various heuristic approaches are often applied, which can find solutions...