Wyniki wyszukiwania dla: EXECUTION MANAGEMENT
-
Reliability Modeling for SPMD and DAC Applications in MERPSYS
PublikacjaIn this chapter we shall see a method for modeling of reliability of application execution in the MERPSYS system, and the results of reliability simulation of the same SPMD and DAC application which we have previously presented in this book. Unlike energy usage modeling and modeling of execution efficiency we could not compare the reliability model and simulation results to real experiment just because the experiment would last too...
-
Modeling and Simulation for Exploring Power/Time Trade-off of Parallel Deep Neural Network Training
PublikacjaIn the paper we tackle bi-objective execution time and power consumption optimization problem concerning execution of parallel applications. We propose using a discrete-event simulation environment for exploring this power/time trade-off in the form of a Pareto front. The solution is verified by a case study based on a real deep neural network training application for automatic speech recognition. A simulation lasting over 2 hours...
-
Optimization model of agile team’s cohesion
PublikacjaTeam’s cohesion is one of the most important factors of IT project execution effectiveness. Optimization of team’s cohesion gives the possibility of reducing the risk of project failure. It also allows to increase the teamwork efficiency and thus optimize time of tasks execution, increase the guarantee of maintaining the scope of the project and the chance of achieving a given level of products quality. This article presents determination...
-
Using Statistical Methods to Estimate The Worst Case Response Time of Network Software Running on Indeterministic Hardware Platforms
PublikacjaIn this paper we investigate whether the statistical Worst Case Execution Time (WCET) estimation methods devised for embedded platforms can be successfully applied to find the Worst Case Response Time (WCRT) of a network application running on a complex hardware platform such as a contemporary commercial off-the-shelf (COTS) system. Establishing easy-to-use timing validation techniques is crucial for real-time applications and...
-
KernelHive: a new workflow-based framework for multilevel high performance computing using clusters and workstations with CPUs and GPUs
PublikacjaThe paper presents a new open-source framework called KernelHive for multilevel parallelization of computations among various clusters, cluster nodes, and finally, among both CPUs and GPUs for a particular application. An application is modeled as an acyclic directed graph with a possibility to run nodes in parallel and automatic expansion of nodes (called node unrolling) depending on the number of computation units available....
-
PRE-FAILURE CONDITION AND THE METHOD OF REPAIRING GLUED TIMBER ROOF BEARS
PublikacjaDesign documentation of glued-laminated timber roof girders is in many cases very general, does not contain details and design solutions, and the contractors do not have experience in the implementation of this type of structure. Design and execution errors, including those related to glued laminated timber roof girders, not only affect the safety and durability of the structure of the facility in which they were built in, but...
-
Optimization of Execution Time under Power Consumption Constraints in a Heterogeneous Parallel System with GPUs and CPUs
PublikacjaThe paper proposes an approach for parallelization of computations across a collection of clusters with heterogeneous nodes with both GPUs and CPUs. The proposed system partitions input data into chunks and assigns to par- ticular devices for processing using OpenCL kernels defined by the user. The sys- tem is able to minimize the execution time of the application while maintaining the power consumption of the utilized GPUs and...
-
Investigation of Performance and Energy Consumption of Tokenization Algorithms on Multi-core CPUs Under Power Capping
PublikacjaIn this paper we investigate performance-energy optimization of tokenizer algorithm training using power capping. We focus on parallel, multi-threaded implementations of Byte Pair Encoding (BPE), Unigram, WordPiece, and WordLevel run on two systems with different multi-core CPUs: Intel Xeon 6130 and desktop Intel i7-13700K. We analyze execution times and energy consumption for various numbers of threads and various power caps and...
-
Performance and Power-Aware Modeling of MPI Applications for Cluster Computing
PublikacjaThe paper presents modeling of performance and power consumption when running parallel applications on modern cluster-based systems. The model includes basic so-called blocks representing either computations or communication. The latter includes both point-to-point and collective communication. Real measurements were performed using MPI applications and routines run on three different clusters with both Infiniband and Gigabit Ethernet...
-
Application of theory of semi-Markov processes to determining distribution of probabilistic process of marine accidents resulting from colission of ships
PublikacjaIn this paper is presented possible application of the theory of semi-Markov processes to elaborating an eight-state model of the process of occurrence of serviceability state and unserviceability states of sea-going ships making critical manoeuvres during their entering and leaving the ports. In the analysis it was taken into account that sea-going ships are in service for a very long time t (t → ∞). The model was elaborated to...
-
Implementation of Extended Kalman Filter with Optimized Execution Time for Sensorless Control of a PMSM Using ARM Cortex-M3 Microcontroller
PublikacjaThis paper addresses the implementation and optimization of an Extended Kalman Filter (EKF) for the Permanent Magnet Synchronous Motor (PMSM) sensorless control using an ARM Cortex-M3 microcontroller. A various optimization levels based on arithmetic calculation reduction was implemented in ARM Cortex-M3 microcontroller. The execution time of EKF estimator was reduced from 260.4 μs to 37.7 μs without loss of accuracy. To further...
-
Analyzing energy/performance trade-offs with power capping for parallel applications on modern multi and many core processors
PublikacjaIn the paper we present extensive results from analyzing energy/performance trade-offs with power capping observed on four different modern CPUs, for three different parallel applications such as 2D heat distribution, numerical integration and Fast Fourier Transform. The CPU tested represent both multi-core type CPUs such as Intel⃝R Xeon⃝R E5, desktop and mobile i7 as well as many-core Intel⃝R Xeon PhiTM x200 but also server, desktop...
-
Checkpointing of Parallel MPI Applications using MPI One-sided API with Support for Byte-addressable Non-volatile RAM
PublikacjaThe increasing size of computational clusters results in an increasing probability of failures, which in turn requires application checkpointing in order to survive those failures. Traditional checkpointing requires data to be copied from application memory into persistent storage medium, which increases application execution time as it is usually done in a separate step. In this paper we propose to use emerging byte-addressable...
-
Auto-tuning methodology for configuration and application parameters of hybrid CPU + GPU parallel systems based on expert knowledge
PublikacjaAuto-tuning of configuration and application param- eters allows to achieve significant performance gains in many contemporary compute-intensive applications. Feasible search spaces of parameters tend to become too big to allow for exhaustive search in the auto-tuning process. Expert knowledge about the utilized computing systems becomes useful to prune the search space and new methodologies are needed in the face of emerging heterogeneous...
-
Bilateral Multi-Issue Negotiation Between Active Documents and Execution Devices
PublikacjaMobile document-agents are often in conflict with execution devices when attempting to perform activities of the business process they implement, since preferences of device owners may change depending on their current location and the actual class of the device in use. The paper proposes a bilateral negotiation mechanism based on a simple bargaining game that can effectively resolve such conflicts without any third party support.
-
Performance Assessment of Using Docker for Selected MPI Applications in a Parallel Environment Based on Commodity Hardware
PublikacjaIn the paper, we perform detailed performance analysis of three parallel MPI applications run in a parallel environment based on commodity hardware, using Docker and bare-metal configurations. The testbed applications are representative of the most typical parallel processing paradigms: master–slave, geometric Single Program Multiple Data (SPMD) as well as divide-and-conquer and feature characteristic computational and communication...
-
Dynamic Compatibility Matching of Services for Distributed Workflow Execution
PublikacjaThe paper presents a concept and an implementation of dynamic learn-ing of compatibilities of services used in a workflow application. While services may have the same functionality, they may accept input and produce output in different formats. The proposed solution learns matching of outputs and inputs at runtime and uses this knowledge in subsequent runs of workflow applications. The presented solution was implemented in an...
-
Document Agents with the Intelligent Negotiations Capability
PublikacjaThe paper focus is on augmenting proactive document-agents with built -in intelligence to enable them to recognize execution context provided by devices visited durning the business process, and to reach collaboration agreement despite of their conflicting requirements. We propose a solution based on neural networks to improve simple multi-issue negotiation between the document and the device, practically with no excessive cost...
-
Analysis and Evaluation of the Working Cycle of the Diesel Engine
PublikacjaThe paper presents a proposal to apply a quantitative evaluation of the diesel engine with regard to the phenomena occurring during of a working cycle. The proposed procedure when analyzing test results from diesel engine is an attempt to transfer an engine activity evaluation methods in the operational time scale (exploit time), eg. in hours, to the micro-scale (dynamic time) relating only to the execution time of one (several)...
-
Review and comparison of smoothing algorithms for one-dimensional data noise reduction
PublikacjaThe paper considers the choice of parameters of smoothing algorithms for data denoising. The impact of the window size on smoothing accuracy was analyzed. The parameters of denoising filters were selected with respect to the meansquare error between the computed linear regression and the noisy signal. Finally, we have compared mean, median, SavitzkyGolay, Kalman and Gaussian filter algorithms for the data from the digital sensor....
-
Multi-agent large-scale parallel crowd simulation with NVRAM-based distributed cache
PublikacjaThis paper presents the architecture, main components and performance results for a parallel and modu-lar agent-based environment aimed at crowd simulation. The environment allows to simulate thousandsor more agents on maps of square kilometers or more, features a modular design and incorporates non-volatile RAM (NVRAM) with a fail-safe mode that can be activated to allow to continue computationsfrom a recently analyzed state in...
-
Complementary oriented allocation algorithm for cloud computing
PublikacjaNowadays cloud computing is one of the most popular processing models. More and more different kinds of workloads have been migrated to clouds. This trend obliges the community to design algorithms which could optimize the usage of cloud resources and be more effiient and effective. The paper proposes a new model of workload allocation which bases on the complementarity relation and analyzes it. An example of a case of use is shown...
-
From Sequential to Parallel Implementation of NLP Using the Actor Model
PublikacjaThe article focuses on presenting methods allowing easy parallelization of an existing, sequential Natural Language Processing (NLP) application within a multi-core system. The actor-based solution implemented with the Akka framework has been applied and compared to an application based on Task Parallel Library (TPL) and to the original sequential application. Architectures, data and control flows are described along with execution...
-
The parallel environment for endoscopic image analysis
PublikacjaThe jPVM-oriented environment to support high performance computing required for the Endoscopy Recommender System (ERS) is defined. SPMD model of image matching is considered and its two implementations are proposed: Lexicographical Searching Algorithm (LSA) and Gradient Serching Algorithm (GSA). Three classes of experiments are considered and the relative degree of similarity and execution time of each algorithm are analysed....
-
Executing Multiple Simulations in the MERPSYS Environment
PublikacjaThe chapter investigates the steps necessary to perform a simulation instance in the MERPSYS environment and discusses potential limitations in case when vast numbers of simulations are required. An extended architecture is proposed which includes a JMS-based simulation queue and multiple distributed simulators, overcoming the potential bottlenecks. The chapter introduces also methods for preparing suites of multiple simulations...
-
THE IPV4 TO IPV6 MIGRATION OF APPLICATIONS AND SERVICE
PublikacjaThis article presents the problems related to IPv4 to IPv6 migration of applications supporting network services. It summarizes the needs of executing such migration. It shows the basic problems of automating the migration process, having defined the basic terms, i.e.: a network service, a network application. It shows a sample implementation of the automation of the migration process between IP technologies for selected network...
-
Installation of CMC columns in overconsolidated clays
PublikacjaDuring rigid inclusions execution in the subsoil some monitoring data are registered by a drilling rig for quality control. The use of these data can improve the design procedure of soil improvement as they can provide some information about the position of bearing strata and the energy necessary to install the column in the ground. In this paper the parameters registered in a drilling machine are described, the installation...
-
Some Optimization Methods for Simulations in Volunteer and Grid Systems
PublikacjaIn this chapter, some optimization methods have been presented for improving performance of simulations in the volunteer and grid computing system called Comcute. Some issues related to the cloud computing can be solved by presented approaches as well as the Comcute platform can be used to simulate execution of expensive and energy consuming long-term tasks in the cloud environment. In particular, evolutionary algorithms as well...
-
AUTOMATED NEGOTIATIONS OVER COLLABORATION PROTOCOL AGREEMENTS
PublikacjaThe dissertation focuses on the augmentation of proactive document - agents with built-in intelligence to recognize execution context provided by devices visited during a business process, and to reach collaboration agreement despite conflicting requirements. The proposed solution, based on intelligent bargaining using neural networks to improve simple multi-issue negotiation between the document and thedevice, requires practically...
-
HILS for the Design of Three-Wheeled Mobile Platform Motion Surveillance System with a Use of Energy Performance Index
PublikacjaCurrent tendency in mechatronic design requires the use of comprehensive development of an environment, which gives the possibility to prototype, design, simulate and integrate with dedicated hardware. The paper discusses the Hardware-In-the-Loop Simulations (HILS) mechatronic technique, used during the design of the surveillance system based on energy performance index. The presented test configuration (physical controller – emulated...
-
Simulation of Parallel Applications on Large-scale Distributed Systems
PublikacjaThis chapter has a form of a review article in the field of simulating High-Performance Computing systems. We justify the need for a new versatile simulator considering heterogeneity, energy efficiency and reliability of HPC systems. We sketch the problems that need to be solved by such simulator and rationalize using discrete-event simulation for this purpose. Based on a review of existing discrete-event HPC simulation solutions...
-
Parallelization of large vector similarity computations in a hybrid CPU+GPU environment
PublikacjaThe paper presents design, implementation and tuning of a hybrid parallel OpenMP+CUDA code for computation of similarity between pairs of a large number of multidimensional vectors. The problem has a wide range of applications, and consequently its optimization is of high importance, especially on currently widespread hybrid CPU+GPU systems targeted in the paper. The following are presented and tested for computation of all vector...
-
Benchmarking Performance of a Hybrid Intel Xeon/Xeon Phi System for Parallel Computation of Similarity Measures Between Large Vectors
PublikacjaThe paper deals with parallelization of computing similarity measures between large vectors. Such computations are important components within many applications and consequently are of high importance. Rather than focusing on optimization of the algorithm itself, assuming specific measures, the paper assumes a general scheme for finding similarity measures for all pairs of vectors and investigates optimizations for scalability...
-
Zarządzanie zapasami dla praktyków
PublikacjaDążenie do redukcji zapasów jest naturalnym trendem wszystkich przedsiębiorstw dbających o dostęp do tzw. wolnej gotówki. 11 lat prowadzonych przeze mnie obserwacji w toku realizowanych kilkuset projektów wskazuje, że tylko ok 0,5% przedsiębiorstw (w danym momencie) nie zajmuje się pracą nad obniżeniem zapasów. Jak ta redukcja zapasów zazwyczaj się odbywa? Szef stawia cel swojemu zespołowi: „dziś rotacja naszego zapasu...
-
Observation value analysis - integral part of Bayesian diagnostics
PublikacjaDetailed subject of the research is to analyse the value of the observation, which is a part of preposterior analysis. For the presented network, the main objective was to determine, conducting of which of three tests is the most valuable from the perspective of determining possible need or possibility to omission expensive technical expertise. The main advantage of preposterior analysis is answering the question which of the considered...
-
Automatic system for optical parameters measurements of biological tissues
PublikacjaIn this paper a system allowing execution of automatic measurements of optical parameters of scattering materials in an efficient and accurate manner is proposed and described. The system is designed especially for measurements of biological tissues including phantoms, which closely imitate optical characteristics of real tissue. The system has modular construction and is based on the ISEL system, luminance and color meter and...
-
Application of Support Vector Machine for Determination of Impact of Traffic-Induced Vibrations on Buildings
PublikacjaThe aim of the article is to present an algorithm of Support Vector Machine created to forecast the impact of traffic-induced vibrations on residential buildings. The method is designed to classify the object into one of two classes. The classification into the first class means that there is no impact of vibrations on the building, while classification to the second class indicates the possible influence and suggests the execution...
-
DESIGN AND EXECUTION ERRORS AS A CAUSE OF DAMAGE TO ANTI- ELECTROSTATIC FLOORING
PublikacjaApart from technological lines, industrial floors are a key element in the scope of maintaining the continuity of work of both production plants and logistics centers. The constantly developing industry of industrial flooring includes both classic design and technological flooring solutions, as well as specialist solutions used in facilities where technological processes or storage require system protection against static electricity....
-
A graph coloring approach to scheduling of multiprocessor tasks on dedicated machines with availability constraints
PublikacjaWe address a generalization of the classical 1- and 2-processor unit execution time scheduling problem on dedicated machines. In our chromatic model of scheduling machines have non-simultaneous availability times and tasks have arbitrary release times and due dates. Also, the versatility of our approach makes it possible to generalize all known classical criteria of optimality. Under these stipulations we show that the problem...
-
Mobile satellite measurements on the Pomeranian Metropolitan Railway
PublikacjaThis paper presents the process of organization and execution of the mobile satellite measurements conducted on the tracks of the Pomeranian Metropolitan Railway before officially entry the line into operation. These measurements allowed for the accurate identification of the actual geometric shape of the line. In the paper the issues of both design and construction phase of this railway line has been described. Moreover, the geodetic...
-
Traffic Remapping Attacks in Ad Hoc Networks
PublikacjaAd hoc networks rely on the mutual cooperation of stations. As such, they are susceptible to selfish attacks that abuse network mechanisms. Class-based QoS provisioning mechanisms, such as the EDCA function of IEEE 802.11, are particularly prone to traffic remapping attacks, which may bring an attacker better QoS without exposing it to easy detection. Such attacks have been studied in wireless LANs, whereas their impact in multihop...
-
Augmenting digital documents with negotiation capability
PublikacjaActive digital documents are not only capable of performing various operations using their internal functionality and external services, accessible in the environment in which they operate, but can also migrate on their own over a network of mobile devices that provide dynamically changing execution contexts. They may imply conflicts between preferences of the active document and the device the former wishes to execute on. In the...
-
DEPO: A dynamic energy‐performance optimizer tool for automatic power capping for energy efficient high‐performance computing
PublikacjaIn the article we propose an automatic power capping software tool DEPO that allows one to perform runtime optimization of performance and energy related metrics. For an assumed application model with an initialization phase followed by a running phase with uniform compute and memory intensity, the tool performs automatic tuning engaging one of the two exploration algorithms—linear search (LS) and golden section search (GSS), finds...
-
Parallelization of video stream algorithms in kaskada platform
PublikacjaThe purpose of this work is to present different techniques of video stream algorithms parallelization provided by the Kaskada platform - a novel system working in a supercomputer environment designated for multimedia streams processing. Considered parallelization methods include frame-level concurrency, multithreading and pipeline processing. Execution performance was measured on four time-consuming image recognition algorithms,...
-
Face detection in image sequences using a portable thermal camera
PublikacjaFace detection is often a first step in quantitative analysis of face images. It is an important research area for visible images and recently also for thermography. Due to technological developments thermal cameras may be embedded into wearable devices to provide remote healthcare. In this paper, we compared three algorithms for face detection in thermal images by testing execution time, accuracy, symmetry ratio and false-positives....
-
Three solvers for MIMO noise radar clutter cancellation - a performance comparison
PublikacjaThe problem of canceling strong clutter echos in a MIMO noise radar is considered. Execution times of three algorithms is compared. The first solution is a standard Least Squares approach employing Cholesky decomposition of the transmitted signal sample autocorrelation matrix. The second approach is based on careful waveform design which guarantees that the signal sample autocorrelation matrix has Toeplitz structure. This enables...
-
APIS – Agent Platform for Integration of Services
PublikacjaThe paper presents an approach to creating the platform for development and evaluation of complex tasks execution algorithms. Proposed solution is based on an agent paradigm where independent peers can cooperate and negotiate in order to execute specified tasks which are defined only by the final output description. Tasks are realized by the means of services exposed by different peers. In case when there is no single service fulfilling...
-
Improving Re-rankCCP with Rules Quality Measures
PublikacjaRecommender Systems are software tools and techniques which aim at suggesting new items that may possibly be of interest to a user. Context-Aware Recommender Systems exploit contextual information to provide more adequate recommendations. In this paper we described a modification of an existing contextual post-filtering algorithm which uses rules-like user representation called Contextual Conditional Preferences. We extended the...
-
Modern Platform for Parallel Algorithms Testing: Java on Intel Xeon Phi
PublikacjaParallel algorithms are popular method of increasing system performance. Apart from showing their properties using asymptotic analysis, proof-of-concept implementation and practical experiments are often required. In order to speed up the development and provide simple and easily accessible testing environment that enables execution of reliable experiments, the paper proposes a platform with multi-core computational accelerator:...
-
Displacement piles - classification and methods for the calculation of bearing capacity.
PublikacjaDisplacement piles belong to a group of technologies whose main idea is to install or make a pile without extracting ground material. According to definition, contained in PN-EN:1997-1:2008, displacement piles should be considered as driven, pressed in using vibrators and made with the use of spread augers. The classification of piles used so far with regard to the technology of execution is modified. An additional element is the...