Search results for: hpc

Use of ICT infrastructure for teaching HPC

Publication

- Year 2019

In this paper we look at modern ICT infrastructure as well as curriculum used for conducting a contemporary course on high performance computing taught over several years at the Faculty of Electronics Telecommunications and Informatics, Gdansk University of Technology, Poland. We describe the infrastructure in the context of teaching parallel programming at the cluster level using MPI, node level using OpenMP and CUDA. We present...

Full text to download in external service

Using Redis supported by NVRAM in HPC applications

Publication

A. Malinowski

- Computer Science - Year 2017

Nowadays, the efficiency of storage systems is a bottleneck in many modern HPC clusters. High performance in the traditional approach – processing using files – is often difficult to obtain because of a model’s complexity and its read/write patterns. An alternative approach is to apply a key-value database, which usually has low latency and scales well. On the other hand, many key-value stores suffer from a limitation of memory...

Full text available to download

BalticLSC: A low-code HPC platform for small and medium research teams

Publication

R. Roszczyk
M. Wdowiak
M. Smialek
K. Rybinski
K. Marek

- Year 2021

Full text to download in external service

Network-aware Data Prefetching Optimization of Computations in a Heterogeneous HPC Framework

Publication

P. Rościszewski

- International Journal of Computer Networks & Communications (IJCNC) - Year 2014

Rapid development of diverse computer architectures and hardware accelerators caused that designing parallel systems faces new problems resulting from their heterogeneity. Our implementation of a parallel system called KernelHive allows to efficiently run applications in a heterogeneous environment consisting of multiple collections of nodes with different types of computing devices. The execution engine of the system is open for...

Full text available to download

MERPSYS: An environment for simulation of parallel application execution on large scale HPC systems

Publication

- SIMULATION MODELLING PRACTICE AND THEORY - Year 2017

In this paper we present a new environment called MERPSYS that allows simulation of parallel application execution time on cluster-based systems. The environment offers a modeling application using the Java language extended with methods representing message passing type communication routines. It also offers a graphical interface for building a system model that incorporates various hardware components such as CPUs, GPUs, interconnects...

Full text available to download

A Regular Expression Matching Application with Configurable Data Intensity for Testing Heterogeneous HPC Systems

Publication

- Year 2014

Modern High Performance Computing (HPC) systems are becoming increasingly heterogeneous in terms of utilized hardware, as well as software solutions. The problems, that we wish to efficiently solve using those systems have different complexity, not only considering magnitude, but also the type of complexity: computation, data or communication intensity. Developing new mechanisms for dealing with those complexities or choosing an...

Extended investigation of performance-energy trade-offs under power capping in HPC environments

Publication

- Year 2019

—In the paper we present investigation of performance-energy trade-offs under power capping using modern processors. The results are presented for systems targeted at both server and client markets and were collected from Intel Xeon E5 and Intel Xeon Phi server processors as well as from desktop and mobile Intel Core i7 processors. The results, when using power capping, show that we can find various interesting combinations of...

Pre‐exascale HPC approaches for molecular dynamics simulations. Covid‐19 research: A use case

Publication

M. Wieczór
V. Genna
J. Aranda
R. M. Badia
J. L. Gelpí
V. Gapsys
B. L. de Groot
E. Lindahl
M. Municoy
A. Hospital
M. Orozco

- Wiley Interdisciplinary Reviews-Computational Molecular Science - Year 2023

Exascale computing has been a dream for ages and is close to becoming a reality that will impact how molecular simulations are being performed, as well as the quantity and quality of the information derived for them. We review how the biomolecular simulations field is anticipating these new architectures, making emphasis on recent work from groups in the BioExcel Center of Excellence for High Performance Computing. We exemplified...

Full text available to download

Dynamic Data Management Among Multiple Databases for Optimization of Parallel Computations in Heterogeneous HPC Systems

Publication

P. Rościszewski

- Year 2014

Rapid development of diverse computer architectures and hardware accelerators caused that designing parallel systems faces new problems resulting from their heterogeneity. Our implementation of a parallel system called KernelHive allows to efficiently run applications in a heterogeneous environment consisting of multiple collections of nodes with different types of computing devices. The execution engine of the system is open for...

Full text to download in external service

Simulation of Parallel Applications on Large-scale Distributed Systems

Publication

- Year 2014

This chapter has a form of a review article in the field of simulating High-Performance Computing systems. We justify the need for a new versatile simulator considering heterogeneity, energy efficiency and reliability of HPC systems. We sketch the problems that need to be solved by such simulator and rationalize using discrete-event simulation for this purpose. Based on a review of existing discrete-event HPC simulation solutions...

Higher platelet counts correlate to tumour progression and can be induced by intratumoural stroma in non-metastatic breast carcinomas

Publication

N. Bednarz-Knoll
M. Popęda
T. Kryczka
B. Kozakiewicz
K. Pogoda
J. Szade
A. Markiewicz
D. Strzemecki
L. Kalinowski
J. Skokowski... and 2 others

- BRITISH JOURNAL OF CANCER - Year 2022

Background Platelets support tumour progression. However, their prognostic significance and relation to circulating tumour cells (CTCs) in operable breast cancer (BrCa) are still scarcely known and, thus, merit further investigation. Methods Preoperative platelet counts (PCs) were compared with clinical data, CTCs, 65 serum cytokines and 770 immune-related transcripts obtained using the NanoString technology. Results High normal...

Full text to download in external service

Energy-Aware High-Performance Computing: Survey of State-of-the-Art Tools, Techniques, and Environments

Publication

- Scientific Programming - Year 2019

The paper presents state of the art of energy-aware high-performance computing (HPC), in particular identification and classification of approaches by system and device types, optimization metrics, and energy/power control methods. System types include single device, clusters, grids, and clouds while considered device types include CPUs, GPUs, multiprocessor, and hybrid systems. Optimization goals include various combinations of...

Full text available to download

Energy-Aware Scheduling for High-Performance Computing Systems: A Survey

Publication

- ENERGIES - Year 2023

High-performance computing (HPC), according to its name, is traditionally oriented toward performance, especially the execution time and scalability of the computations. However, due to the high cost and environmental issues, energy consumption has already become a very important factor that needs to be considered. The paper presents a survey of energy-aware scheduling methods used in a modern HPC environment, starting with the...

Full text available to download

Three dimensional simulations of FRC beams and panels with explicit definition of fibres-concrete interaction

Publication

I. Marzec
J. Suchorzewski
J. Bobiński

- ENGINEERING STRUCTURES - Year 2024

High performance concrete (HPC) is a quite novel material which has been rapidly developed in the last few decades. It exhibits superior mechanical properties and durability comparing to normal concrete. HPC can achieve also superior tensile performance if strong fibres (steel or carbon) are implemented in the matrix. Thus, there exist the unabated interest in studying how the addition of different types of fibres modifies the...

Full text to download in external service

Efficiency Evaluation of High Performance Computing Systems Using Data Envelopment Analysis

Publication

P. Kaczmarek

- Year 2014

The paper presents an evaluation method of high performance computing (HPC) systems using multicriteria efficiency analysis. The Data Envelopment Analysis approach was applied and adapted to the specifics of HPC, which enabled us to compare relative efficiency of systems considering simultaneously multiple parameters. The analysis is based on the TOP500 list of world largest supercomputers and their parameters such as: the number...

Task Allocation and Scalability Evaluation for Real-Time Multimedia Processing in a Cluster Envirinment

Publication

- LECTURE NOTES IN COMPUTER SCIENCE - Year 2015

An allocation algorithm for stream processing tasks is proposed (Modified best Fit Descendent, MBFD). A comparison with another solution (BFD) is provided. Tests of the algorithms in an HPC environment are descrobed and the results are presented. A proper scalability metric is proposed and used for the evaluation of the allocation algorithm.

Full text to download in external service

Optimization of hybrid parallel application execution in heterogeneous high performance computing systems considering execution time and power consumption

Publication

P. Rościszewski

- Year 2018

Many important computational problems require utilization of high performance computing (HPC) systems that consist of multi-level structures combining higher and higher numbers of devices with various characteristics. Utilizing full power of such systems requires programming parallel applications that are hybrid in two meanings: they can utilize parallelism on multiple levels at the same time and combine together programming interfaces...

Full text to download in external service

Process arrival pattern aware algorithms for acceleration of scatter and gather operations

Publication

J. Proficz

- Cluster Computing-The Journal of Networks Software Tools and Applications - Year 2020

Imbalanced process arrival patterns (PAPs) are ubiquitous in many parallel and distributed systems, especially in HPC ones. The collective operations, e.g. in MPI, are designed for equal process arrival times (PATs), and are not optimized for deviations in their appearance. We propose eight new PAP-aware algorithms for the scatter and gather operations. They are binomial or linear tree adaptations introducing additional process...

Full text available to download

Dynamic GPU power capping with online performance tracing for energy efficient GPU computing using DEPO tool

Publication

- Future Generation Computer Systems-The International Journal of Grid Computing-Theory Methods and Applications - Year 2023

GPU accelerators have become essential to the recent advance in computational power of high- performance computing (HPC) systems. Current HPC systems’ reaching an approximately 20–30 mega-watt power demand has resulted in increasing CO2 emissions, energy costs and necessitate increasingly complex cooling systems. This is a very real challenge. To address this, new mechanisms of software power control could be employed. In this...

Full text to download in external service

All-gather Algorithms Resilient to Imbalanced Process Arrival Patterns

Publication

J. Proficz

- ACM Transactions on Architecture and Code Optimization - Year 2021

Two novel algorithms for the all-gather operation resilient to imbalanced process arrival patterns (PATs) are presented. The first one, Background Disseminated Ring (BDR), is based on the regular parallel ring algorithm often supplied in MPI implementations and exploits an auxiliary background thread for early data exchange from faster processes to accelerate the performed all-gather operation. The other algorithm, Background Sorted...

Full text available to download

Superkomputery do wspomagania procesów gospodarczych ze szczególnym uwzględnieniem sektora bankowego

Publication

- Współczesna Gospodarka - Year 2014

W artykule omówiono wykorzystanie superkomputerów do wspomagania procesów gospodarczych ze szczególnym uwzględnieniem sektora bankowego. Odniesiono się do wybranych projektów wspierających rozwój gospodarczy w oparciu o superkomputery. W szczególności zaproponowano zastosowanie HPC do implementacji wybranych metod sztucznej inteligencji w bankowości, w tym oceny ryzyka wybranych przedsięwzięć. Zaproponowane podejście umożliwia...

Full text available to download

Modifiers for Medical Grade Polymeric Systems used in FDM 3D Printing - Short Review

Publication

- Journal of Scientific & Technical Research - Year 2019

FDM 3D printing could find an application in the wide range of biomedical applications. Unfortunately, the quantity of polymeric biomaterials suitable to processing into filaments is limited. The most frequently used biomaterials for medical constructs such as bone grafts, soft tissue scaffolds or another DDS include PCL, PLA, PVA, HPC, EVA copolymer, EC and TPUs. Various modifiers such as TCP, HA, TEC, MMC could be applicated...

Full text available to download

Long Distance Geographically Distributed InfiniBand Based Computing

Publication

K. Niedzielewski
M. Semeniuk
J. Skomiał
J. Proficz
P. Sumionka
B. Pliszka
M. Michalewicz

- Supercomputing Frontiers and Innovations - Year 2020

Collaboration between multiple computing centres, referred as federated computing is becom- ing important pillar of High Performance Computing (HPC) and will be one of its key components in the future. To test technical possibilities of future collaboration using 100 Gb optic fiber link (Connection was 900 km in length with 9 ms RTT time) we prepared two scenarios of operation. In the first one, Interdisciplinary Centre for Mathematical...

Full text available to download

Teaching High Performance Computing Using BeesyCluster and Relevant Usage Statistics

Publication

P. Czarnul

- Year 2014

The paper presents motivations and experiences from using the BeesyCluster middleware for teaching high performance computing at the Faculty of Electronics, Telecommunications and Informatics, Gdansk University of Technology. Features of BeesyCluster well suited for conducting courses are discussed including: easy-to-use WWW interface for application development and running hiding queuing systems, publishing applications as services...

Full text to download in external service

An experimental study of self-sensing concrete enhanced with multi-wall carbon nanotubes in wedge splitting test and DIC

Publication

J. Suchorzewski
M. Prieto
U. Mueller

- CONSTRUCTION AND BUILDING MATERIALS - Year 2020

Concrete is the worldwide most utilized construction material because of its very good performance, forming ability, long-term durability, and low costs. Concrete is a brittle material prone to cracking. Extensive cracking may impact durability and performance over time considerably. The addition of a small amount of carbon nanotubes (CNT) increases the concrete’s overall electrical conductivity, enabling internal structure...

Full text to download in external service

Improving Clairvoyant: reduction algorithm resilient to imbalanced process arrival patterns

Publication

- JOURNAL OF SUPERCOMPUTING - Year 2021

The Clairvoyant algorithm proposed in “A novel MPI reduction algorithm resilient to imbalances in process arrival times” was analyzed, commented and improved. The comments concern handling certain edge cases in the original pseudocode and description, i.e., adding another state of a process, improved cache friendliness more precise complexity estimations and some other issues improving the robustness of the algorithm implementation....

Full text available to download

Distributed NVRAM Cache – Optimization and Evaluation with Power of Adjacency Matrix

Publication

- Year 2017

In this paper we build on our previously proposed MPI I/O NVRAM distributed cache for high performance computing. In each cluster node it incorporates NVRAMs which are used as an intermediate cache layer between an application and a file for fast read/write operations supported through wrappers of MPI I/O functions. In this paper we propose optimizations of the solution including handling of write requests with a synchronous mode,...

Full text to download in external service

Full scale CFD seakeeping simulations for case study ship redesigned from V-shaped bulbous bow to X-bow hull form

Publication

- APPLIED OCEAN RESEARCH - Year 2019

Increasing propulsion efficiency, safety, comfort and operability are of the great importance, especially for small ships operating on windy sites like the North Sea and the Baltic Sea. Seakeeping performance of ships and offshore structures can be analysed by different methods and the one that is becoming increasingly important is CFD RANS. The recent development of simulation techniques together with rising HPC accessibility...

Full text to download in external service

A Parallel MPI I/O Solution Supported by Byte-addressable Non-volatile RAM Distributed Cache

Publication

A. Malinowski
P. Czarnul
P. Dorożyński
K. Czuryło
Ł. Dorau
M. Maciejewski
P. Skowron

- Annals of Computer Science and Information Systems - Year 2016

While many scientiﬁc, large-scale applications are data-intensive, fast and efﬁcient I/O operations have become of key importance for HPC environments. We propose an MPI I/O extension based on in-system distributed cache with data located in Non-volatile Random Access Memory (NVRAM) available in each cluster node. The presented architecture makes effective use of NVRAM properties such as persistence and byte-level access behind...

Full text available to download

Parallel Programming for Modern High Performance Computing Systems

Publication

P. Czarnul

- Year 2018

In view of the growing presence and popularity of multicore and manycore processors, accelerators, and coprocessors, as well as clusters using such computing devices, the development of efficient parallel applications has become a key challenge to be able to exploit the performance of such systems. This book covers the scope of parallel programming for modern high performance computing systems. It first discusses selected and...

Full text to download in external service

BeesyCluster: Architektura systemu dostepu do sieci klastrów przez WWW/Web Services.

Publication

- Year 2004

Niniejsza praca prezentuje system BeesyCluster, który integruje rozproszone klastry poprzez łatwy w użyciu interfejs WWW oraz Web Services. W wersji pilotowej system uruchomiony zostanie w charakterze portalu dostępowego do klastrów gdańskiej sieci TASK wykorzystując 128-procesorowy klaster galera oraz 256-procesorowy 64-bitowy klaster holk jak również laboratoria badawcze Wydziału ETI Politechniki Gdańskiej. System, oparty o technologię...

Advanced Potential Energy Surfaces for Molecular Simulation

Publication

A. Albaugh
H. Boateng
R. Bradshaw
O. Demerdash
J. Dziedzic
Y. Mao
D. Margul
J. Swails
Q. Zeng
D. Case... and 10 others

- JOURNAL OF PHYSICAL CHEMISTRY B - Year 2016

Advanced potential energy surfaces are defined as theoretical models that explicitly include many-body effects that transcend the standard fixed-charge, pairwise-additive paradigm typically used in molecular simulation. However, several factors relating to their software implementation have precluded their widespread use in condensed-phase simulations: the computational cost of the theoretical models, a paucity of approximate models...

Full text available to download

DEPO: A dynamic energy‐performance optimizer tool for automatic power capping for energy efficient high‐performance computing

Publication

- SOFTWARE-PRACTICE & EXPERIENCE - Year 2022

In the article we propose an automatic power capping software tool DEPO that allows one to perform runtime optimization of performance and energy related metrics. For an assumed application model with an initialization phase followed by a running phase with uniform compute and memory intensity, the tool performs automatic tuning engaging one of the two exploration algorithms—linear search (LS) and golden section search (GSS), finds...

Full text to download in external service

Survey of Methodologies, Approaches, and Challenges in Parallel Programming Using High-Performance Computing Systems

Publication

- Scientific Programming - Year 2020

This paper provides a review of contemporary methodologies and APIs for parallel programming, with representative technologies selected in terms of target system type (shared memory, distributed, and hybrid), communication patterns (one-sided and two-sided), and programming abstraction level. We analyze representatives in terms of many aspects including programming model, languages, supported platforms, license, optimization goals,...

Full text available to download

Investigation into MPI All-Reduce Performance in a Distributed Cluster with Consideration of Imbalanced Process Arrival Patterns

Publication

J. Proficz
P. Sumionka
J. Skomiał
M. Semeniuk
K. Niedzielewski
M. Walczak

- Advances in Intelligent Systems and Computing - Year 2020

The paper presents an evaluation of all-reduce collective MPI algorithms for an environment based on a geographically-distributed compute cluster. The testbed was split into two sites: CI TASK in Gdansk University of Technology and ICM in University of Warsaw, located about 300 km from each other, both connected by a fast optical fiber Ethernet-based 100 Gbps network (900 km part of the PIONIER backbone). Each site hosted a set...

Full text available to download

Towards Scalable Simulation of Federated Learning

Publication

- Communications in Computer and Information Science - Year 2021

Federated learning (FL) allows to train models on decentralized data while maintaining data privacy, which unlocks the availability of large and diverse datasets for many practical applications. The ongoing development of aggregation algorithms, distribution architectures and software implementations aims for enabling federated setups employing thousands of distributed devices, selected from millions. Since the availability of...

Full text to download in external service

Search

Filters

Catalog

Category

Year

Options