Publications
total: 286
Catalog Publications
Year 2024
-
Dataset Characteristics and Their Impact on Offline Policy Learning of Contextual Multi-Armed Bandits
PublicationThe Contextual Multi-Armed Bandits (CMAB) framework is pivotal for learning to make decisions. However, due to challenges in deploying online algorithms, there is a shift towards offline policy learning, which relies on pre-existing datasets. This study examines the relationship between the quality of these datasets and the performance of offline policy learning algorithms, specifically, Neural Greedy and NeuraLCB. Our results...
-
Investigation of Performance and Energy Consumption of Tokenization Algorithms on Multi-core CPUs Under Power Capping
PublicationIn this paper we investigate performance-energy optimization of tokenizer algorithm training using power capping. We focus on parallel, multi-threaded implementations of Byte Pair Encoding (BPE), Unigram, WordPiece, and WordLevel run on two systems with different multi-core CPUs: Intel Xeon 6130 and desktop Intel i7-13700K. We analyze execution times and energy consumption for various numbers of threads and various power caps and...
-
Performance and Energy Aware Training of a Deep Neural Network in a Multi-GPU Environment with Power Capping
PublicationIn this paper we demonstrate that it is possible to obtain considerable improvement of performance and energy aware metrics for training of deep neural networks using a modern parallel multi-GPU system, by enforcing selected, non-default power caps on the GPUs. We measure the power and energy consumption of the whole node using a professional, certified hardware power meter. For a high performance workstation with 8 GPUs, we were...
-
Wyzwania współpracy z robotem jako członkiem zespołu [Challenges of collaborating with a robot as a team member]
Publication– Purpose: To assess beliefs about collaborating with robots in a team, and the concerns and expectations associated with "hiring" robots. – Research methodology: A quantitative study carried out using a survey questionnaire among 593 IT specialists. – Findings: Robots are still treated mainly as tools rather than as actors participating in the team's work. The belief that the presence of a robot in the team will not eliminate...
Year 2023
-
A multithreaded CUDA and OpenMP based power‐aware programming framework for multi‐node GPU systems
PublicationIn the paper, we have proposed a framework that allows programming a parallel application for a multi-node system, with one or more GPUs per node, using an OpenMP+extended CUDA API. OpenMP is used for launching threads responsible for management of particular GPUs, and extended CUDA calls make it possible to manage CUDA objects and data and to launch kernels. The framework hides inter-node MPI communication from the programmer who can benefit from...
-
Dataset Related Experimental Investigation of Chess Position Evaluation Using a Deep Neural Network
PublicationThe idea of training Artificial Neural Networks to evaluate chess positions has been widely explored in the last ten years. In this paper we investigated dataset impact on chess position evaluation. We created two datasets with over 1.6 million unique chess positions each. In one of those we also included randomly generated positions resulting from consideration of potentially unpredictable chess moves. Each position was evaluated...
-
Dynamic GPU power capping with online performance tracing for energy efficient GPU computing using DEPO tool
PublicationGPU accelerators have become essential to the recent advance in computational power of high-performance computing (HPC) systems. Current HPC systems’ reaching an approximately 20–30 megawatt power demand has resulted in increasing CO2 emissions and energy costs, and necessitates increasingly complex cooling systems. This is a very real challenge. To address this, new mechanisms of software power control could be employed. In this...
-
Efficient parallel implementation of crowd simulation using a hybrid CPU+GPU high performance computing system
PublicationIn the paper we present a modern efficient parallel OpenMP+CUDA implementation of crowd simulation for hybrid CPU+GPU systems and demonstrate its higher performance over CPU-only and GPU-only implementations for several problem sizes including 10 000, 50 000, 100 000, 500 000 and 1 000 000 agents. We show how performance varies for various tile sizes and what CPU–GPU load balancing settings shall be preferred for various domain...
-
Energy-Aware Scheduling for High-Performance Computing Systems: A Survey
PublicationHigh-performance computing (HPC), according to its name, is traditionally oriented toward performance, especially the execution time and scalability of the computations. However, due to the high cost and environmental issues, energy consumption has already become a very important factor that needs to be considered. The paper presents a survey of energy-aware scheduling methods used in a modern HPC environment, starting with the...
-
General Provisioning Strategy for Local Specialized Cloud Computing Environments
PublicationThe well-known management strategies in cloud computing based on SLA requirements are considered. A deterministic parallel provisioning algorithm has been prepared and used to show its behavior for three different requirements: load balancing, consolidation, and fault tolerance. The impact of these strategies on the total execution time of different sets of services is analyzed for randomly chosen sets of data. This makes it possible...
-
Levels of creativity in architectural education
PublicationArchitectural design combines engineering science and art, thus stimulating creativity is a challenge in the didactic process. There are various levels of creativity that can be attained through architectural education. From idea to architecture (FITA) is a teaching method based on metaphorical and analogical reasoning that was developed, implemented and tested during architectural design classes in the Faculty of Architecture...
-
Molecular mechanism and energetics of coupling between substrate binding and product release in the F1-ATPase catalytic cycle
PublicationF1-ATPase is a motor protein that couples the rotation of its rotary γ subunit with ATP synthesis or hydrolysis. Single-molecule experiments indicate that nucleotide binding and release events occur almost simultaneously during the synthesis cycle, allowing the energy gain due to spontaneous binding of ADP to one catalytic β subunit to be directly harnessed for driving the release of ATP from another rather than being dissipated...
-
Optimization of parallel implementation of UNRES package for coarse‐grained simulations to treat large proteins
PublicationWe report major algorithmic improvements of the UNRES package for physics-based coarse-grained simulations of proteins. These include (i) introduction of interaction lists to optimize computations, (ii) transforming the inertia matrix to a pentadiagonal form to reduce computing and memory requirements, (iii) removing explicit angles and dihedral angles from energy expressions and recoding the most time-consuming energy/force terms...
-
Performance assessment of OpenMP constructs and benchmarks using modern compilers and multi-core CPUs
PublicationConsidering ongoing developments of both modern CPUs, especially in the context of increasing numbers of cores, cache memory and architectures as well as compilers there is a constant need for benchmarking representative and frequently run workloads. The key metric is speed-up as the computational power of modern CPUs stems mainly from using multiple cores. In this paper, we show and discuss results from running codes such as:...
-
The congruence of mental models in entrepreneurial teams – implications for performance and satisfaction in teams operating in an emerging economy
PublicationPurpose – The paper aims to explore the relationship between the congruence of mental models held by the members of entrepreneurial teams operating in an emerging economy (Poland) and entrepreneurial outcomes (performance and satisfaction). Design/methodology/approach – The data obtained from 18 nascent and 20 established entrepreneurial teams was analysed to answer hypotheses. The research was quantitative and was conducted using...
-
The Idea of a Student Research Project as a Method of Preparing a Student for Professional and Scientific Work
PublicationIn the paper we present the idea and implementation of a student research project course within the master’s program at the Faculty of Electronics, Telecommunications and Informatics, Gdansk Tech. It aims at preparing students for performing research and scientific tasks in future professional work. We outline the evolution from group projects into research project and the current deployment of both at bachelor’s and master’s levels...
-
UNRES-GPU for Physics-Based Coarse-Grained Simulations of Protein Systems at Biological Time- and Size-Scales
PublicationThe dynamics of the virus like particles (VLPs) corresponding to the GII.4 Houston, GII.2 SMV, and GI.1 Norwalk strains of human noroviruses (HuNoV) that cause gastroenteritis was investigated by means of long-time (about 30 μs in the laboratory timescale) molecular dynamics simulations with the coarse-grained UNRES force field. The main motion of VLP units turned out to be the bending at the junction between the P1 subdomain (that...
Year 2022
-
DEPO: A dynamic energy‐performance optimizer tool for automatic power capping for energy efficient high‐performance computing
PublicationIn the article we propose an automatic power capping software tool DEPO that allows one to perform runtime optimization of performance and energy related metrics. For an assumed application model with an initialization phase followed by a running phase with uniform compute and memory intensity, the tool performs automatic tuning engaging one of the two exploration algorithms—linear search (LS) and golden section search (GSS), finds...
-
Edu Inspiracje WZiE: Technologia XR w edukacji. Czy jesteśmy na nią gotowi? [Edu Inspirations WZiE: XR technology in education. Are we ready for it?]
Publication
-
Featured based CAVE software factory
PublicationIn the paper we convey the lessons learned along the path we have gone through several years since establishing a room-sized CAVE installation at our university, from craft manufacturing and ad-hoc software reuse of VR software products to the robust feature driven software product line (SPL) implementing the Product Line Engineering (PLE) factory paradigm. With that we can serve all our departments and other entities from the...
-
Food Classification from Images Using a Neural Network Based Approach with NVIDIA Volta and Pascal GPUs
PublicationIn the paper we investigate the problem of food classification from images, for the Food-101 dataset extended with 31 additional food classes from Polish cuisine. We adopted transfer learning and firstly measured training times for models such as MobileNet, MobileNetV2, ResNet50, ResNet50V2, ResNet101, ResNet101V2, InceptionV3, InceptionResNetV2, Xception, NasNetMobile and DenseNet, for systems with NVIDIA Tesla V100 (Volta) and...
-
GPU Power Capping for Energy-Performance Trade-Offs in Training of Deep Convolutional Neural Networks for Image Recognition
PublicationIn the paper we present performance-energy trade-off investigation of training Deep Convolutional Neural Networks for image recognition. Several representative and widely adopted network models, such as Alexnet, VGG-19, Inception V3, Inception V4, Resnet50 and Resnet152 were tested using systems with Nvidia Quadro RTX 6000 as well as Nvidia V100 GPUs. Using GPU power capping we found other than default configurations minimizing...
-
Investigation of Performance and Configuration of a Selected IoT System—Middleware Deployment Benchmarking and Recommendations
PublicationNowadays Internet of Things is gaining more and more focus all over the world. As a concept it gives many opportunities for applications for society and it is expected that the number of software services deployed in this area will still grow fast. Especially important in this context are properties connected with deployment such as portability, scalability and balance between software requirements and hardware capabilities. In...
-
Performance Assessment of Using Docker for Selected MPI Applications in a Parallel Environment Based on Commodity Hardware
PublicationIn the paper, we perform detailed performance analysis of three parallel MPI applications run in a parallel environment based on commodity hardware, using Docker and bare-metal configurations. The testbed applications are representative of the most typical parallel processing paradigms: master–slave, geometric Single Program Multiple Data (SPMD) as well as divide-and-conquer and feature characteristic computational and communication...
Year 2021
-
Assessment of OpenMP Master–Slave Implementations for Selected Irregular Parallel Applications
PublicationThe paper investigates various implementations of a master–slave paradigm using the popular OpenMP API and relative performance of the former using modern multi-core workstation CPUs. It is assumed that a master partitions available input into a batch of predefined number of data chunks which are then processed in parallel by a set of slaves and the procedure is repeated until all input data has been processed. The paper experimentally...
-
Benchmarking Scalability and Security Configuration Impact for A Distributed Sensors-Server IOT Use Case
PublicationThe Internet of Things has been getting more and more attention and has found numerous practical applications. Especially important in this context are performance, security and the ability to cope with failures, and finding a good trade-off between them is crucial. In this article we present results of practical tests with multiple clients representing sensors sending notifications to an IoT middleware – DeviceHive. We investigate performance...
-
Human awareness versus Autonomous Vehicles view: comparison of reaction times during emergencies
PublicationHuman safety is one of the most critical factors when a new technology is introduced to the everyday use. It was no different in the case of Autonomous Vehicles (AV), designed to replace generally available Conventional Vehicles (CV) in the future. AV rules, from the start, focus on guaranteeing safety for passengers and other road users, and these assumptions usually work during normal traffic conditions. However, there is still...
-
Improving Clairvoyant: reduction algorithm resilient to imbalanced process arrival patterns
PublicationThe Clairvoyant algorithm proposed in “A novel MPI reduction algorithm resilient to imbalances in process arrival times” was analyzed, commented on and improved. The comments concern handling certain edge cases in the original pseudocode and description, i.e., adding another state of a process, improved cache friendliness, more precise complexity estimations and some other issues improving the robustness of the algorithm implementation....
-
Interval Edge Coloring of Bipartite Graphs with Small Vertex Degrees
PublicationAn edge coloring of a graph G is called interval edge coloring if for each v ∈ V(G) the set of colors on edges incident to v forms an interval of integers. A graph G is interval colorable if there is an interval coloring of G. For an interval colorable graph G, by the interval chromatic index of G, denoted by χ'_i(G), we mean the smallest number k such that G is interval colorable with k colors. A bipartite graph G is called (α,β)-biregular...
-
Just look at to open it up: A biometric verification facility for password autofill to protect electronic documents
PublicationElectronic documents constitute specific units of information, and protecting them against unauthorized access is a challenging task. This is because a password protected document may be stolen from its host computer or intercepted while on transfer and exposed to unlimited offline attacks. The key issue is, therefore, making document passwords hard to crack. We propose to augment a common text password authentication interface...
-
O neutralności, prezydencie, portalach społecznościowych i edukacji [On neutrality, the president, social networking sites and education]
PublicationThe article concerns net neutrality, discussed in the context of the blocking of the US president's social media accounts. Drawing on various meanings of neutrality and on opinions gathered among computer scientists, the authors formulate recommendations on the place of neutrality in the computer science education of students.
-
Optimization of Data Assignment for Parallel Processing in a Hybrid Heterogeneous Environment Using Integer Linear Programming
PublicationIn the paper we investigate a practical approach to application of integer linear programming for optimization of data assignment to compute units in a multi-level heterogeneous environment with various compute devices, including CPUs, GPUs and Intel Xeon Phis. The model considers an application that processes a large number of data chunks in parallel on various compute units and takes into account computations, communication including...
-
Zasady efektuacji w działaniach początkujących przedsiębiorców [Effectuation principles in the activities of nascent entrepreneurs]
PublicationThe article concerns the effectual, dynamic model of entrepreneurship, which responds to the demands of the changing, uncertain business environment in which contemporary entrepreneurs operate. In earlier studies covering the application of the five principles of effectuation in running companies, various authors referred mainly to mature entrepreneurs, characterised by experience in founding and running their own...
-
Znaczenie i dynamika efektuacji w działaniach zespołów projektowych IT [The significance and dynamics of effectuation in the activities of IT project teams]
PublicationPurpose: The aim of the study is to examine to what extent implementing effectuation principles benefits the effectiveness of project teams working on innovative solutions and whether it affects their satisfaction. Research methodology: Quantitative research was carried out in student IT project teams developing solutions commissioned by external stakeholders. Questionnaires developed by the authors were used to measure the variables....
Year 2020
-
Auto-tuning methodology for configuration and application parameters of hybrid CPU + GPU parallel systems based on expert knowledge
PublicationAuto-tuning of configuration and application parameters makes it possible to achieve significant performance gains in many contemporary compute-intensive applications. Feasible search spaces of parameters tend to become too big to allow for exhaustive search in the auto-tuning process. Expert knowledge about the utilized computing systems becomes useful to prune the search space and new methodologies are needed in the face of emerging heterogeneous...
-
Benchmarking Deep Neural Network Training Using Multi- and Many-Core Processors
PublicationIn the paper we provide thorough benchmarking of deep neural network (DNN) training on modern multi- and many-core Intel processors in order to assess performance differences for various deep learning as well as parallel computing parameters. We present performance of DNN training for Alexnet, Googlenet, Googlenet_v2 as well as Resnet_50 for various engines used by the deep learning framework, for various batch sizes. Furthermore,...
-
Effect of chemical structure on complexation efficiency of aromatic drugs with cyclodextrins: The example of dibenzazepine derivatives
PublicationIt is widely believed that the hydrophobic effect governs the binding of guest molecules to cyclodextrins (CDs). However, it is also known that high hydrophobicity of guest molecules does not always translate to the formation of stable inclusion complexes with CDs. Indeed, a plethora of other factors can play a role in the efficiency of guest–CD interactions, rendering structure-based prediction of the complexation efficiency with...
-
Effective Collaboration of Entrepreneurial Teams—Implications for Entrepreneurial Education
PublicationIn the situation of a permanent change and increased competition, business ventures are more and more often undertaken not by individuals but by entrepreneurial teams. The main aim of this paper is to examine the team principles implemented by effective entrepreneurial teams and how they differ in nascent and established teams. We also focused on the relationship between the implementation of these rules by entrepreneurial team members...
-
Investigation of Parallel Data Processing Using Hybrid High Performance CPU + GPU Systems and CUDA Streams
PublicationThe paper investigates parallel data processing in a hybrid CPU+GPU(s) system using multiple CUDA streams for overlapping communication and computations. This is crucial for efficient processing of data, in particular incoming data stream processing that would naturally be forwarded using multiple CUDA streams to GPUs. Performance is evaluated for various compute time to host-device communication time ratios, numbers of CUDA streams,...
-
Performance/energy aware optimization of parallel applications on GPUs under power capping
PublicationIn the paper we present an approach and results from application of the modern power capping mechanism available for NVIDIA GPUs to benchmarks such as NAS Parallel Benchmarks BT, SP and LU as well as cublasgemm-benchmark, which are widely used for assessment of high performance computing systems’ performance. Specifically, depending on the benchmarks, various power cap configurations are best for desired trade-off of performance...
-
Projekty grupowe jako przygotowanie do współpracy w zespołach wirtualnych [Group projects as preparation for collaboration in virtual teams]
PublicationThe article concerns the application of project-based learning and team-based learning, presented using the example of the Group Project course taught in second-cycle (master's) studies in ICT (information and communication technologies). The assumptions and structure of the course are presented, focusing on the contextual nature of the projects carried out and on analogies to tasks carried out in companies in real...
-
Recent advances in traffic optimisation: systematic literature review of modern models, methods and algorithms
PublicationOver the past few decades, the increasing number of vehicles and imperfect road traffic management have been sources of congestion in cities and reasons for the deteriorating health of their inhabitants. With the help of computer simulations, transport engineers optimise and improve the capacity of city streets. However, with an enormous number of possible simulation types, it is difficult to grasp valuable, innovative solutions which...
-
Some Security Features of Selected IoT Platforms
PublicationIoT (Internet of Things) is certainly one of the leading current and future trends for processing in the current distributed world. It is changing our life and society. IoT allows new ubiquitous applications and processing, but, on the other hand, it introduces potentially serious security threats. Nowadays researchers in IoT areas should, without a doubt, consider and focus on security aspects. This paper is aimed at a high-level...
-
Survey of Methodologies, Approaches, and Challenges in Parallel Programming Using High-Performance Computing Systems
PublicationThis paper provides a review of contemporary methodologies and APIs for parallel programming, with representative technologies selected in terms of target system type (shared memory, distributed, and hybrid), communication patterns (one-sided and two-sided), and programming abstraction level. We analyze representatives in terms of many aspects including programming model, languages, supported platforms, license, optimization goals,...
-
Team Roles and Team Performance in Small Virtual Software Teams
PublicationThe article presents the results of research on the composition of team roles conducted in 24 student software teams. An adaptation of M. Belbin’s model by B. Kożusznik was used. The model of team balance according to Belbin and Haaf is presented and correlations between team balance and team performance are analysed. Team performance is measured at three levels: result, satisfaction and team climate. The selected constellation...
-
The impact of the AC922 Architecture on Performance of Deep Neural Network Training
PublicationPractical deep learning applications require more and more computing power. New computing architectures emerge, specifically designed for the artificial intelligence applications, including the IBM Power System AC922. In this paper we confront an AC922 (8335-GTG) server equipped with 4 NVIDIA Volta V100 GPUs with selected deep neural network training applications, including four convolutional and one recurrent model. We report...
-
The Structure of Entrepreneurial Team Members’ Competencies: Between Effectuation and Causation
PublicationA conscious shaping of entrepreneurial competence is a relevant element of entrepreneurial education. In order to recognize which of the competencies regarded as entrepreneurial are characteristic of members of entrepreneurial teams, which is crucial for the work of those teams, it is necessary to identify the structure of those competencies. This quantitative study was conducted with the use of a survey method and involved 111...
-
Zasada szalonego patchworku – efektuacja jako inspiracja do kształtowania relacji i postaw biznesowych [The crazy quilt principle: effectuation as inspiration for shaping business relationships and attitudes]
PublicationOne of the principles of the effectual approach to entrepreneurship is the crazy quilt principle, which concerns building relationships with the business environment. According to it, an entrepreneur should make use of contacts with people from that environment established even before the company was founded, and should also actively shape new relationships in the environment in which the company operates. Strong networking and knowing many people make it possible to draw inspiration...
-
Zasady efektuacji w realizacji projektów w rzeczywistości VUCA [Effectuation principles in carrying out projects in VUCA reality]
PublicationThe chapter concerns the principles of the effectual model of entrepreneurship as a response to the challenges related to the volatility, uncertainty, ambiguity and complexity of the environment (the so-called VUCA reality) in which projects in the digital economy are carried out. Against the background of theoretical and conceptual considerations, the results of empirical research conducted in IT project teams with varying levels of virtuality are presented. It was indicated...
Year 2019
-
Bilateral multi-issue negotiation of execution contexts by proactive document agents
PublicationA proactive document can react to its actual environment by autonomously selecting and performing actions integrated into its body and interact with its user. When migrating over a network of execution devices it may encounter diverse execution contexts, each one set up according to temporal characteristics of a receiving device and preferences of its owner. A concept to augment proactive documents with negotiation capability is...