Publikacje
Filtry
wszystkich: 842
Katalog Publikacji
Rok 2017
-
Modeling and Simulation for Exploring Power/Time Trade-off of Parallel Deep Neural Network Training
PublikacjaIn the paper we tackle bi-objective execution time and power consumption optimization problem concerning execution of parallel applications. We propose using a discrete-event simulation environment for exploring this power/time trade-off in the form of a Pareto front. The solution is verified by a case study based on a real deep neural network training application for automatic speech recognition. A simulation lasting over 2 hours...
-
Multi-agent large-scale parallel crowd simulation
PublikacjaThis paper presents design, implementation and performance results of a new modular, parallel, agent-based and large scale crowd simulation environment. A parallel application, implemented with C and MPI, was implemented and run in this parallel environment for simulation and visualization of an evacuation scenario at Gdansk University of Technology, Poland and further in the area of districts of Gdansk. The application uses a...
-
Multidisciplinary Open System Transferring Knowledge for R2B Development
PublikacjaDespite many efforts, there is still a serious problem in transferring knowledge from research to business. The problem is especially visible in Poland – the cooperation of R2B is ineffective. We are trying to solve this problem using some IT support. The manuscript presents some solutions developed at the Gdańsk University of Technology. In particular, the platform called “MOST Knowledge” is deeply described. Its layer architecture,...
-
Optymalizacja strategii sieci inteligentnych agentów za pomocą programowania genetycznego w systemie rozproszonym realizującym paradygmat volunteer computing
PublikacjaDynamicznie rosnąca złożoność i wymagania w odniesieniu do rozproszonych systemów informatycznych utrudnia zarządzanie dostępnymi zasobami sprzętowymi i programistycznymi. Z tego powodu celem rozprawy jest opracowanie wielokryterialnej metody programowania genetycznego, która pozwala na optymalizację strategii zespołu inteligentnych agentów programistycznych w zakresie zarządzania systemem realizującym paradygmat volunteer computing....
-
Parallelization of Selected Algorithms on Multi-core CPUs, a Cluster and in a Hybrid CPU+Xeon Phi Environment
PublikacjaIn the paper we present parallel implementations as well as execution times and speed-ups of three different algorithms run in various environments such as on a workstation with multi-core CPUs and a cluster. The parallel codes, implementing the master-slave model in C+MPI, differ in computation to communication ratios. The considered problems include: a genetic algorithm with various ratios of master processing time to communication...
-
Path-based methods on categorical structures for conceptual representation of wikipedia articles
PublikacjaMachine learning algorithms applied to text categorization mostly employ the Bag of Words (BoW) representation to describe the content of the documents. This method has been successfully used in many applications, but it is known to have several limitations. One way of improving text representation is usage of Wikipedia as the lexical knowledge base – an approach that has already shown promising results in many research studies....
-
Performance evaluation of unified memory and dynamic parallelism for selected parallel CUDA applications
PublikacjaThe aim of this paper is to evaluate performance of new CUDA mechanisms—unified memory and dynamic parallelism for real parallel applications compared to standard CUDA API versions. In order to gain insight into performance of these mechanisms, we decided to implement three applications with control and data flow typical of SPMD, geometric SPMD and divide-and-conquer schemes, which were then used for tests and experiments. Specifically,...
-
Platforma IPMed jako elastyczne źródło danych dla medycznych zespołów naukowych
PublikacjaPrzedstawiono architekturę rozproszonej platformy IPMed, która umożliwia akwizycję i przechowywanie zanonimizowanych danych medycznych. Pokazano wyniki zastosowania platformy w obszarze badań z zakresu hemodynamiki układu krążenia. Uzyskane dane pozwoliły na weryfikację hipotezy zespołu badawczego i określenie reguł rekomendacji w z leczeniu udarów.
-
Practical Evaluation of Internet Systems' Security Mechanisms
PublikacjaA proposed Internet systems security layer with context-oriented security mechanisms reduces the risk associated with possible vulnerabilities. A metric of the system trust level is proposed, and then evaluated according to a university Internet system.
-
Qualitative evaluation of distributed clinical systems supporting research teams working on large-scale data
PublikacjaInthispaper,fivecontemporaryscalablesystemstosupportmedicalresearchteams are presented. Their functionalities extend from heterogeneous unstructured data acquisition through large-scale data storing, to on-the-fly analyzing by using robust methods. Such kinds of systems can be useful in the development of new medical procedures and recommendation rules for decision support systems. A short description of each of them is provided....
-
SERVICE ORIENTED SCENARIOS FOR SUPPORT ENTERPRISE PROJECT MANAGEMENT
PublikacjaIn the paper, a service oriented architecture is used to describe various scenarios of human-oriented enterprises. They can be modeled by BPMN to show basic information flow. We consider business scenarios to present another modeling solution called service orchestration or service mashup. That approach gives an opportunity for the right management through taking the best possible choice of services, or composition of services....
-
Smart Services supporting Drivers in Effective Cars Parking
PublikacjaThe paper shows how to design and implement intelligent, service oriented systems for management of free places in distributed car parks. The idea of SOA technology is widely used. It is assumed that a state of every car park is automatically determined by a special electronic infrastructure including IoT solutions. The information received from such an infrastructure is sent to the main management system, which processes it and,...
-
Spectral Clustering Wikipedia Keyword-Based search Results
PublikacjaThe paper summarizes our research in the area of unsupervised categorization of Wikipedia articles. As a practical result of our research, we present an application of spectral clustering algorithm used for grouping Wikipedia search results. The main contribution of the paper is a representation method for Wikipedia articles that has been based on combination of words and links and used for categoriation of search results in this...
-
Towards facts extraction from text in Polish language
PublikacjaNatural Language Processing (NLP) finds many usages in different fields of endeavor. Many tools exists allowing analysis of English language. For Polish language the situation is different as the language itself is more complicated. In this paper we show differences between NLP of Polish and English language. Existing solutions are presented and TEAMS software for facts extraction is described. The paper shows also evaluation of...
-
Uczenie maszynowe do samoorganizacji systemów rozproszonych w zastosowaniach gospodarczych
PublikacjaW pracy omówiono uczenie maszynowe do samoorganizacji systemów rozproszonych w zastosowaniach gospodarczych ze szczególnym uwzględnieniem sieci neuronowych do predykcji finansowych oraz szacowania ratingu przedsiębiorstw. Oprócz sieci neuronowych, istotną rolę w przygotowaniu i testowaniu informatycznych systemów finansowych może pełnić programowanie genetyczne. Z tego powodu omówiono uczenie maszynowe w aplikacjach konstruowanych...
-
Using Redis supported by NVRAM in HPC applications
PublikacjaNowadays, the efficiency of storage systems is a bottleneck in many modern HPC clusters. High performance in the traditional approach – processing using files – is often difficult to obtain because of a model’s complexity and its read/write patterns. An alternative approach is to apply a key-value database, which usually has low latency and scales well. On the other hand, many key-value stores suffer from a limitation of memory...
-
WordVenture - COOPERATIVE WordNet EDITOR Architecture for Lexical Semantic Acquisition
PublikacjaThis article presents architecture for acquiring lexical semantics in a collaborative approach paradigm. The system enables functionality for editing semantic networks in a wikipedia-like style. The core of the system is a user-friendly interface based on interactive graph navigation. It has been used for semantic network presentation, and brings simultaneously modification functionality.
Rok 2016
-
A Fail-Safe NVRAM Based Mechanism for Efficient Creation and Recovery of Data Copies in Parallel MPI Applications
PublikacjaThe paper presents a fail-safe NVRAM based mechanism for creation and recovery of data copies during parallel MPI application runtime. Specifically, we target a cluster environment in which each node has an NVRAM installed in it. Our previously developed extension to the MPI I/O API can take advantage of NVRAM regions in order to provide an NVRAM based cache like mechanism to significantly speed up I/O operations and allow to preload...
-
A Parallel MPI I/O Solution Supported by Byte-addressable Non-volatile RAM Distributed Cache
PublikacjaWhile many scientific, large-scale applications are data-intensive, fast and efficient I/O operations have become of key importance for HPC environments. We propose an MPI I/O extension based on in-system distributed cache with data located in Non-volatile Random Access Memory (NVRAM) available in each cluster node. The presented architecture makes effective use of NVRAM properties such as persistence and byte-level access behind...
-
Adaption of Open Science Paradigm at Gdańsk University of Technology
PublikacjaThe paper describes the basic principles standing behind Open Access and will try to dispel the doubts behind them. Further the solution adapted by the Gdańsk University of Technology is presented. Later on the systems that the University uses and means of their integration into a complex Open Science platform are described. Finally the procedures needed to publish a document in the repository alongside the University's policy...
-
Alternative Modeling of Volunteer Based Systems
PublikacjaThe canonical approach to model a flow of a voluntary based computing, as presented in Chapter 7 may tax the capacity of the simulator, especially when a more complex model of Voluntary Computing, proposed in [1], is used which would imply that, for typical heterogeneous groups of volunteers, several thousands of threads would have to be run inside the simulator, to mimic closely the behaviour of every volunteer. This chapter presents...
-
APIS – Agent Platform for Integration of Services
PublikacjaThe paper presents an approach to creating the platform for development and evaluation of complex tasks execution algorithms. Proposed solution is based on an agent paradigm where independent peers can cooperate and negotiate in order to execute specified tasks which are defined only by the final output description. Tasks are realized by the means of services exposed by different peers. In case when there is no single service fulfilling...
-
Application of Web-GIS and Cloud Computing to Automatic Satellite Image Correction
PublikacjaRadiometric calibration of satellite imagery requires coupling of atmospheric and topographic parameters, which constitutes serious computational problems in particular in complex geographical terrain. Successful application of topographic normalization algorithms for calibration purposes requires integration of several types of high-resolution geographic datasets and their processing in a common context. This paper presents the...
-
Automatic Detection of Nerves in Confocal Corneal Images with Orientation-Based Edge Merging
PublikacjaThe paper presents an algorithm for improving results of automatic nerve detections in confocal microscopy images of human corneal. The method is designed as a postprocessing step of regular detection. After the nerves are initially detected, the algorithms attempts to improve the results by filling unde-sired gaps between single nerves detections in order to correctly mark the entire nerve instead of only parts of it. This approach...
-
Automatic Discovery of IaaS Cloud Workload Types
PublikacjaThe paper presents an approach to automatic discovery of workloads types. We perform functional characteristics of the workloads executed in our cloud environment, that have been used to create model of the computations. To categorize the resources utilization we used K-means algorithm, that allow us automatically select six types of computations. We perform analysis of the discovered types against to typical computational benchmarks,...
-
Benchmarking Performance of a Hybrid Intel Xeon/Xeon Phi System for Parallel Computation of Similarity Measures Between Large Vectors
PublikacjaThe paper deals with parallelization of computing similarity measures between large vectors. Such computations are important components within many applications and consequently are of high importance. Rather than focusing on optimization of the algorithm itself, assuming specific measures, the paper assumes a general scheme for finding similarity measures for all pairs of vectors and investigates optimizations for scalability...
-
Category-Based Workload Modeling for Hardware Load Prediction in Heterogeneous IaaS Cloud
PublikacjaThe paper presents a method of hardware load prediction using workload models based on application categories and high-level characteristics. Application of the method to the problem of optimization of virtual machine scheduling in a heterogeneous Infrastructure as a Service (IaaS) computing cloud is described.
-
Checkpointing of Parallel MPI Applications using MPI One-sided API with Support for Byte-addressable Non-volatile RAM
PublikacjaThe increasing size of computational clusters results in an increasing probability of failures, which in turn requires application checkpointing in order to survive those failures. Traditional checkpointing requires data to be copied from application memory into persistent storage medium, which increases application execution time as it is usually done in a separate step. In this paper we propose to use emerging byte-addressable...
-
Clustering Context Items into User Trust Levels
PublikacjaAn innovative trust-based security model for Internet systems is proposed. The TCoRBAC model operates on user profiles built on the history of user with system interaction in conjunction with multi-dimensional context information. There is proposed a method of transforming the high number of possible context value variants into several user trust levels. The transformation implements Hierarchical Agglomerative Clustering strategy....
-
Completeness and Consistency of the System Requirement Specification
PublikacjaAlthough the System Requirement Specification, as a first formal and detailed document, is the base for the software project in classic software methodologies, there is a noticeable problem of assuring the completeness of this document. The lack of its completeness causes uncertainty of the project foundations. This was one of motivations for agile methodologies – if the SRS cannot be easily validated, if it can change in late project...
-
Considerations of Computational Efficiency in Volunteer and Cluster Computing
PublikacjaIn the paper we focus on analysis of performance and power consumption statistics for two modern environments used for computing – volunteer and cluster based systems. The former integrate computational power donated by volunteers from their own locations, often towards social oriented or targeted initiatives, be it of medical, mathematical or space nature. The latter is meant for high performance computing and is typically installed...
-
Depth Images Filtering In Distributed Streaming
PublikacjaIn this paper, we propose a distributed system for point cloud processing and transferring them via computer network regarding to effectiveness-related requirements. We discuss the comparison of point cloud filters focusing on their usage for streaming optimization. For the filtering step of the stream pipeline processing we evaluate four filters: Voxel Grid, Radial Outliner Remover, Statistical Outlier Removal and Pass Through....
-
DEPTH IMAGES FILTERING IN DISTRIBUTED STREAMING
PublikacjaIn this paper we discuss the comparison of point cloud filters focusing on their applicability for streaming optimization. For the filtering stage within a stream pipeline processing we evaluate three filters: Voxel Grid, Pass Through and Statistical Outlier Removal. For the filters we perform series of the tests aiming at evaluation of changes of point cloud size and transmitting frequency (various fps ratio). We propose a distributed...
-
Energy Consumption Modeling in SPMD and DAC Applications
PublikacjaIn this chapter, we show a study of energy consumption during execution of SPMD and DAC application – the same applications which time of execution we modeled in the previous two chapters. We measured an average power usage at a single node of the GALERA+ cluster during application execution and then we modeled the total energy consumption by the application. Next we simulated the applications using MERPSYS and we compared the...
-
Executing Multiple Simulations in the MERPSYS Environment
PublikacjaThe chapter investigates the steps necessary to perform a simulation instance in the MERPSYS environment and discusses potential limitations in case when vast numbers of simulations are required. An extended architecture is proposed which includes a JMS-based simulation queue and multiple distributed simulators, overcoming the potential bottlenecks. The chapter introduces also methods for preparing suites of multiple simulations...
-
Game with a Purpose for Mappings Verification
PublikacjaMappings verification is a laborious task. The paper presents a Game with a Purpose based system for verification of automatically generated mappings. General description of idea standing behind the games with the purpose is given. Description of TGame system, a 2D platform mobile game with verification process included in the gameplay, is provided. Additional mechanisms for anti-cheating, increasing player’s motivation and gathering...
-
Harmony Search for Data Mining with Big Data
PublikacjaIn this paper, some harmony search algorithms have been proposed for data mining with big data. Three areas of big data processing have been studied to apply new metaheuristics. The first problem is related to MapReduce architecture that can be supported by a team of harmony search agents in grid infrastructure. The second dilemma involves development of harmony search in preprocessing of data series before data mining. Moreover,...
-
Harmony Search for Self-configuration of Fault–Tolerant and Intelligent Grids
PublikacjaIn this paper, harmony search algorithms have been proposed to self-configuration of fault-tolerant grids for big data processing. Self-configuration of computer grids lies in the fact that new computer nodes are automatically configured by software agents and then integrated into the grid. A base node works due to several configuration parameters that define some aspects of data communications and energy power consumption. We...
-
Harmony Search to Self-Configuration of Fault-Tolerant Grids for Big Data
PublikacjaIn this paper, harmony search algorithms have been proposed to self-configuration of fault-tolerant grids for big data processing. Some tasks related to big data processing have been considered. Moreover, two criteria have been applied to evaluate quality of grids. The first criterion is a probability that all tasks meet their deadlines and the second one is grid reliability. Furthermore, some intelligent agents based on harmony...
-
Identification of category associations using a multilabel classifier
PublikacjaDescription of the data using categories allows one to describe it on a higher abstraction level. In this way, we can operate on aggregated groups of the information, allowing one to see relationships that do not appear explicit when we analyze the individual objects separately. In this paper we present automatic identification of the associations between categories used for organization of the textual data. As experimental data...
-
Ischemic stroke in general practice - risk factors, prognosis - prospective single center study
PublikacjaBackground. Ischemic stroke is one of the leading causes of death and disability in Poland, in the European Union and in the United States. Awareness related to the risk factors, particularly the modifiable ones, is of crucial importance in general practice and may improve early and long-term prognoses. Objectives. The aim of this study was to assess the risk factors and half-year survival rate in patients with ischemic stroke...
-
KernelHive: a new workflow-based framework for multilevel high performance computing using clusters and workstations with CPUs and GPUs
PublikacjaThe paper presents a new open-source framework called KernelHive for multilevel parallelization of computations among various clusters, cluster nodes, and finally, among both CPUs and GPUs for a particular application. An application is modeled as an acyclic directed graph with a possibility to run nodes in parallel and automatic expansion of nodes (called node unrolling) depending on the number of computation units available....
-
Metody neuronowe do prognozowania finansowego
PublikacjaSztuczne sieci neuronowe mogą być stosowane do prognozowania kursów akcji na giełdzie, oceny wiarygodności kredytobiorców czy prognozowania kryzysów bankowych. W referacie omówiono zasady współpracy sieci neuronowych z algorytmami ewolucyjnymi oraz metodą wektorów wspierających. Ponadto, odniesiono się do pozostałych metod sztucznej inteligencji, które stosowane są w finansach.
-
Modeling DAC Application Execution Time
PublikacjaAn application written in the Divide And Conquer paradigm is more difficult to model than SPMD application because of complex algorithm, causing use of many coefficients in a computational complexity function. Processors are divided into various layers, each layer contains different number of processors. Data packets processed in different layers and transferred between layers have different length. Moreover first layer processors use...
-
Modeling energy consumption of parallel applications
PublikacjaThe paper presents modeling and simulation of energy consumption of two types of parallel applications: geometric Single Program Multiple Data (SPMD) and divide-and-conquer (DAC). Simulation is performed in a new MERPSYS environment. Model of an application uses the Java language with extension representing message exchange between processes working in parallel. Simulation is performed by running threads representing distinct process...
-
Modeling of Performance, Reliability and Energy Efficiency in Large-Scale Computational Environment
PublikacjaLarge scale of complexity of distributed computational systems imposes special challanges for prediction of quality in such systems.Existing quality models for lower-scale systems include functionality,performance,reliability,flexibility and usability.Among these attributes,performance and reliability have a particular significance to the large-scale systems computing quality modeling due to their strong dependence on the system...
-
Modeling Parallel Applications in the MERPSYS Environment
PublikacjaThe chapter presents how to model parallel computational applications for which simulation of execution in a large-scale parallel or distributed environment is performed within the MERPSYS environment. Specifically, it is shown what approaches can be adopted to model key paradigms often used for parallel applications: master-slave, geometric parallelism (single program multiple data), pipelined and divide-and-conquer applications....
-
Modeling SPMD Application Execution Time
PublikacjaParallel applications in a Single Process Multiple Data paradigm assume splitting huge amounts of data to multiple processors working in parallel at small data packets. As the individual data packets are not independent, the processors must interact with each other to exchange results of the calculations with their adjacent partners and take these results into account in their own computations. An example of SPMD is geometric parallelism...
-
Modeling Volunteer Based Systems
PublikacjaThere are two main approaches to processing tasks requiring high amounts of computational power. One approach is using clusters of mostly identical hardware, placed in dedicated locations [1, 2, 3]. The other approach is outsourcing computing resources from large numbers of volunteers connected to the Internet [7]. This chapter presents an application of a mathematical model of the volunteer computing presented in Volume 1 of this...
-
Multi-level Virtualization and Its Impact on System Performance in Cloud Computing
PublikacjaThe results of benchmarking tests of multi-level virtualized environments are presented. There is analysed the performance impact of hardware virtualization, container-type isolation and programming level abstraction. The comparison is made on the basis of a proposed score metric that allows you to compare different aspects of performance. There is general performance (CPU and memory), networking, disk operations and application-like...
-
NVRAM as Main Storage of Parallel File System
PublikacjaModern cluster environments' main trouble used to be lack of computational power provided by CPUs and GPUs, but recently they suffer more and more from insufficient performance of input and output operations. Apart from better network infrastructure and more sophisticated processing algorithms, a lot of solutions base on emerging memory technologies. This paper presents evaluation of using non-volatile random-access memory as a...
-
Performance and Power-Aware Modeling of MPI Applications for Cluster Computing
PublikacjaThe paper presents modeling of performance and power consumption when running parallel applications on modern cluster-based systems. The model includes basic so-called blocks representing either computations or communication. The latter includes both point-to-point and collective communication. Real measurements were performed using MPI applications and routines run on three different clusters with both Infiniband and Gigabit Ethernet...
-
Reliability Modeling for SPMD and DAC Applications in MERPSYS
PublikacjaIn this chapter we shall see a method for modeling of reliability of application execution in the MERPSYS system, and the results of reliability simulation of the same SPMD and DAC application which we have previously presented in this book. Unlike energy usage modeling and modeling of execution efficiency we could not compare the reliability model and simulation results to real experiment just because the experiment would last too...
-
The chapter analyses the K-Means algorithm in its parallel setting. We provide detailed description of the algorithm as well as the way we paralellize the computations. We identified complexity of the particular steps of the algorithm that allows us to build the algorithm model in MERPSYS system. The simulations with the MERPSYS have been performed for different size of the data as well as for different number of the processors used for the computations. The results we got using the model have been compared to the results obtained from real computational environment.
PublikacjaThe chapter analyses the K-Means algorithm in its parallel setting. We provide detailed description of the algorithm as well as the way we paralellize the computations. We identified complexity of the particular steps of the algorithm that allows us to build the algorithm model in MERPSYS system. The simulations with the MERPSYS have been performed for different size of the data as well as for different number of the processors used...
-
Towards Healthcare Cloud Computing
PublikacjaIn this paper we present construction of a software platform for supporting medical research teams, in the area of impedance cardiography, called IPMed. Using the platform, research tasks will be performed by the teams through computer-supported cooperative work. The platform enables secure medical data storing, access to the data for research group members, cooperative analysis of medical data and provide analysis supporting tools...
-
Towards increasing F-measure of approximate string matching in O(1) complexity
PublikacjaThe paper analyzes existing approaches for approximate string matching based on linear search with Levenshtein distance, AllScan and CPMerge algorithms using cosine, Jaccard and Dice distance measures. The methods are presented and compared to our approach that improves indexing time using Locally Sensitive Hashing. Advantages and drawbacks of the methods are identified based on theoretical considerations as well as empirical evaluations...
-
ZARZADZANIE ORGANIZACJĄ TYPU SMART
PublikacjaZaprezentowano organizacji, w której charakterystykę wpisano następujące cechy: strategiczne uwarunkowanie, maksymalną innowacyjność, atrakcyjność dla interesariuszy, rozwijanie osobowości pracowników oraz twórczość w działaniu. Te atrybuty, tworzą koncepcję organizacji SMART (akronim od pierwszych liter powyższych cech), która wyróżnia się dodatkowo strukturą opartą o dynamicznie kreowane zespoły oraz skutecznym wspomaganiem...
Rok 2015
-
A method for counting people attending large public events
PublikacjaThe algorithm for people counting in crowded scenes, based on the idea of virtual gate which uses optical flow method is presented. The concept and practical application of the developed algorithm under real conditions is depicted. The aim of the work is to estimate the number of people passing through entrances of a large sport hall. The most challenging problem was the unpredicted behavior of people while entering the building....
-
Adaption of Open Science Paradigm at Gdańsk University of Technology
PublikacjaOpen Access plays growing role in sharing results of today's research. The movement is very strong and many Universities, including Gdańsk University of Technology, are adapting this approach. This paper presents general information about Open Access movement and the approach adapted by the Gdańsk University of Technology. The components of Open Science platform as well as the procedures are described.
-
Analiza kreatywności oraz wpływu elementów typowo marketingowych wykorzystywanych na internetowych portalach branży turystycznej
PublikacjaW obecnej chwili konkurencja pomiędzy agencjami turystycznymi operującymi na globalnym rynku w Internecie, w powiązaniu ze stale rosnącą biznesową dojrzałością całego sektora, wymusiła wprowadzanie kreatywnych oraz celowo zorientowanych elementów o naturze typowo marketingowej w celu wywołania wrażenia unikalności w sposobie traktowania klienta. Marketingowe elementy o charakterze klasycznym, np. zniżki typu „first minute”, „last...
-
BeesyCluster as Front-End for High Performance Computing Services
PublikacjaThe paper presents the BeesyCluster system as a middleware allowing invocation of services on high performance computing resources within the NIWA Centre of Competence project. Access is possible through both WWW and SOAP Web Service interfaces. The former allows non-experienced users to invoke both simple and complex services exposed through easyto-use servlets. The latter is meant for integration of external applications with...
-
Big Data Processing by Volunteer Computing Supported by Intelligent Agents
PublikacjaIn this paper, volunteer computing systems have been proposed for big data processing. Moreover, intelligent agents have been developed to efficiency improvement of a grid middleware layer. In consequence, an intelligent volunteer grid has been equipped with agents that belong to five sets. The first one consists of some user tasks. Furthermore, two kinds of semi-intelligent tasks have been introduced to implement a middleware...
-
Business Process Modelling and Enterprise Improvement Analysis
PublikacjaThe main trends in developing an approach and tools for business process modelling are discussed. Business Process Management (BPM) and BPM Notation (BPMN) are presented. Moreover, the possibilities of C2NIWA in a wide range of these tools running on a supercomputer are suggested. Firstly, the company development strategy can be planned . Secondly, quick and flexible changes of its processes can be modelled, analysed and implemented....
-
C2 Niwa - The Centre of Competence for Novel Infrastructure of Workable Applications
PublikacjaScientific and cloud computing are discussed in the paper . The paradigm of EaaS (Everything as a Service) is presented and the design methodology of service oriented applications is proposed. Implementation of the SOSE (Software Oriented Software Engineering) approach is analyzed and the roles of developing teams to obtain software products satisfying business requirements are also shown. The above listed activities create a basic...
-
C2 NIWA Community-Segmentation Criteria and Building Brand Associations on the Example of a Selected Target Group
PublikacjaEvery organization which offers products or services wishes to communicate with their customers in the most effective way. This kind of communication is based on proper selection of target groups, which are extracted in the process of market segmentation. That is way it is very important to ask the question to whom the message is to be directed and what kind of message we want to give. This article describes the selection criteria...
-
CD NIWA jako platforma współpracy biznesu i nauki
PublikacjaArtykuł prezentuje koncepcję Centrum Doskonałości Naukowej Infrastruktury Wytwarzania Aplikacji: CD NIWA, które jest środowiskiem przetwarzania w chmurze obliczeniowej, źródłem stale doskonalonych aplikacji i usług oraz platformą współpracy użytkowników. Interaktywność współdziałania nauki i biznesu jest głównym założeniem i celem Centrum Doskonałości NIWA, które ma też służyć budowaniu społeczności współpracującej w zakresie realizacji...
-
Collective citizens' behavior modelling with support of the Internet of Things and Big Data
PublikacjaIn this paper, collective human behaviors are modelled by a development of Big Data mining related to the Internet of Things. Some studies under MapReduce architectures have been carried out to improve an efficiency of Big Data mining. Intelligent agents in data mining have been analyzed for smart city systems, as well as data mining has been described by genetic programming. Furthermore, artificial neural networks have been discussed...
-
DBpedia As a Formal Knowledge Base – An Evaluation
PublikacjaDBpedia is widely used by researchers as a mean of accessing Wikipedia in a standardized way. In this paper it is characterized from the point of view of questions answering system. Simple implementation of such system is also presented. The paper also characterizes alternatives to DBpedia in form of OpenCyc and YAGO knowledge bases. A comparison between DBpedia and those knowledge bases is presented.
-
Future research directions in design of reliable communication systems
PublikacjaIn this position paper on reliable networks, we discuss new trends in the design of reliable communication systems. We focus on a wide range of research directions including protection against software failures as well as failures of communication systems equipment. In particular, we outline future research trends in software failure mitigation, reliability of wireless communications, robust optimization and network design, multilevel...
-
Improving css-KNN Classification Performance by Shifts in Training Data
PublikacjaThis paper presents a new approach to improve the performance of a css-k-NN classifier for categorization of text documents. The css-k-NN classifier (i.e., a threshold-based variation of a standard k-NN classifier we proposed in [1]) is a lazy-learning instance-based classifier. It does not have parameters associated with features and/or classes of objects, that would be optimized during off-line learning. In this paper we propose...
-
Improving Effectiveness of SVM Classifier for Large Scale Data
PublikacjaThe paper presents our approach to SVM implementation in parallel environment. We describe how classification learning and prediction phases were pararellised. We also propose a method for limiting the number of necessary computations during classifier construction. Our method, named one-vs-near, is an extension of typical one-vs-all approach that is used for binary classifiers to work with multiclass problems. We perform experiments...
-
Information Retrieval in Wikipedia with Conceptual Directions
PublikacjaThe paper describes our algorithm used for retrieval of textual information from Wikipedia. The experiments show that the algorithm allows to improve typical evaluation measures of retrieval quality. The improvement of the retrieval results was achieved by two phase usage approach. In first the algorithm extends the set of content that has been indexed by the specified keywords and thus increases the Recall value. Then, using the...
-
Integration of Services into Workflow Applications
PublikacjaDescribing state-of-the-art solutions in distributed system architectures, Integration of Services into Workflow Applications presents a concise approach to the integration of loosely coupled services into workflow applications. It discusses key challenges related to the integration of distributed systems and proposes solutions, both in terms of theoretical aspects such as models and workflow scheduling algorithms, and technical...
-
Inteligentne superkomputery wirtualne do prognozowania trendów w finansach
PublikacjaW pracy zaproponowano wykorzystanie infrastruktury superkomputerów wirtualnych do obliczeń inteligentnych w finansach. Ponieważ zaawansowane metody wymagają dużych mocy obliczeniowych, omówiono gridy obliczeniowe rozumiane jako superkomputery wirtualne ze szczególnym uwzględnieniem środowiska obliczeniowego dla cyberwaluty Bitcoin. Ponadto zaproponowano programowanie genetyczne do opracowania strategii inwestycji giełdowych. Na...
-
Inteligentne systemy agentowe w systemach zdalnego nauczania
PublikacjaW pracy omówiono inteligentne systemy agentowe w systemach zdalnego nauczania. Po krótkim przedstawieniu ewolucji systemów zdalnego nauczania i ich wybranych zastosowań, scharakteryzowano inteligentne agenty edukacyjne. Omówiono wykorzystanie programowania genetycznego oraz algorytmów neuro-ewolucyjnych do implementacji oprogramowania tej klasy. Ponadto, nawiązano do modelu Map-Reduce, który efektywnie wspiera architekturę nowoczesnego...
-
KASKADA Platform in Cloud Environment
PublikacjaKASKADA is a computing platform for developing and running services and applications oriented to multimedia processing and data streaming. A solution for serving the platform in the PaaS model is presented. The paper briefly describes the software for creating the private cloud environment based on the OpenStack software. The coud has been deployed to a supercomputer working in C2 NIWA at the Gdansk University of technology. This...
-
Massive surveillance data processing with supercomputing cluster
PublikacjaIn recent years, increasingly complex algorithms for automated analysis of surveillance data are being developed. The rapid growth in the number of monitoring installations and higher expectations of the quality parameters of the captured data result in an enormous computational cost of analyzing the massive volume of data. In this paper a new model of online processing of surveillance data streams is proposed, which assumes the...
-
Metoda ewolucyjno-neuronowa oraz metoda wektorów nośnych w bankowości
PublikacjaW pracy omówiono wykorzystanie metody ewolucyjno-neuronowe do oceny wiarygodności kredytobiorców. Ponadto, doprecyzowano zastosowanie metody wektorów wspierających do prognozowania kryzysów bankowych. Na zakończenie odniesiono się do alternatywnych metod sztucznej inteligencji, które stosowane są w bankowości.
-
Mobile Offloading Framework: Solution for Optimizing Mobile Applications Using Cloud Computing
PublikacjaNumber of mobile devices and applications is growing rapidly in recent years. Capabilities and performance of these devices can be tremendously extended with the integration of cloud computing. However, multiple challenges regarding implementation of these type of mobile applications are known, like differences in architecture, optimization and operating system support. This paper summarizes issues with mobile cloud computing and...
-
Modern Methods of Software Development
PublikacjaSoftware development methods consist of such activities like analysis, planning, development, testing, deployment, maintenance and retirement. All of them can be divided into two main categories - traditional and agile. The objective of this paper is to review some of the most popular traditional, agile and open source development methods. Special attention was paid to the common stages of all methods - testing and maintenance.
-
Modern Platform for Parallel Algorithms Testing: Java on Intel Xeon Phi
PublikacjaParallel algorithms are popular method of increasing system performance. Apart from showing their properties using asymptotic analysis, proof-of-concept implementation and practical experiments are often required. In order to speed up the development and provide simple and easily accessible testing environment that enables execution of reliable experiments, the paper proposes a platform with multi-core computational accelerator:...
-
Optymalizacja parametrów aplikacji w procesie wytwarzania oprogramowania dla Big Data
PublikacjaWytwarzanie oprogramowania wiąże się z szeregiem decyzji projektowych obejmujących architekturę aplikacji, wykorzystywane technologie implementacji, jak i zewnętrzne biblioteki. W pracy przedstawiono metodę wyboru technologii i bibliotek związanych z big data, której celem jest optymalizacja atrybutów aplikacji takich jak wydajność działajacej aplikacji jak również optymalizacja procesu wytwarzania oprogramowania. Metoda wyboru...
-
Platforma KASKADA jako system zapewniania bezpieczeństwa poprzez masową analizę strumieni multimedialnych w czasie rzeczywistym
PublikacjaW artykule przedstawiono Platformę KASKADA rozumianą jako system przetwarzania danych cyfrowych i strumieni multimedialnych oraz stanowiącą ofertę usług wspomagających zapewnienie bezpieczeństwa publicznego, ocenę badań medycznych i ochronę własności intelektualnej. celem prowadzonych prac było stworzenie innowacyjnego systemu umozliwiajacego wydajną i masową analizę dokumentów cyfrowych i strumieni multimedialnych w czasie rzeczywistym...
-
Program IPMed — zastosowanie kardiografii impedancyjnej do monitorowania układu krążenia
PublikacjaZaburzenia hemodynamiczne stanowią drugą najczęstszą przyczynę zgonu u chorych po udarze mózgu. Istniejące standardy postępowania nie uwzględniają pełnej diagnostyki układu sercowo-naczyniowego u tych chorych. W niniejszej pracy przedstawiono wieloośrodkowy projekt IPMed, który jest próbą aplikacji kardiografii impedancyjnej (ICG) do monitorowania hemodynamiki pacjentów po udarze mózgu.
-
Real-Time connection Between Immerse 3D Vizualization Laboratory and Kaskada Platform
PublikacjaMultimedia stream processing into two cooperative different systems (cluster platform and virtual lab) is considered. The considered selected information about the systems is presented and the idea of its communication when executing the distributed application is proposed. A general schema of the communication architecture is given. Tests of data transmission quality are considered and their results are presented.
-
Retrieval of Heterogeneus Sevices in C2NIWA Repository
PublikacjaThe paper reviews the methods used for retrieval of information and services. The selected approaches presented in the review inspired us to build retrieval mechanisms in a system for searching the resources stored in the C2NIWA repository. We describe the architecture of the system, its functions and the surrounding subsystems to which it is related. For retrieval of C2NIWA sevices we propos three approaches based on: keyword...
-
Semantic URL Analytics to Support Efficient Annotation of Large Scale Web Archives
PublikacjaLong-term Web archives comprise Web documents gathered over longer time periods and can easily reach hundreds of terabytes in size. Semantic annotations such as named entities can facilitate intelligent access to the Web archive data. However, the annotation of the entire archive content on this scale is often infeasible. The most efficient way to access the documents within Web archives is provided through their URLs, which are...
-
Service Recommendation on WIKI-WS Platform
PublikacjaThe article presents the issues of recaommendation services for users. The commonly used solutions include known classes of recommendation systems and information about their area of use and the most frequently used algorithms. the issue of choosing the services has been described based on the Wiki-WS platform including a model of cooperation with the recommendation system. The conclusion from the analysis of the proposed model...
-
Simulation of parallel similarity measure computations for large data sets
PublikacjaThe paper presents our approach to implementation of similarity measure for big data analysis in a parallel environment. We describe the algorithm for parallelisation of the computations. We provide results from a real MPI application for computations of similarity measures as well as results achieved with our simulation software. The simulation environment allows us to model parallel systems of various sizes with various components...
-
Task Allocation and Scalability Evaluation for Real-Time Multimedia Processing in a Cluster Envirinment
PublikacjaAn allocation algorithm for stream processing tasks is proposed (Modified best Fit Descendent, MBFD). A comparison with another solution (BFD) is provided. Tests of the algorithms in an HPC environment are descrobed and the results are presented. A proper scalability metric is proposed and used for the evaluation of the allocation algorithm.
-
Tryton Supercomputer Capabilities for Analysis of Massive Data Streams
PublikacjaThe recently deployed supercomputer Tryton, located in the Academic Computer Center of Gdansk University of Technology, provides great means for massive parallel processing. Moreover, the status of the Center as one of the main network nodes in the PIONIER network enables the fast and reliable transfer of data produced by miscellaneous devices scattered in the area of the whole country. The typical examples of such data are streams...
-
Two Stage SVM and kNN Text Documents Classifier
PublikacjaThe paper presents an approach to the large scale text documents classification problem in parallel environments. A two stage classifier is proposed, based on a combination of k-nearest neighbors and support vector machines classification methods. The details of the classifier and the parallelisation of classification, learning and prediction phases are described. The classifier makes use of our method named one-vs-near. It is...
-
User Trust Levels and Their Impact on System Security and Usability
PublikacjaA multilateral trust between a user and a system is considered. First of all we concentrate on user trust levels associated with the context-oriented CoRBAC model. Consequently, there were computed user profiles on the basis of its implementation in the information processing system “My GUT”. Furthermore, analysis of these profiles and the impact of user trust levels on system security and usability have been discussed.
-
Using Rule-Based System for Monitoring Marine Navigation Data Processing
PublikacjaProcessing marine navigational data requires sophisticated software solutions. Typically, specialized tools called processors are analyzing raw data from different sensors. It becomes important to create the monitoring software that is able to validate and verify processing components integrated into the final system. Drools®business rule management platform provides a core business rules engine, web authoring and rules management...
-
WIKI-WS as a C2 NIWA Web Service Management Platform
PublikacjaThe Wiki-WS platform was implemented within the C2 NIWA project for production purposes. Wiki-WS stands for developing, managing and maintaining web services. The production deployment needed implementation of several functional improvements and establishing a strong security 7 safety policy. The WikiWS platform has to be used as an educational environmement for developing web sevices and production environment for execution of...
-
Wyzwania bezpieczeństwa nowoczesnych platform nauczania zdalnego
PublikacjaW artykule zaprezentowano aspekty bezpieczeństwa nowoczesnych platform nauczania zdalnego. Przedstawiono ich charakterystykę i wyzwania technologiczne. Zdefiniowano bezpieczeństwo i istniejące w tym obszarze zagrożenia. Przybliżono metody oceny poziomu bezpieczeństwa. Na bazie wdrożonej na Politechnice Gdańskiej platformy eNauczanie PG omówiono sposoby zapewniania zakładanego poziomu bezpieczeństwa takich systemów.
Rok 2014
-
A Human Behaviour Model Agent for Testing of Voluntary Computing Systems
PublikacjaPaper presents a design and performance of a voluntary-based distributed computing system testing agent, implementing a human behaviour model. The agent, nicknamed iRobot, was designed and implemented to enable controlled, large scale testing of core algorithms of Comcute - a new voluntary distributed computing platform complementary to BOINC. The main agent design goals were: emulation of human behaviour when browsing web pages,...
-
A Model-Driven Solution for Development of Multimedia Stream Processing Applications
PublikacjaThis paper presents results of action research related to model-driven solutions in the area of multimedia stream processing. The practical problem to be solved was the need to support application developers who make their multimedia stream processing applications in a supercomputer environment. The solution consists of a domain-specific visual language for composing complex services from simple services called Multimedia Stream...
-
A Regular Expression Matching Application with Configurable Data Intensity for Testing Heterogeneous HPC Systems
PublikacjaModern High Performance Computing (HPC) systems are becoming increasingly heterogeneous in terms of utilized hardware, as well as software solutions. The problems, that we wish to efficiently solve using those systems have different complexity, not only considering magnitude, but also the type of complexity: computation, data or communication intensity. Developing new mechanisms for dealing with those complexities or choosing an...
-
A Workflow Application for Parallel Processing of Big Data from an Internet Portal
PublikacjaThe paper presents a workflow application for efficient parallel processing of data downloaded from an Internet portal. The workflow partitions input files into subdirectories which are further split for parallel processing by services installed on distinct computer nodes. This way, analysis of the first ready subdirectories can start fast and is handled by services implemented as parallel multithreaded applications using multiple...