Publications
total: 855
Catalog Publications
Year 2016
-
Harmony Search for Self-configuration of Fault–Tolerant and Intelligent Grids
PublicationIn this paper, harmony search algorithms are proposed for the self-configuration of fault-tolerant grids for big data processing. Self-configuration of computer grids means that new computer nodes are automatically configured by software agents and then integrated into the grid. A base node operates according to several configuration parameters that define some aspects of data communication and energy consumption. We...
-
Harmony Search to Self-Configuration of Fault-Tolerant Grids for Big Data
PublicationIn this paper, harmony search algorithms are proposed for the self-configuration of fault-tolerant grids for big data processing. Some tasks related to big data processing are considered. Moreover, two criteria are applied to evaluate the quality of grids. The first criterion is the probability that all tasks meet their deadlines, and the second is grid reliability. Furthermore, some intelligent agents based on harmony...
-
Identification of category associations using a multilabel classifier
PublicationDescription of the data using categories allows one to describe it on a higher abstraction level. In this way, we can operate on aggregated groups of the information, allowing one to see relationships that do not appear explicit when we analyze the individual objects separately. In this paper we present automatic identification of the associations between categories used for organization of the textual data. As experimental data...
-
Ischemic stroke in general practice - risk factors, prognosis - prospective single center study
PublicationBackground. Ischemic stroke is one of the leading causes of death and disability in Poland, in the European Union and in the United States. Awareness related to the risk factors, particularly the modifiable ones, is of crucial importance in general practice and may improve early and long-term prognoses. Objectives. The aim of this study was to assess the risk factors and half-year survival rate in patients with ischemic stroke...
-
KernelHive: a new workflow-based framework for multilevel high performance computing using clusters and workstations with CPUs and GPUs
PublicationThe paper presents a new open-source framework called KernelHive for multilevel parallelization of computations among various clusters, cluster nodes, and finally, among both CPUs and GPUs for a particular application. An application is modeled as an acyclic directed graph with a possibility to run nodes in parallel and automatic expansion of nodes (called node unrolling) depending on the number of computation units available....
-
Neural methods for financial forecasting
PublicationArtificial neural networks can be used to forecast stock prices on the stock exchange, to assess the creditworthiness of borrowers, or to forecast banking crises. The paper discusses how neural networks can work together with evolutionary algorithms and the support vector method. It also refers to other artificial intelligence methods used in finance.
-
Modeling DAC Application Execution Time
PublicationAn application written in the Divide And Conquer paradigm is more difficult to model than an SPMD application because of its complex algorithm, which requires many coefficients in the computational complexity function. Processors are divided into layers, each containing a different number of processors. Data packets processed in different layers and transferred between layers have different lengths. Moreover, first-layer processors use...
-
Modeling energy consumption of parallel applications
PublicationThe paper presents modeling and simulation of the energy consumption of two types of parallel applications: geometric Single Program Multiple Data (SPMD) and divide-and-conquer (DAC). Simulation is performed in a new MERPSYS environment. The application model uses the Java language with an extension representing message exchange between processes working in parallel. Simulation is performed by running threads representing distinct process...
-
Modeling of Performance, Reliability and Energy Efficiency in Large-Scale Computational Environment
PublicationThe large scale of complexity of distributed computational systems imposes special challenges for the prediction of quality in such systems. Existing quality models for lower-scale systems include functionality, performance, reliability, flexibility and usability. Among these attributes, performance and reliability have a particular significance to the large-scale systems computing quality modeling due to their strong dependence on the system...
-
Modeling Parallel Applications in the MERPSYS Environment
PublicationThe chapter presents how to model parallel computational applications for which simulation of execution in a large-scale parallel or distributed environment is performed within the MERPSYS environment. Specifically, it is shown what approaches can be adopted to model key paradigms often used for parallel applications: master-slave, geometric parallelism (single program multiple data), pipelined and divide-and-conquer applications....
-
Modeling SPMD Application Execution Time
PublicationParallel applications in a Single Process Multiple Data paradigm assume splitting huge amounts of data among multiple processors working in parallel on small data packets. As the individual data packets are not independent, the processors must interact with each other to exchange results of the calculations with their adjacent partners and take these results into account in their own computations. An example of SPMD is geometric parallelism...
-
Modeling Volunteer Based Systems
PublicationThere are two main approaches to processing tasks requiring high amounts of computational power. One approach is using clusters of mostly identical hardware, placed in dedicated locations [1, 2, 3]. The other approach is outsourcing computing resources from large numbers of volunteers connected to the Internet [7]. This chapter presents an application of a mathematical model of the volunteer computing presented in Volume 1 of this...
-
Multi-level Virtualization and Its Impact on System Performance in Cloud Computing
PublicationThe results of benchmarking tests of multi-level virtualized environments are presented. The performance impact of hardware virtualization, container-type isolation and programming-level abstraction is analysed. The comparison is based on a proposed score metric that allows different aspects of performance to be compared: general performance (CPU and memory), networking, disk operations and application-like...
-
NVRAM as Main Storage of Parallel File System
PublicationThe main problem of modern cluster environments used to be a lack of computational power provided by CPUs and GPUs, but recently they have suffered more and more from insufficient performance of input and output operations. Apart from better network infrastructure and more sophisticated processing algorithms, many solutions are based on emerging memory technologies. This paper presents an evaluation of using non-volatile random-access memory as a...
-
Performance and Power-Aware Modeling of MPI Applications for Cluster Computing
PublicationThe paper presents modeling of performance and power consumption when running parallel applications on modern cluster-based systems. The model includes basic so-called blocks representing either computations or communication. The latter includes both point-to-point and collective communication. Real measurements were performed using MPI applications and routines run on three different clusters with both Infiniband and Gigabit Ethernet...
-
Reliability Modeling for SPMD and DAC Applications in MERPSYS
PublicationIn this chapter we present a method for modeling the reliability of application execution in the MERPSYS system, and the results of reliability simulation of the same SPMD and DAC applications that were presented earlier in this book. Unlike energy usage modeling and modeling of execution efficiency, we could not compare the reliability model and simulation results to a real experiment because the experiment would last too...
-
The chapter analyses the K-Means algorithm in its parallel setting. We provide a detailed description of the algorithm as well as the way we parallelize the computations. We identified the complexity of the particular steps of the algorithm, which allows us to build the algorithm model in the MERPSYS system. Simulations with MERPSYS were performed for different data sizes as well as for different numbers of processors used for the computations. The results obtained using the model were compared to the results obtained from a real computational environment.
-
Towards Healthcare Cloud Computing
PublicationIn this paper we present the construction of a software platform, called IPMed, for supporting medical research teams in the area of impedance cardiography. Using the platform, research tasks will be performed by the teams through computer-supported cooperative work. The platform enables secure medical data storage, access to the data for research group members, cooperative analysis of medical data, and provides analysis supporting tools...
-
Towards increasing F-measure of approximate string matching in O(1) complexity
PublicationThe paper analyzes existing approaches for approximate string matching based on linear search with Levenshtein distance, AllScan and CPMerge algorithms using cosine, Jaccard and Dice distance measures. The methods are presented and compared to our approach that improves indexing time using Locally Sensitive Hashing. Advantages and drawbacks of the methods are identified based on theoretical considerations as well as empirical evaluations...
-
Managing a SMART-Type Organization
PublicationThe paper presents an organization characterized by the following features: strategic orientation (strategiczne uwarunkowanie), maximum innovativeness (maksymalna innowacyjność), attractiveness to stakeholders (atrakcyjność dla interesariuszy), development of employees' personalities (rozwijanie osobowości pracowników) and creativity in action (twórczość w działaniu). These attributes form the concept of the SMART organization (an acronym of the first letters of the above features in Polish), which is additionally distinguished by a structure based on dynamically created teams and by effective support...
Year 2015
-
A method for counting people attending large public events
PublicationAn algorithm for people counting in crowded scenes, based on the idea of a virtual gate that uses the optical flow method, is presented. The concept and practical application of the developed algorithm under real conditions are described. The aim of the work is to estimate the number of people passing through the entrances of a large sports hall. The most challenging problem was the unpredictable behavior of people while entering the building....
-
Adaption of Open Science Paradigm at Gdańsk University of Technology
PublicationOpen Access plays a growing role in sharing the results of today's research. The movement is very strong and many universities, including Gdańsk University of Technology, are adopting this approach. This paper presents general information about the Open Access movement and the approach adopted by Gdańsk University of Technology. The components of the Open Science platform as well as the procedures are described.
-
Analysis of creativity and of the impact of typical marketing elements used on Internet portals in the tourism industry
PublicationAt present, competition between travel agencies operating on the global Internet market, combined with the steadily growing business maturity of the whole sector, has forced the introduction of creative and purposefully oriented elements of a typically marketing nature in order to create an impression of uniqueness in the way the customer is treated. Classic marketing elements, e.g. discounts of the "first minute", "last...
-
BeesyCluster as Front-End for High Performance Computing Services
PublicationThe paper presents the BeesyCluster system as a middleware allowing invocation of services on high performance computing resources within the NIWA Centre of Competence project. Access is possible through both WWW and SOAP Web Service interfaces. The former allows inexperienced users to invoke both simple and complex services exposed through easy-to-use servlets. The latter is meant for integration of external applications with...
-
Big Data Processing by Volunteer Computing Supported by Intelligent Agents
PublicationIn this paper, volunteer computing systems are proposed for big data processing. Moreover, intelligent agents have been developed to improve the efficiency of a grid middleware layer. Consequently, an intelligent volunteer grid has been equipped with agents that belong to five sets. The first one consists of some user tasks. Furthermore, two kinds of semi-intelligent tasks have been introduced to implement a middleware...
-
Business Process Modelling and Enterprise Improvement Analysis
PublicationThe main trends in developing an approach and tools for business process modelling are discussed. Business Process Management (BPM) and BPM Notation (BPMN) are presented. Moreover, the possibilities offered by C2NIWA for running a wide range of these tools on a supercomputer are suggested. Firstly, the company development strategy can be planned. Secondly, quick and flexible changes of its processes can be modelled, analysed and implemented....
-
C2 Niwa - The Centre of Competence for Novel Infrastructure of Workable Applications
PublicationScientific and cloud computing are discussed in the paper. The paradigm of EaaS (Everything as a Service) is presented and the design methodology of service oriented applications is proposed. Implementation of the SOSE (Software Oriented Software Engineering) approach is analyzed and the roles of developing teams to obtain software products satisfying business requirements are also shown. The above listed activities create a basic...
-
C2 NIWA Community-Segmentation Criteria and Building Brand Associations on the Example of a Selected Target Group
PublicationEvery organization which offers products or services wishes to communicate with its customers in the most effective way. This kind of communication is based on proper selection of target groups, which are extracted in the process of market segmentation. That is why it is very important to ask to whom the message is to be directed and what kind of message we want to give. This article describes the selection criteria...
-
CD NIWA as a platform for cooperation between business and science
PublicationThe article presents the concept of the Centre of Excellence for the Scientific Infrastructure of Application Development (Centrum Doskonałości Naukowej Infrastruktury Wytwarzania Aplikacji): CD NIWA, which is a cloud computing environment, a source of continuously improved applications and services, and a platform for user cooperation. Interactive cooperation between science and business is the main premise and goal of the NIWA Centre of Excellence, which is also meant to build a community that cooperates on carrying out...
-
Collective citizens' behavior modelling with support of the Internet of Things and Big Data
PublicationIn this paper, collective human behaviors are modelled by developing Big Data mining related to the Internet of Things. Some studies on MapReduce architectures have been carried out to improve the efficiency of Big Data mining. Intelligent agents in data mining have been analyzed for smart city systems, and data mining has been described by genetic programming. Furthermore, artificial neural networks have been discussed...
-
DBpedia As a Formal Knowledge Base – An Evaluation
PublicationDBpedia is widely used by researchers as a means of accessing Wikipedia in a standardized way. In this paper it is characterized from the point of view of a question answering system. A simple implementation of such a system is also presented. The paper also characterizes alternatives to DBpedia in the form of the OpenCyc and YAGO knowledge bases. A comparison between DBpedia and those knowledge bases is presented.
-
Future research directions in design of reliable communication systems
PublicationIn this position paper on reliable networks, we discuss new trends in the design of reliable communication systems. We focus on a wide range of research directions including protection against software failures as well as failures of communication systems equipment. In particular, we outline future research trends in software failure mitigation, reliability of wireless communications, robust optimization and network design, multilevel...
-
Improving css-KNN Classification Performance by Shifts in Training Data
PublicationThis paper presents a new approach to improve the performance of a css-k-NN classifier for categorization of text documents. The css-k-NN classifier (i.e., a threshold-based variation of a standard k-NN classifier we proposed in [1]) is a lazy-learning instance-based classifier. It does not have parameters associated with features and/or classes of objects, that would be optimized during off-line learning. In this paper we propose...
-
Improving Effectiveness of SVM Classifier for Large Scale Data
PublicationThe paper presents our approach to SVM implementation in a parallel environment. We describe how the classification learning and prediction phases were parallelised. We also propose a method for limiting the number of necessary computations during classifier construction. Our method, named one-vs-near, is an extension of the typical one-vs-all approach that is used for binary classifiers to work with multiclass problems. We perform experiments...
-
Information Retrieval in Wikipedia with Conceptual Directions
PublicationThe paper describes our algorithm used for retrieval of textual information from Wikipedia. The experiments show that the algorithm improves typical evaluation measures of retrieval quality. The improvement of the retrieval results was achieved by a two-phase approach. In the first phase, the algorithm extends the set of content that has been indexed by the specified keywords and thus increases the Recall value. Then, using the...
-
Integration of Services into Workflow Applications
PublicationDescribing state-of-the-art solutions in distributed system architectures, Integration of Services into Workflow Applications presents a concise approach to the integration of loosely coupled services into workflow applications. It discusses key challenges related to the integration of distributed systems and proposes solutions, both in terms of theoretical aspects such as models and workflow scheduling algorithms, and technical...
-
Intelligent virtual supercomputers for forecasting trends in finance
PublicationThe paper proposes using virtual supercomputer infrastructure for intelligent computations in finance. Because advanced methods require large computing power, computational grids understood as virtual supercomputers are discussed, with particular attention to the computing environment for the Bitcoin cryptocurrency. In addition, genetic programming is proposed for developing stock market investment strategies...
-
Intelligent agent systems in distance learning systems
PublicationThe paper discusses intelligent agent systems in distance learning systems. After a brief overview of the evolution of distance learning systems and their selected applications, intelligent educational agents are characterized. The use of genetic programming and neuro-evolutionary algorithms to implement software of this class is discussed. In addition, the paper refers to the Map-Reduce model, which effectively supports the architecture of a modern...
-
KASKADA Platform in Cloud Environment
PublicationKASKADA is a computing platform for developing and running services and applications oriented to multimedia processing and data streaming. A solution for serving the platform in the PaaS model is presented. The paper briefly describes the software for creating the private cloud environment based on the OpenStack software. The cloud has been deployed to a supercomputer working in C2 NIWA at the Gdańsk University of Technology. This...
-
Massive surveillance data processing with supercomputing cluster
PublicationIn recent years, increasingly complex algorithms for automated analysis of surveillance data are being developed. The rapid growth in the number of monitoring installations and higher expectations of the quality parameters of the captured data result in an enormous computational cost of analyzing the massive volume of data. In this paper a new model of online processing of surveillance data streams is proposed, which assumes the...
-
The evolutionary-neural method and the support vector method in banking
PublicationThe paper discusses the use of evolutionary-neural methods to assess the creditworthiness of borrowers. In addition, the application of the support vector method to forecasting banking crises is described in more detail. Finally, the paper refers to alternative artificial intelligence methods used in banking.
-
Mobile Offloading Framework: Solution for Optimizing Mobile Applications Using Cloud Computing
PublicationThe number of mobile devices and applications has been growing rapidly in recent years. The capabilities and performance of these devices can be tremendously extended through integration with cloud computing. However, multiple challenges regarding the implementation of this type of mobile application are known, such as differences in architecture, optimization and operating system support. This paper summarizes issues with mobile cloud computing and...
-
Modern Methods of Software Development
PublicationSoftware development methods consist of activities such as analysis, planning, development, testing, deployment, maintenance and retirement. These methods can be divided into two main categories: traditional and agile. The objective of this paper is to review some of the most popular traditional, agile and open source development methods. Special attention was paid to the stages common to all methods: testing and maintenance.
-
Modern Platform for Parallel Algorithms Testing: Java on Intel Xeon Phi
PublicationParallel algorithms are a popular method of increasing system performance. Apart from showing their properties using asymptotic analysis, a proof-of-concept implementation and practical experiments are often required. In order to speed up the development and provide a simple and easily accessible testing environment that enables execution of reliable experiments, the paper proposes a platform with a multi-core computational accelerator:...
-
Optimization of application parameters in the software development process for Big Data
PublicationSoftware development involves a series of design decisions concerning the application architecture, the implementation technologies used, and external libraries. The paper presents a method for selecting technologies and libraries related to big data, whose goal is to optimize application attributes such as the performance of the running application, as well as to optimize the software development process. The selection method...
-
The KASKADA Platform as a system for ensuring security through mass real-time analysis of multimedia streams
PublicationThe article presents the KASKADA Platform, understood as a system for processing digital data and multimedia streams and as an offering of services that support public security, the assessment of medical examinations, and the protection of intellectual property. The aim of the work was to create an innovative system enabling efficient, mass analysis of digital documents and multimedia streams in real time...
-
The IPMed program: application of impedance cardiography to monitoring the circulatory system
PublicationHemodynamic disorders are the second most common cause of death in patients after stroke. Existing standards of care do not include full diagnostics of the cardiovascular system in these patients. This paper presents the multicenter IPMed project, which is an attempt to apply impedance cardiography (ICG) to monitoring the hemodynamics of patients after stroke.
-
Real-Time connection Between Immerse 3D Vizualization Laboratory and Kaskada Platform
PublicationMultimedia stream processing in two different cooperating systems (a cluster platform and a virtual lab) is considered. Selected information about the systems is presented, and an approach to their communication when executing a distributed application is proposed. A general schema of the communication architecture is given. Tests of data transmission quality are considered and their results are presented.
-
Retrieval of Heterogeneous Services in C2NIWA Repository
PublicationThe paper reviews the methods used for retrieval of information and services. The selected approaches presented in the review inspired us to build retrieval mechanisms in a system for searching the resources stored in the C2NIWA repository. We describe the architecture of the system, its functions and the surrounding subsystems to which it is related. For retrieval of C2NIWA services we propose three approaches based on: keyword...
-
Semantic URL Analytics to Support Efficient Annotation of Large Scale Web Archives
PublicationLong-term Web archives comprise Web documents gathered over longer time periods and can easily reach hundreds of terabytes in size. Semantic annotations such as named entities can facilitate intelligent access to the Web archive data. However, the annotation of the entire archive content on this scale is often infeasible. The most efficient way to access the documents within Web archives is provided through their URLs, which are...