Publications
Filters
total: 855
Catalog Publications
Year 2015
-
Service Recommendation on WIKI-WS Platform
PublicationThe article presents the issues of recaommendation services for users. The commonly used solutions include known classes of recommendation systems and information about their area of use and the most frequently used algorithms. the issue of choosing the services has been described based on the Wiki-WS platform including a model of cooperation with the recommendation system. The conclusion from the analysis of the proposed model...
-
Simulation of parallel similarity measure computations for large data sets
PublicationThe paper presents our approach to implementation of similarity measure for big data analysis in a parallel environment. We describe the algorithm for parallelisation of the computations. We provide results from a real MPI application for computations of similarity measures as well as results achieved with our simulation software. The simulation environment allows us to model parallel systems of various sizes with various components...
-
Task Allocation and Scalability Evaluation for Real-Time Multimedia Processing in a Cluster Envirinment
PublicationAn allocation algorithm for stream processing tasks is proposed (Modified best Fit Descendent, MBFD). A comparison with another solution (BFD) is provided. Tests of the algorithms in an HPC environment are descrobed and the results are presented. A proper scalability metric is proposed and used for the evaluation of the allocation algorithm.
-
Tryton Supercomputer Capabilities for Analysis of Massive Data Streams
PublicationThe recently deployed supercomputer Tryton, located in the Academic Computer Center of Gdansk University of Technology, provides great means for massive parallel processing. Moreover, the status of the Center as one of the main network nodes in the PIONIER network enables the fast and reliable transfer of data produced by miscellaneous devices scattered in the area of the whole country. The typical examples of such data are streams...
-
Two Stage SVM and kNN Text Documents Classifier
PublicationThe paper presents an approach to the large scale text documents classification problem in parallel environments. A two stage classifier is proposed, based on a combination of k-nearest neighbors and support vector machines classification methods. The details of the classifier and the parallelisation of classification, learning and prediction phases are described. The classifier makes use of our method named one-vs-near. It is...
-
User Trust Levels and Their Impact on System Security and Usability
PublicationA multilateral trust between a user and a system is considered. First of all we concentrate on user trust levels associated with the context-oriented CoRBAC model. Consequently, there were computed user profiles on the basis of its implementation in the information processing system “My GUT”. Furthermore, analysis of these profiles and the impact of user trust levels on system security and usability have been discussed.
-
Using Rule-Based System for Monitoring Marine Navigation Data Processing
PublicationProcessing marine navigational data requires sophisticated software solutions. Typically, specialized tools called processors are analyzing raw data from different sensors. It becomes important to create the monitoring software that is able to validate and verify processing components integrated into the final system. Drools®business rule management platform provides a core business rules engine, web authoring and rules management...
-
WIKI-WS as a C2 NIWA Web Service Management Platform
PublicationThe Wiki-WS platform was implemented within the C2 NIWA project for production purposes. Wiki-WS stands for developing, managing and maintaining web services. The production deployment needed implementation of several functional improvements and establishing a strong security 7 safety policy. The WikiWS platform has to be used as an educational environmement for developing web sevices and production environment for execution of...
-
Wyzwania bezpieczeństwa nowoczesnych platform nauczania zdalnego
PublicationW artykule zaprezentowano aspekty bezpieczeństwa nowoczesnych platform nauczania zdalnego. Przedstawiono ich charakterystykę i wyzwania technologiczne. Zdefiniowano bezpieczeństwo i istniejące w tym obszarze zagrożenia. Przybliżono metody oceny poziomu bezpieczeństwa. Na bazie wdrożonej na Politechnice Gdańskiej platformy eNauczanie PG omówiono sposoby zapewniania zakładanego poziomu bezpieczeństwa takich systemów.
Year 2014
-
A Human Behaviour Model Agent for Testing of Voluntary Computing Systems
PublicationPaper presents a design and performance of a voluntary-based distributed computing system testing agent, implementing a human behaviour model. The agent, nicknamed iRobot, was designed and implemented to enable controlled, large scale testing of core algorithms of Comcute - a new voluntary distributed computing platform complementary to BOINC. The main agent design goals were: emulation of human behaviour when browsing web pages,...
-
A Model-Driven Solution for Development of Multimedia Stream Processing Applications
PublicationThis paper presents results of action research related to model-driven solutions in the area of multimedia stream processing. The practical problem to be solved was the need to support application developers who make their multimedia stream processing applications in a supercomputer environment. The solution consists of a domain-specific visual language for composing complex services from simple services called Multimedia Stream...
-
A Regular Expression Matching Application with Configurable Data Intensity for Testing Heterogeneous HPC Systems
PublicationModern High Performance Computing (HPC) systems are becoming increasingly heterogeneous in terms of utilized hardware, as well as software solutions. The problems, that we wish to efficiently solve using those systems have different complexity, not only considering magnitude, but also the type of complexity: computation, data or communication intensity. Developing new mechanisms for dealing with those complexities or choosing an...
-
A Workflow Application for Parallel Processing of Big Data from an Internet Portal
PublicationThe paper presents a workflow application for efficient parallel processing of data downloaded from an Internet portal. The workflow partitions input files into subdirectories which are further split for parallel processing by services installed on distinct computer nodes. This way, analysis of the first ready subdirectories can start fast and is handled by services implemented as parallel multithreaded applications using multiple...
-
Accelerating Video Frames Classification With Metric Based Scene Segmentation
PublicationThis paper addresses the problem of the efficient classification of images in a video stream in cases, where all of the video has to be labeled. Realizing the similarity of consecutive frames, we introduce a set of simple metrics to measure that similarity. To use these observations for decreasing the number of necessary classifications, we propose a scene segmentation algorithm. Performed experiments have evaluated the acquired...
-
Accelerometer-based Human Activity Recognition and the Impact of the Sample Size
PublicationThe presented study focused on the recognition of eight user activities (e.g. walking, lying, climbing stairs) basing on the measurements from an accelerometer embedded in a mobile device. It is assumed that the device is carried in a specific location of the user’s clothing. Three types of classifiers were tested on different sizes of the samples. The influence of the time window (the duration of a single trial) on selected activities...
-
AN ALGORITHM FOR PORTAL HYPERTENSIVE GASTROPATHY RECOGNITION ON THE ENDOSCOPIC RECORDINGS
PublicationSymptoms recognition of portal hypertensive gastropathy (PHG) can be done by analysing endoscopic recordings, but manual analysis done by physician may take a long time. This increases probability of missing some symptoms and automated methods may be applied to prevent that. In this paper a novel hybrid algorithm for recognition of early stage of portal hypertensive gastropathy is proposed. First image preprocessing is described....
-
ANYTIME POLYNOMIAL HEURISTIC ALGORITHM FOR PARTITIONING GROUPS OF DATA WITH PRESERVING CLASS PROPORTIONS FOR CROSS-VALIDATION
PublicationThe article describes a problem of splitting data for k-fold cross-validation, where class proportions must be preserved, with additional constraint that data is divided into groups that cannot be split into different cross-validation sets. This problem often occurs in e.g. medical data processing, where data samples from one patient must be included in the same cross-validation set. As this problem is NP-complete, a heuristic...
-
Automatic Classification of Polish Sign Language Words
PublicationIn the article we present the approach to automatic recognition of hand gestures using eGlove device. We present the research results of the system for detection and classification of static and dynamic words of Polish language. The results indicate the usage of eGlove allows to gain good recognition quality that additionally can be improved using additional data sources such as RGB cameras.
-
Big Data Paradigm Developed in Volunteer Grid System with Genetic Programming Scheduler
PublicationArtificial intelligence techniques are capable to handle a large amount of information collected over the web. In this paper, big data paradigm has been studied in volunteer and grid system called Comcute that is optimized by a genetic programming scheduler. This scheduler can optimize load balancing and resource cost. Genetic programming optimizer has been applied for finding the Pareto solu-tions. Finally, some results from numerical...
-
Color-based Detection of Bleeding in Endoscopic Images
PublicationIn this paper a color descriptor designed for bleeding detection in endoscopic images is proposed. The development of the algorithm was carried out on a representative training set of 36 images of bleeding and 25 clear images. Another 38 bleeding and 26 normal images were used in the final stage as a test set. All of the considered images were extracted from separate endoscopic examinations. The experiments include color distribution...
-
Comparative Analysis of Text Representation Methods Using Classification
PublicationIn our work, we review and empirically evaluate five different raw methods of text representation that allow automatic processing of Wikipedia articles. The main contribution of the article—evaluation of approaches to text representation for machine learning tasks—indicates that the text representation is fundamental for achieving good categorization results. The analysis of the representation methods creates a baseline that cannot...
-
Comparison of selected algorithms for scheduling workflow applications with dynamically changing service availability
PublicationThis paper compares the quality and execution times of several algorithms for scheduling service based workflow applications with changeable service availability and parameters. A workflow is defined as an acyclic directed graph with nodes corresponding to tasks and edges to dependencies between tasks. For each task, one out of several available services needs to be chosen and scheduled to minimize the workflow execution time and...
-
Construction of tournament robot Mirror taking part in Mini Sumo competition
PublicationW artykule zaprezentowano projekt budowy robota turniejowego, startującego w zawodach Mini Sumo. Omówiono założenia dyscypliny Mini Sumo oraz konstrukcję mechaniczną, algorytmy sterujące i elektronikę zawartą w robocie.
-
Crowdsourcing and Volunteer Computing as Distributed Approach for Problem Solving
PublicationIn this paper, a combination between volunteer computing and crowdsourcing is presented. Two paradigms of the web computing are described, analyzed and compared in detail: grid computing and volunteer computing. Characteristics of BOINC and its contribution to global Internet processing are shown with the stress put onto applications the system can facilitate and problems it can solve. An alternative instance of a grid computing...
-
Data Mining Applications and Methods in Medicine
PublicationIn this paper we describe the research area of data mining and its applications in medicine. The origins of data mining and its crucial features are shortly presented. We discuss the specificity of medicine as an application area for computer systems. Characteristic features of the medical data are investigated. Common problems in the area are also presented as well as the strengths and capabilities of the data mining methods....
-
Dynamic Data Management Among Multiple Databases for Optimization of Parallel Computations in Heterogeneous HPC Systems
PublicationRapid development of diverse computer architectures and hardware accelerators caused that designing parallel systems faces new problems resulting from their heterogeneity. Our implementation of a parallel system called KernelHive allows to efficiently run applications in a heterogeneous environment consisting of multiple collections of nodes with different types of computing devices. The execution engine of the system is open for...
-
Efficiency comparison of selected endoscopic video analysis algorithms
PublicationIn the paper, selected image analysis algorithms were examined and compared in the task of identifying informative frames, blurry frames, colorectal cancer and healthy tissue on endoscopic videos. In order to standardize the tests, the algorithms were modified by removing from them parts responsible for the classification, and replacing them with Support Vector Machines and Artificial Neural Networks. The tests were performed in...
-
Efficiency Evaluation of High Performance Computing Systems Using Data Envelopment Analysis
PublicationThe paper presents an evaluation method of high performance computing (HPC) systems using multicriteria efficiency analysis. The Data Envelopment Analysis approach was applied and adapted to the specifics of HPC, which enabled us to compare relative efficiency of systems considering simultaneously multiple parameters. The analysis is based on the TOP500 list of world largest supercomputers and their parameters such as: the number...
-
Emotion Recognition Based on Facial Expressions of Gamers
PublicationThis article presents an approach to emotion recognition based on facial expressions of gamers. With application of certain methods crucial features of an analyzed face like eyebrows' shape, eyes and mouth width, height were extracted. Afterwards a group of artificial intelligence methods was applied to classify a given feature set as one of the following emotions: happiness, sadness, anger and fear. The approach presented in this...
-
Endoscopy video analysis algorithms and their independence of rotation , brightness , contrast , color and blur
PublicationThe article presents selected image analysis algorithms for endoscopy videos. Mathematical methods that are part of these algorithms are described, and authors’ claims about the characteristics of these algorithms, such as the independence of rotation, brightness, contrast, etc. are mentioned. Using the common test on the real endoscopic image database and a set of image transformations, the validity of these claims was checked...
-
Evaluation of Path Based Methods for Conceptual Representation of the Text
PublicationTypical text clustering methods use the bag of words (BoW) representation to describe content of documents. However, this method is known to have several limitations. Employing Wikipedia as the lexical knowledge base has shown an improvement of the text representation for data-mining purposes. Promising extensions of that trend employ hierarchical organization of Wikipedia category system. In this paper we propose three path-based...
-
Framework Reuse - Heaven or Hell
PublicationObject-oriented frameworks have almost completely dominated the world of business applications. Frameworks are widely considered to be helpful and are designed to speed up creation of applications. However, when trying to apply a framework for a particular application, it often turns out that this is not as easy as it seemed at the beginning. It takes time to become familiar with the framework, with its concepts and the standard...
-
Genetic Programming for Interaction Efficient Supporting in Volunteer Computing Systems
PublicationVolunteer computing systems provide a middleware for interaction between project owners and great number volunteers. In this chapter, a genetic programming paradigm has been proposed to a multi-objective scheduler design for efficient using some resources of volunteer computers via the web. In a studied problem, genetic scheduler can optimize both a workload of a bottleneck computer and cost of system. Genetic programming has been...
-
How Specific Can We Be with k-NN Classifier?
PublicationThis paper discusses the possibility of designing a two stage classifier for large-scale hierarchical and multilabel text classification task, that will be a compromise between two common approaches to this task. First of it is called big-bang, where there is only one classifier that aims to do all the job at once. Top-down approach is the second popular option, in which at each node of categories’ hierarchy, there is a flat classifier...
-
INDIRECT CONTROL OVER SUBORDINATE UNITS
PublicationDeveloping a game universe usually involves creation of various units which can be both, encountered by a player or controlled by him. There is a number of works considering autonomous behaviors of units wandering around the game world. When it comes to the units controlled by the player, they are often deprived of autonomy and are strictly controlled by the player. This paper presents a concept of units behavior depending on their...
-
Internetowe systemy przetwarzania rozproszonego typu grid w zastosowaniach biznesowych
PublicationSkoncentrowano się na możliwościach wykorzystania oraz integracji rozproszonych mocy obliczeniowych komputerów Internautów w globalnej sieci www. Zaprezentowano paradygmaty sieciowego przetwarzania typu grid computing oraz volunteer computing. Podkreślono istotność tego typu przetwarzania w zagadnieniach wymagających bardzo dużych mocy obliczeniowych. Zaprezentowano przykłady rozwiązań systemowych tego typu: system BOINC, będący...
-
Inżynierowie przyszłości
PublicationW artykule omówiono dwie podstawowe metody kształcenia inżynierów oraz podkreślono rolę we współczesnym świecie inżynierów,zwracając uwagę na rozwój twórczych umiejętności, tak niezbędnych w globalnej gospodarce. Zaakcentowano potrzeby zmian oraz trudności ich realizacji w obecnych warunkach. Wskazano także rozwiązania już wdrażanie na Politechnice Gdańskiej.
-
Klasyfikacja algorytmów wyboru usług w architekturze SOA
PublicationWytwarzanie aplikacji w architekturze zorientowanej na usługi wymaga zastosowania odpowiednich algorytmów wyboru usług, które będą zapewniały wytworzenie aplikacji o wymaganych parametrach jakościowych. Biorąc pod uwagę, że problem wyboru usług ma wykładniczą złożoność obliczeniową, stosowne są zarówno algorytmy dokładne jak i przybliżone. W pracy przedstawiono porównanie i analizę istniejących algorytmów wyboru usług w kompozycji...
-
Knowledge Base Suitable for Answering Questions in Natural Language
PublicationThis paper presents three knowledge bases widely used by researchers coping with natural language processing: OpenCyc, DBpedia and YAGO. They are characterized from the point of view of questions answering system. In this paper a short description of the aforementioned system implementation is also presented.
-
Merging Images from Parallel Depth Cameras
PublicationIn this paper a problem of simultaneous information acquisition from multiple depth cameras is investigated, aiming at obtaining single overall picture containing information from all cameras. The experiments are carried out on Microsoft Kinect devices. A methodology for merging images from multiple positioned in a line cameras is proposed. The method is based on the concept of simulating a view of an imaginary camera covering...
-
Methods of Artificial Intelligence for Prediction and Prevention Crisis Situations in Banking Systems
PublicationIn this paper, a support vector machine has been studied due to prediction of bank crisis. To prevent outcomes of crisis situations, artificial neural networks have been characterized as applied to stock market investments, as well as to test the credibility of the bank's customers. Finally, some numerical experiments have been presented.
-
Modele przetwarzania aplikacji na klastrach obliczeniowych
PublicationW dzisiejszych czasach badania prowadzone w różnych dziedzinach nauki a także różnego rodzaju aplikacje i wdrożone systemy operują na coraz większych wolumenach danych przy jednocześnie rosnących wymaganiach dotyczących czasu wykonania. Z tego powodu coraz częściej do obliczeń wykorzystuje się klastry obliczeniowe, gdzie do dyspozycji użytkowników pozostaje ogromna moc obliczeniowa (liczona w tera a nawet peta flopach), terabajty...
-
Network-aware Data Prefetching Optimization of Computations in a Heterogeneous HPC Framework
PublicationRapid development of diverse computer architectures and hardware accelerators caused that designing parallel systems faces new problems resulting from their heterogeneity. Our implementation of a parallel system called KernelHive allows to efficiently run applications in a heterogeneous environment consisting of multiple collections of nodes with different types of computing devices. The execution engine of the system is open for...
-
Neural Networks, Support Vector Machine and Genetic Algorithms for Autonomous Underwater Robot Support
PublicationIn this paper, artificial neural networks, a classification technique called support vector machine and meta-heuristics genetic algorithm have been considered for development in autonomous underwater robots. Artificial neural networks have been used for seabed modelling as well as support vector machine has been applied for the obstacles classification to avoid some collision problems. Moreover, genetic algorithm has been applied...
-
Obtaining a Well-Trained Artificial Intelligence Algorithm from Cross-Validation in Endoscopy
PublicationThe article shortly discusses endoscopic video analysis problems and artificial intelligence algorithms supporting it. The most common method of efficiency testing of these algorithms is to perform intensive cross-validation. This allows for accurately evaluate their performance of generalization. One of the main problems of this procedure is that there is no simple and universal way of obtaining a specific instance of a well-trained...
-
On Software Unit Testing For Improving Security And Performance Of Distributed Applications
PublicationPerformance and security are software (SW) application attributes situated on the opposite corners of system design. In the most drastic example the most secure component is the one totally isolated from the outside world, with communication performance reduced to zero level (e.g. disconnected physically from the network, placed inside a Faraday cage to eliminate possible wireless accessibility). On the other hand the most performance-optimized...
-
Optimization of Execution Time under Power Consumption Constraints in a Heterogeneous Parallel System with GPUs and CPUs
PublicationThe paper proposes an approach for parallelization of computations across a collection of clusters with heterogeneous nodes with both GPUs and CPUs. The proposed system partitions input data into chunks and assigns to par- ticular devices for processing using OpenCL kernels defined by the user. The sys- tem is able to minimize the execution time of the application while maintaining the power consumption of the utilized GPUs and...
-
Parallel computations in the volunteer based Comcute system
PublicationThe paper presents Comcute which is a novel multi-level implemen- tation of the volunteer based computing paradigm. Comcute was designed to let users donate the computing power of their PCs in a simplified manner, requiring only pointing their web browser at a specific web address and clicking a mouse. The server side appoints several servers to be in charge of execution of particular tasks. Thanks to that the system can survive...
-
Performance Modeling and Prediction of Real Application Workload in a Volunteer-based System
PublicationThe goal of this paper is to present a model that predicts the real workload placed on a volunteer based system by an application, with incorporation of not only performance but also availability of volunteers. The application consists of multiple data packets that need to be processed. Knowing the computational workload demand of a single data packet we show how to estimate the application workload in a volunteer based system. Furthermore,...
-
Possible uses of crisis situation aiding system in virtual world simulation
PublicationMany of the real world crisis situations like spreading fire, hostile units attack, flood, and etc. are commonly used in computer games where a simulation of extensive virtual world is crucial. This paper presents some ideas for possible uses of existing crisis situation aiding system in such environments. Moreover, it shows how this kind of system can be taught during subsequent games with a large number of players. As an example...