Search results for: DATA JOURNALS, DATA PAPERS, RESEARCH AREAS, BIBLIOMETRIC ANALYSIS
-
A Workflow Application for Parallel Processing of Big Data from an Internet Portal
Publication
The paper presents a workflow application for efficient parallel processing of data downloaded from an Internet portal. The workflow partitions input files into subdirectories which are further split for parallel processing by services installed on distinct computer nodes. This way, analysis of the first ready subdirectories can start fast and is handled by services implemented as parallel multithreaded applications using multiple...
-
Artysta i analityk. Big data w przestrzeni kultury [The Artist and the Analyst: Big Data in the Cultural Space]
Publication
The text considers the role of Big Data (enormous data sets) in the study of culture and in its creation. The analysis also covers the impact of this technology on artistic work, including contemporary architecture and urban planning. Scenarios for the potential future role of Big Data in society are presented.
-
Platelet RNA Sequencing Data Through the Lens of Machine Learning
Publication
Liquid biopsies offer minimally invasive diagnosis and monitoring of cancer disease. This biosource is often analyzed using sequencing, which generates highly complex data that can be analyzed with machine learning tools. Nevertheless, validating the clinical applications of such methods is challenging. It requires: (a) using data from many patients; (b) verifying potential bias concerning sample collection; and (c) adding interpretability...
-
Database systems for tomorrow: new challenges and research areas
Publication
New research areas in the database field that will be developed in the near future are presented. The areas discussed include ''Plug and Play'' systems, large federated systems, new database system architectures based on large memory buffers, data and application integration, and semistructured databases.
-
Visualization of events using various kinds of synchronized data for the Border Guard
Publication
The STRADAR project is dedicated to streaming real-time data in a distributed dispatcher and teleinformation system of the Border Guard. The Events Visualization Post is software designed for simultaneous visualization of data of different types in BG headquarters. The software allows the operator to visualize files, images, SMS, SDS, video, audio, and current or archival data on naval situation on digital maps. All the visualized...
-
Evaluation of position estimation based on accelerometer data
Publication
The paper concerns the problem of integrating data from accelerometers. A suitable model of a MEMS accelerometer, which is part of an inertial measurement unit (IMU), is presented. Such units make it possible to measure orientation as well as to localize systems. They also appear to be applicable to system positioning. The main purpose of the paper is to discuss conditions that must be satisfied to calculate the location of the sensor by...
-
Big Data and the Internet of Things in Edge Computing for Smart City
Publication
Requests expressing collective human expectations and outcomes from city service tasks can be partially satisfied by processing Big Data provided to a city cloud via the Internet of Things. To improve the efficiency of city clouds, edge computing has been introduced for Big Data mining. This intelligent and efficient distributed system can be developed for citizens that are supposed to be informed and educated by the...
-
Researching Digital Society: Using Data-Mining to Identify Relevant Themes from an Open Access Journal
Publication
Open Access scholarly literature is scientific output free from economic barriers and copyright restrictions. Using a case study approach, data mining methods and qualitative analysis, the scholarly output and the meta-data of the Open Access eJournal of e-Democracy and Open Government during the time interval 2009–2020 were analysed. Our study was able to identify the most prominent research topics (defined as thematic clusters)...
-
Reversible data hiding in encrypted DICOM images using sorted binary sequences of pixels
Publication
In this paper, a novel reversible data hiding method for encrypted DICOM images is proposed. The method utilizes binary decomposition of the input data paired with a sorting process of the obtained binary sequences to ensure efficient data embedding in each predefined data block for specific most significant bit (MSB) planes while exploiting the properties of run-length encoding. The proposed scheme is lossless, and based on the...
-
Cache service for maps presentation in distributed information data exchange system
Publication
The paper presents a proposed cache implementation for map presentation in a distributed information data exchange system. The concept of the cache service is described in the context of the elements of the distributed information data exchange system which control and present on maps the positions and other identification data of vessels and other suspicious objects on the territorial sea, sea-coast and the internal sea-waters. The proposed...
-
Integration and verification of meteorological observations and NWP model data for the local GNSS tomography
Publication
GNSS meteorology applies the Global Navigation Satellite Systems (GNSS) to derive information about the state of the atmosphere (particularly the troposphere). Tomography is one of the methods used in GNSS meteorology. The input data of GNSS tomography are the signal troposphere delays, the results of GNSS data processing and, additionally, meteorological observations and Numerical Weather Prediction (NWP) model data. Different types...
-
Using EO satellite data in Safe City and Coastal Zone web-GIS
Publication
The paper presents a novel design of a web-based Safe City & Coastal Zone GIS (SCCZ-GIS) which integrates data acquired from different remote sensing and geospatial data sources for monitoring the security of the coastal zone, its inhabitants and Critical Infrastructure. The system utilizes several innovative technologies and directly co-operates with different remote sensing data sources and services, like a satellite ground station...
-
Potential Saving of Antibiotics for Respiratory Infections in Several European Countries: Insights from Market Research Data
Publication
-
Early Oceanographical Data Collected by the Institute of Oceanography, University of Gdańsk
Publication
Three data sets entitled Water currents in Głębinka Passage in late spring of 1975, Hydrometeorological and hydrochemical conditions in the Gulf of Gdańsk in the vicinity of Vistula river mouth in July of 1977, and Gulf of Gdańsk monitoring conducted by the Institute of Oceanography, University of Gdańsk, in 1981–1994 contain archival field measurement results from the Gulf of Gdańsk (the southern Baltic). The data can be used...
-
Model of an Integration Bus of Data and Ontologies of Smart Cities Processes
Publication
This paper presents a model of an integration bus used in the design of Smart Cities system architectures. The model of such a bus becomes necessary when designing high-level architectures, within which the silo processes of the organization should be seen from the perspective of its ontology. For such a bus to be used by any city, a generic solution was proposed which can be implemented as a whole or in part depending on the requirements...
-
Impact of AI-Based Tools and Urban Big Data Analytics on the Design and Planning of Cities
Publication
Wide access to large volumes of urban big data and artificial intelligence (AI)-based tools allow performing new analyses that were previously impossible due to the lack of data or their high aggregation. This paper aims to assess the possibilities of the use of urban big data analytics based on AI-related tools to support the design and planning of cities. To this end, the author introduces a conceptual framework to assess the...
-
Prediction of flow boiling heat transfer data for R134a, R600a and R290 in minichannels
Publication
The paper presents an analysis of the results of calculations using a model to predict flow boiling of refrigerants such as R134a, R600a and R290. The latter two fluids were not used in developing the model's semiempirical correction. For that reason the model was verified against the present experimental data. The experimental research was conducted for a full range of quality variation and a relatively wide range of mass velocity....
-
A Novel Spatio–Temporal Deep Learning Vehicle Turns Detection Scheme Using GPS-Only Data
Publication
Whether the computer is driving your car or you are, advanced driver assistance systems (ADAS) come into play on all levels, from weather monitoring to safety. These modern-day ADASs use various assisting tools for drivers to keep the journey safe; these sophisticated tools provide early signals of numerous events, such as road conditions, emerging traffic scenarios, and weather warnings. Many urban applications, such as car-sharing...
-
EvOLAP Graph – Evolution and OLAP-Aware Graph Data Model
Publication
The objective of this paper is to propose a graph model that would be suitable for providing OLAP features on graph databases. The included features allow for a multidimensional and multilevel view on data and support analytical queries on operational and historical graph data. In contrast to many existing approaches tailored for static graphs, the paper addresses the issue of a changing graph schema. The model, named Evolution...
-
Communications in Statistics: Case Studies, Data Analysis and Applications
Journals
-
Advances in Architectures, Big Data, and Machine Learning Techniques for Complex Internet of Things Systems
Publication
The field of Big Data is rapidly developing with a lot of ongoing research, which will likely continue to expand in the future. A crucial part of this is Knowledge Discovery from Data (KDD), also known as the Knowledge Discovery Process (KDP). This process is a very complex procedure, and for that reason it is essential to divide it into several steps (Figure 1). Some authors use five steps to describe this procedure, whereas others...
-
Big Data Processing by Volunteer Computing Supported by Intelligent Agents
Publication
In this paper, volunteer computing systems have been proposed for big data processing. Moreover, intelligent agents have been developed to improve the efficiency of a grid middleware layer. In consequence, an intelligent volunteer grid has been equipped with agents that belong to five sets. The first one consists of some user tasks. Furthermore, two kinds of semi-intelligent tasks have been introduced to implement a middleware...
-
Emulator and simulator of Terma SCANTER and ARPA radar data server
Publication
The software solutions presented in this paper generate real-time data compatible with the ARPA radar standard as well as with the Terma SCANTER 2001 radar cooperating with a Video Distribution and Tracking (VDT) server. Two different approaches to this problem are considered: emulation based on the data captured from real devices and simulation of objects on the sea. For both of them architecture, implementation details and functional test results...
-
Improving css-KNN Classification Performance by Shifts in Training Data
Publication
This paper presents a new approach to improve the performance of a css-k-NN classifier for categorization of text documents. The css-k-NN classifier (i.e., a threshold-based variation of a standard k-NN classifier we proposed in [1]) is a lazy-learning instance-based classifier. It does not have parameters associated with features and/or classes of objects, that would be optimized during off-line learning. In this paper we propose...
-
Assessing Highway Travel Time Reliability using Probe Vehicle Data
Publication
Probe vehicle data (also known as “floating car data”) can be used to analyze travel time reliability of an existing road corridor in order to determine where, when, and how often traffic congestion occurs at particular road segments. The aim of the study is to find the best reliability performance measures for assessing congestion frequency and severity based on probe data. Pilot surveys conducted on the A2 motorway in Poland confirm...
-
Synteza algorytmu detekcji pęknięcia szyby metodą ''data fission - data fusion'' [Synthesis of a glass-break detection algorithm using the ''data fission - data fusion'' method]
Publication
The design assumptions and the synthesis process of the detection algorithm for an acoustic glass-break detector are presented. Data fission and data fusion techniques were used in constructing the algorithm. The research tools used, the glass-break model developed, and the test results of the final detection algorithm are presented. The method has been applied in the construction of an acoustic glass-break detector used in alarm systems.
-
Simulation of parallel similarity measure computations for large data sets
Publication
The paper presents our approach to implementation of similarity measure for big data analysis in a parallel environment. We describe the algorithm for parallelisation of the computations. We provide results from a real MPI application for computations of similarity measures as well as results achieved with our simulation software. The simulation environment allows us to model parallel systems of various sizes with various components...
-
DATA MINING STAC 2022/2023
e-Learning Courses
STAC
-
Advanced Data Mining 2022/23
e-Learning Courses
-
Numerical Methods - Data Engineering - 2023
e-Learning Courses
engineering studies, computer science and data engineering
-
Business Data Analytics-2024 /2025
e-Learning Courses
-
Analiza danych typu Big Data [Big Data Analysis]
e-Learning Courses
-
Numerical Methods - Data Engineering - 2024
e-Learning Courses
Data engineering
-
Business Data Analytics-2023 /2024
e-Learning Courses
-
Inżynieria Danych Data Science 2024 [Data Engineering, Data Science 2024]
e-Learning Courses
-
Advanced Data Mining 2023/24
e-Learning Courses
-
Security information sharing for smart grids: Developing the right data model
Publication
The smart grid raises new security concerns which require novel solutions. It is commonly agreed that, to protect the grid, effective collaboration and information sharing between the relevant stakeholders are a prerequisite. Developing a security information sharing platform for the smart grid is a new research direction which poses several challenges related to the highly distributed and heterogeneous character of the grid. In...
-
thestats: An Open-Data R Package for Exploring Turkish Higher Education Statistics
Publication
There are open datasets available for official statistics, finance, education, and a variety of other domains. The open datasets are published by third-party vendors as well as official authorities. For example, The Turkish Higher Education Council maintains a web portal dedicated to higher education in Türkiye. Detailed datasets about universities, faculties, and departments can be obtained from the portal. Using the data provided...
-
Using Isolation Forest and Alternative Data Products to Overcome Ground Truth Data Scarcity for Improved Deep Learning-based Agricultural Land Use Classification Models
Publication
High-quality labelled datasets represent a cornerstone in the development of deep learning models for land use classification. The high cost of data collection, the inherent errors introduced during data mapping efforts, the lack of local knowledge, and the spatial variability of the data hinder the development of accurate and spatially-transferable deep learning models in the context of agriculture. In this paper, we investigate...
-
Application of Web-GIS for Dissemination and 3D Visualization of Large-Volume LiDAR Data
Publication
The increasing number of digital data sources, which allow for semi-automatic collection and storage of information regarding various aspects of life has recently granted a considerable rise in popularity to the term “Big data”. As far as geospatial data is concerned, one of the major sources of Big data are Light Detection And Ranging (LiDAR) scanners, which produce high resolution three-dimensional data on a local scale. The...
-
MAPSERVER – INFORMATION FLOW MANAGEMENT SOFTWARE FOR THE BORDER GUARD DISTRIBUTED DATA EXCHANGE SYSTEM
Publication
In this paper the architecture of the software designed for management of position and identification data of floating and flying objects in maritime areas controlled by the Polish Border Guard is presented. The software was designed for managing information stored in a distributed system, with two variants of the software: one for a mobile device installed on a vessel, an airplane or a car, and a second for a central server. The details...
-
The role and importance of WIMAX mobile system as a high-performance data transfer technology in wireless sensor networks for wide area monitoring applications
Publication
The study discusses the basic features and functional design of the WiMAX Mobile system, based on the IEEE 802.16e (Release 1.5 Rev. 2.0) standard. The analysis has been made in terms of the ability to use this system to transmit video streams related to monitoring of large agglomeration areas. What is more, the study includes a comparison of the technical parameters of the WiMAX Mobile system with competitive systems such as HSPA+ and UMTS-LTE, which...
-
Preeclampsia Risk Prediction Using Machine Learning Methods Trained on Synthetic Data
Publication
This paper describes a research study that investigates the use of machine learning algorithms on synthetic data to classify the risk of developing preeclampsia in pregnant women. Synthetic datasets were generated based on parameter distributions from three real patient studies. Four models were compared: XGBoost, Support Vector Machine (SVM), Random Forest, and Explainable Boosting Machines (EBM). The study found that the XGBoost...
-
Low-Cost Data-Driven Surrogate Modeling of Antenna Structures by Constrained Sampling
Publication
Full-wave electromagnetic (EM) analysis has become one of the major design tools for contemporary antenna structures. Although reliable, it is computationally expensive which makes automated simulation-driven antenna design (e.g., parametric optimization) difficult. This difficulty can be alleviated by utilization of fast and accurate replacement models (surrogates). Unfortunately, conventional data-driven modeling of antennas...
-
Description logic based generator of data-centric applications
Publication
The knowledge stored in Ontology Management Systems (OMS) that originally has the form of expressions, can be seen as a user application specification or as knowledge provided by an expert. The generator of applications discussed in this paper is defined as a program that automatically generates an application that meets a certain specification stored in OMS. It is shown that it is possible to build a user interface for data management...
-
Assessing business process complexity based on textual data: Evidence from ITIL IT ticket processing
Publication
Purpose: This study aims to draw the attention of business process management (BPM) research and practice to the textual data generated in the processes and the potential of meaningful insights extraction. The authors apply standard natural language processing (NLP) approaches to gain valuable knowledge in the form of business process (BP) complexity concept suggested in the study. It is built on the objective, subjective and meta-knowledge...
-
Validating data acquired with experimental multimodal biometric system installed in bank branches
Publication
An experimental system was engineered and implemented in 100 copies inside a real banking environment comprising: dynamic handwritten signature verification, face recognition, bank client voice recognition and hand vein distribution verification. The main purpose of the presented research was to analyze questionnaire responses reflecting user opinions on: comfort, ergonomics, intuitiveness and other aspects of the biometric enrollment...
-
X-ray images of Baltic herring. Data analysis
Open Research Data
Based on the developed methodology for the: (i) optimal method of catching, (ii) transporting and storing fish, (iii) measuring and (iv) analyzing X-rays images, the existing collection of X-ray images of Baltic herring, caught in October 2002 during the Swedish component of the Baltic International Acoustic Survey (BIAS) in the Baltic proper (ICES...