Search results for: DATA
-
The molecular entities in linked data dataset
Publication -
A Survey on the Datasets and Algorithms for Satellite Data Applications
Publication: This survey compiles insights and describes datasets and algorithms for applications based on remote sensing. The goal of this review is twofold: datasets review for particular groups of tasks and high-level steps of data flow between satellite instruments and end applications from an implementation and development perspective. The article outlines the generalized data processing pipelines, taking into account the variations in...
-
Data Analysis in Bridge of Data
Publication: The chapter presents the data analysis aspects of the Bridge of Data project. The software framework used, Jupyter, and its configuration are presented. The solution’s architecture, including the TRYTON supercomputer as the underlying infrastructure, is described. The use case templates provided by the Stat-reducer application are presented, including data analysis related to spatial points’ cloud-, audio- and wind-related research.
-
A Text as a Set of Research Data. A Number of Aspects of Data Acquisition and Creation of Datasets in Neo-Latin Studies
Publication: In this paper, the authors, who specialise in part in neo-Latin studies and the history of early modern education, share their experiences of collecting sources for Open Research Data sets under the Bridge of Data project. On the basis of inscription texts from St. Mary’s Church in Gdańsk, they created 29 Open Research Data sets. In turn, the text of the lectures of the Gdańsk scholar Michael Christoph Hanow, Praecepta de arte...
-
Music Data Processing and Mining in Large Databases for Active Media
Publication: The aim of this paper was to investigate the problem of music data processing and mining in large databases. Tests were performed on a large database that included approximately 30000 audio files divided into 11 classes corresponding to music genres with different cardinalities. Every audio file was described by a 173-element feature vector. To reduce the dimensionality of data the Principal Component Analysis (PCA) with variable...
-
Harmony Search for Data Mining with Big Data
Publication: In this paper, some harmony search algorithms have been proposed for data mining with big data. Three areas of big data processing have been studied to apply new metaheuristics. The first problem is related to MapReduce architecture that can be supported by a team of harmony search agents in grid infrastructure. The second dilemma involves development of harmony search in preprocessing of data series before data mining. Moreover,...
-
Data reduction and stacking for imbalanced data classification
Publication -
Data governance: Organizing data for trustworthy Artificial Intelligence
Publication: The rise of Big, Open and Linked Data (BOLD) enables Big Data Algorithmic Systems (BDAS) which are often based on machine learning, neural networks and other forms of Artificial Intelligence (AI). As such systems are increasingly requested to make decisions that are consequential to individuals, communities and society at large, their failures cannot be tolerated, and they are subject to stringent regulatory and ethical requirements....
-
Data Reduction Algorithm for Machine Learning and Data Mining
Publication -
Combining Road Network Data from OpenStreetMap with an Authoritative Database
Publication: Computer modeling of road networks requires a detailed and up-to-date dataset. This paper proposes a method of combining authoritative databases with the OpenStreetMap (OSM) system. The complete route is established by finding paths in the graph constructed from partial data obtained from OSM. In order to correlate data from both sources, a method of coordinate conversion is proposed. The algorithm queries road data from OSM and provides...
-
DATA INTEROPERABILITY AND THE OPEN DATA ECOSYSTEM: ROLES AND RESEARCH AREAS
Publication: Sustainability and value-creation are considered important parameters to measure the success of an open data system. Unfortunately, existing open data systems are not meeting their promises to achieve a sustainable and value-based open data system. Van Loenen et al. (2021) proposed a sustainable and value-creating open data ecosystem. According to their study, the open data ecosystem needs to be user-driven, inclusive, circular,...
-
Asking Data in a Controlled Way with Ask Data Anything NQL
Publication: While collecting data requires storing it, understanding its structure requires data mining. Business Intelligence (BI) enables us to make intelligent, data-driven decisions by means of a set of tools that allows the creation of a potentially unlimited number of machine-generated, data-driven reports, which are calculated by a machine as a response to queries specified by humans. Natural Query Languages...
-
Data Extraction, Transformation and Loading process in data warehouse development.
Publication: The basic elements of the data warehouse design process are discussed.
-
Machine Learning and data mining tools applied for databases of low number of records
Publication -
Data librarian and data steward – new tasks and responsibilities of academic libraries in the context of Open Research Data implementation in Poland
Publication: Thesis/Objective – The policy of Open Access (OA) for researching resources in Europe has been implemented for more than 10 years. The first recommendations concerning providing OA to scientific materials were defined during the implementation of the 7th Framework Programme. Introducing another set of recommendations concerning OA to research data was the next stage. The recommendations were transformed into obligations under the...
-
Managing Data from Heterogeneous Data Sources Using Knowledge Layer
Publication -
Atmospheric emissions of POPs in Europe - a discussion of existing data and data need
Publication: It is proposed that future emission inventory and reporting schemes be designed to meet the needs of both the administration responsible for the emission reduction strategy and the scientists who use emission inventories for further research. The current state of POP emission inventories was assessed as unsatisfactory, and further improvements are necessary to improve the reliability of the results...
-
Managing data from heterogeneous data sources using knowledge layer
Publication: When integrating data using ontologies, it is important to manage data stored in external sources in the same way as data stored in the Knowledge Base. The cartographic knowledge representation method presented in previous works allows inference from data stored in the Knowledge Base. The solution presented in this work makes it possible to use the cartographic method for inference from...
-
Big Data i 5V – nowe wyzwania w świecie danych (Big Data and 5V – New Challenges in the World of Data)
Publication: The types of data that make up Big Data sets include data generated by users of web portals, data describing transactions carried out over the Internet, scientific data (biological, astronomical, physical measurements, etc.), data generated by robots as a result of their automatic crawling of the Internet (Web mining, Web crawling), graph data depicting links between web pages, etc. Usually,...
-
Musical Instrument Tagging Using Data Augmentation and Effective Noisy Data Processing
Publication: Developing signal processing methods to extract information automatically has potential in several applications, for example searching for multimedia based on its audio content, making context-aware mobile applications (e.g., tuning apps), or pre-processing for an automatic mixing system. However, the last-mentioned application needs a significant amount of research to reliably recognize real musical instruments in recordings....
-
Manufacturing Data Analysis in Internet of Things/Internet of Data (IoT/IoD) Scenario
Publication: Computer integrated manufacturing (CIM) has enormous benefits as it increases the rate of production, reduces errors and production waste, and streamlines manufacturing sub-systems. However, there are some new challenges related to CIM operating in the Internet of Things/Internet of Data (IoT/IoD) scenarios associated with Industry 4.0 and cyber-physical systems. The main challenge is to deal with the massive volume of data flowing...
-
Synteza algorytmu detekcji pęknięcia szyby metodą ''data fission - data fusion'' (Synthesis of a Glass-Break Detection Algorithm Using the ''data fission - data fusion'' Method)
Publication: The design assumptions and the synthesis process of the detection algorithm for an acoustic glass-break detector are presented. Data fission and data fusion techniques were used in constructing the algorithm. The research tools used, the developed glass-breakage model, and the test results of the final detection algorithm are presented. The method was applied in the construction of an acoustic glass-break detector used in alarm systems.
-
Database for integration of information in distributed data exchange system elements of Border Guard
Publication: The paper presents the database solution for integration of information in distributed data exchange system elements of the Polish Border Guard. The proposed database solution is described in the context of data exchange system elements which control position and store identification data of vessels (fishing, sports and sailing boats) and other suspicious objects on the territorial sea, sea-coast and the internal sea-waters controlled...
-
Towards High-Value Datasets Determination for Data-Driven Development: A Systematic Literature Review
Publication: Open government data (OGD) is seen as a political and socio-economic phenomenon that promises to promote civic engagement and stimulate public sector innovations in various areas of public life. To bring the expected benefits, data must be reused and transformed into value-added products or services. This, in turn, sets another precondition for data that are expected to not only be available and comply with open data principles,...
-
Enhanced uniform data sampling for constrained data‐driven modeling of antenna input characteristics
Publication: Data-driven surrogates are the most popular replacement models utilized in many fields of engineering and science, including design of microwave and antenna structures. The primary practical issue is a curse of dimensionality which limits the number of independent parameters that can be accounted for in the modelling process. Recently, a performance-driven modelling technique has been proposed where the constrained domain of the...
-
Artificial intelligence and health-related data: The patient’s best interest and data ownership dilemma
Publication -
BIG PROBLEMS WITH BIG DATA
Publication: The article presents an overview of the most important issues related to the phenomenon called big data. The characteristics of big data concerning the data itself and the data sources are presented. Then, the big data life cycle concept is formulated. The next sections focus on two big data technologies: MapReduce for big data processing and NoSQL databases for big data storage.
-
The Bridge of Data Project Objectives
Publication: Open Research Data (ORD) is one of the emerging trends for researchers across the globe. However, it has to be stressed that the level of implementation and awareness of ORD varies between countries. Many initiatives have been created in Polish scientific institutions to support the process of opening publications. These are mainly Open Access (OA) repositories, implementing the so-called green road of OA. However, only a few universities...
-
Processing of Satellite Data in the Cloud
Publication: The dynamic development of digital technologies, especially those dedicated to devices generating large data streams, such as all kinds of measurement equipment (temperature and humidity sensors, cameras, radio-telescopes and satellites – Internet of Things) enables more in-depth analysis of the surrounding reality, including better understanding of various natural phenomena, starting from atomic level reactions, through macroscopic...
-
Distributed Learning with Data Reduction
Publication -
Differential analysis of impedance data
Publication -
DATA MINING IN CONSTRUCTION RESEARCH
Publication -
Dynamic Data Management Among Multiple Databases for Optimization of Parallel Computations in Heterogeneous HPC Systems
Publication: The rapid development of diverse computer architectures and hardware accelerators means that the design of parallel systems faces new problems resulting from their heterogeneity. Our implementation of a parallel system called KernelHive allows applications to be run efficiently in a heterogeneous environment consisting of multiple collections of nodes with different types of computing devices. The execution engine of the system is open for...
-
Client-side versus server-side geographic data processing performance comparison: Data and code
Publication: The data and code presented in this article are related to the research article entitled “Analysis of Server-side and Client-side Web-GIS data processing methods on the example of JTS and JSTS using open data from OSM and Geoportal” (Kulawiak et al., 2019). The provided 12 datasets include multi-point and multi-polygon data of different scales and volumes, representing real-world geographic features. The datasets cover the area...
-
Using Synchronously Registered Biosignals Dataset for Teaching Basics of Medical Data Analysis – Case Study
Publication: Medical data analysis and processing rely strongly on the data quality itself. The correct data registration allows many unnecessary steps in data processing to be avoided. Moreover, it takes a certain amount of experience to acquire data that can produce replicable results. Because consistency is crucial in the teaching process, students have access to pre-recorded real data without the necessity of using additional equipment...
-
Challenges of Comparing Marine Microbiome Community Composition Data Provided by Different Commercial Laboratories and Classification Databases
Publication -
DATA JOURNALS AND DATA PAPERS IN VARIOUS RESEARCH AREAS AND SCIENTIFIC DISCIPLINES – BIBLIOMETRIC ANALYSIS BASED ON INCITES
Publication: The main aim of this work is to provide insight into a bibliometric analysis of Data Journals and Data Papers in terms of research areas, disciplines, publication year and country. In particular, we calculated many bibliometric indicators, especially: the number of publications and citations. Furthermore, this work also investigated the top 20 journals in which scientists published the largest number of Data Papers. It was found...
-
Streaming Real-time Data in Distributed Dispatcher and Teleinformation Systems for Visualization of Multimedia Data of the Border Guard
Publication: Surveillance of the sea borders is a very important task for the Border Guard. This task can be facilitated with the use of technology that enables gathering information from distributed sources as well as its supervision and visualization. This task can be accomplished using a technology that allows information to be collected from distributed sensors of different...
-
RPC communication layer and introduction to data protection for embedded PC based control and data acquisition module
Publication -
A survey of medical researchers indicates poor awareness of research data management processes and a role for data librarians
Publication -
Dis/Trust and data-driven technologies
Publication: This concept paper contextualises, defines, and systematises the concepts of trust and distrust (and their interrelations), providing a critical review of existing literature so as to identify gaps, disjuncture, and continuities in the use of these concepts across the social sciences and in the context of the consolidation of the digital society. Firstly, the development of the concept of trust is explored by looking at its use...
-
Data Mining Applications and Methods in Medicine
Publication: In this paper we describe the research area of data mining and its applications in medicine. The origins of data mining and its crucial features are briefly presented. We discuss the specificity of medicine as an application area for computer systems. Characteristic features of the medical data are investigated. Common problems in the area are also presented as well as the strengths and capabilities of the data mining methods....
-
Sharing research data across disciplines
Publication: This monograph is a collection of experiences gathered by the team implementing the Bridge of Data project. However, it is not just a simple summary of the project implementation. It shows and systematizes the substantive and technical works performed by the teams and several issues related to data management itself in various disciplines, represented by members of the scientific team and other researchers from partner universities. The...
-
Radar data fusion in the STRADAR system
Publication: The main task of the Polish Border Guard is protection of the country’s border which requires utilization of multimedia surveillance systems automatically gathering, processing and sharing various data. The paper presents such a system developed for the Maritime Division of the Polish Border Guard within the STRADAR project and the problem of fusion of radar data in this system. The system, apart from providing communication means,...
-
Big Data in Regenerative Urban Design
Publication: Why does the use of Big Data in regenerative planning matter? The aim of this chapter is to study under what conditions Big Data can be integrated into regenerative design and sustainable planning. The authors seek to answer how – when related to the ecosystem and to human activities – Big Data can be used to: • both shape policies that support the development of regenerative human settlements, • support restorative design for practitioners...
-
The Use of Big Data in Regenerative Planning
Publication: With the increasing significance of Big Data sources and their reliability for studying current urban development processes, new possibilities have appeared for analyzing the urban planning of contemporary cities. At the same time, the new urban development paradigm related to regenerative sustainability requires a new approach and hence a better understanding of the processes changing cities today, which will allow more efficient...
-
Collaborative Data Acquisition and Learning Support
Publication: With the constant development of neural networks, traditional algorithms relying on data structures lose their significance as more and more solutions are using AI rather than traditional algorithms. This in turn requires a lot of correctly annotated and informative data samples. In this paper, we propose a crowdsourcing based approach for data acquisition and tagging with support for Active Learning where the system acts as an...
-
Active Learning Based on Crowdsourced Data
Publication: The paper proposes a crowdsourcing-based approach for annotated data acquisition and means to support Active Learning training approach. In the proposed solution, aimed at data engineers, the knowledge of the crowd serves as an oracle that is able to judge whether the given sample is informative or not. The proposed solution reduces the amount of work needed to annotate large sets of data. Furthermore, it allows a perpetual increase...
-
Linking music data in executable documents
Publication: This paper presents the application of Interactive Open Document Architecture (IODA) to music and video data. This architecture was designed to create multilayer documents which consist of many files. The paper shows the method of creating media documents on the basis of IODA. These kinds of documents were called IODA Media Documents (IMD). IMD have links that connect many different kinds of files containing music and video data....
-
Fundamentals of Data-Driven Surrogate Modeling
Publication: The primary topic of the book is surrogate modeling and surrogate-based design of high-frequency structures. The purpose of the first two chapters is to provide the reader with an overview of the two most important classes of modeling methods, data-driven (or approximation), as well as physics-based ones. These are covered in Chapters 1 and 2, respectively. The remaining parts of the book give an exposition of the specific aspects...