Filters
total: 7318
filtered: 4258
-
Catalog
- Publications 4258 available results
- Journals 91 available results
- Conferences 77 available results
- People 105 available results
- Inventions 1 available results
- Projects 6 available results
- Laboratories 1 available results
- e-Learning Courses 153 available results
- Events 22 available results
- Open Research Data 2604 available results
Chosen catalog filters
displaying 1000 best results Help
Search results for: MISSING DATA IMPUTATION
-
Missing Verification of Source Data in Hypertension Research: The HYGIA PROJECT in Perspective
Publication -
Chemometric exploration of sea water chemical component data sets with missing elements
Publication -
Different philosophical approaches to estimating missing data in AHP frameworks for uncertainty representation in risk assessments
PublicationAHP (ang. Analytic Hierarchy Process) jest jedną z metod szeroko stosowaną w wieloatrybutowym podejmowaniu decyzji. Zwykle ekspert lub grupa ekspertów jest proszona o wyrażenie swojej subiektywnej opinii o każdej z par wariantów decyzyjnych. Na tej podstawie tworzone są tzw. macierze ocen. Zdarza się często, że macierze takie są niekompletne i wtedy występuje problem brakujących danych. Referat dotyczy wybranych metod ich uzupełnienia.
-
Cavitation based cleaner technologies for biodiesel production and processing of hydrocarbon streams: A perspective on key fundamentals, missing process data and economic feasibility – A review
PublicationThe present review emphasizes the role of hydrodynamic cavitation (HC) and acoustic cavitation in clean and green technologies for selected fuels (of hydrocarbon origins such as gasoline, naphtha, diesel, heavy oil, and crude oil) processing applications including biodiesel production. Herein, the role of cavitation reactors, their geometrical parameters, physicochemical properties of liquid media, liquid oxidants, catalyst loading,...
-
Evaluating the risk of endometriosis based on patients’ self-assessment questionnaires
PublicationBackground Endometriosis is a condition that significantly affects the quality of life of about 10 % of reproductive-aged women. It is characterized by the presence of tissue similar to the uterine lining (endometrium) outside the uterus, which can lead lead scarring, adhesions, pain, and fertility issues. While numerous factors associated with endometriosis are documented, a wide range of symptoms may still be undiscovered. Methods In...
-
Identification of category associations using a multilabel classifier
PublicationDescription of the data using categories allows one to describe it on a higher abstraction level. In this way, we can operate on aggregated groups of the information, allowing one to see relationships that do not appear explicit when we analyze the individual objects separately. In this paper we present automatic identification of the associations between categories used for organization of the textual data. As experimental data...
-
Expectation-Maximization Model for Substitution of Missing Values Characterizing Greenness of Organic Solvents
PublicationOrganic solvents are ubiquitous in chemical laboratories and the Green Chemistry trend forces their detailed assessments in terms of greenness. Unfortunately, some of them are not fully characterized, especially in terms of toxicological endpoints that are time consuming and expensive to be determined. Missing values in the datasets are serious obstacles, as they prevent the full greenness characterization of chemicals. A featured...
-
Improving css-KNN Classification Performance by Shifts in Training Data
PublicationThis paper presents a new approach to improve the performance of a css-k-NN classifier for categorization of text documents. The css-k-NN classifier (i.e., a threshold-based variation of a standard k-NN classifier we proposed in [1]) is a lazy-learning instance-based classifier. It does not have parameters associated with features and/or classes of objects, that would be optimized during off-line learning. In this paper we propose...
-
Client-side versus server-side geographic data processing performance comparison: Data and code
PublicationThe data and code presented in this article are related to the research article entitled “Analysis of Server-side and Client-side Web-GIS data processing methods on the example of JTS and JSTS using open data from OSM and Geoportal” (Kulawiak et al., 2019). The provided 12 datasets include multi-point and multi-polygon data of different scales and volumes, representing real-world geographic features. The datasets cover the area...
-
Participatory Budgeting in Poland – Missing Link in Urban Regeneration Process
PublicationIn last thirty years Poland has gone a long way toward democracy and decentralization. Role of public participation in planning is increasing rapidly and recently many new instruments of empowering the community is being introduced, participatory budgeting is one of the most important. On the other hand, urban regeneration is one of the most important challenges of polish cities are facing. Technical and transport infrastructure...
-
Big Data i 5V – nowe wyzwania w świecie danych (Big Data and 5V – New Challenges in the World of Data)
PublicationRodzaje danych, składające się na zbiory typu Big Data, to m.in. dane generowane przez użytkowników portali internetowych, dane opisujące transakcje dokonywane poprzez Internet, dane naukowe (biologiczne, astronomiczne, pomiary fizyczne itp.), dane generowane przez roboty w wyniku automatycznego przeszukiwania przez nie Internetu (Web mining, Web crawling), dane grafowe obrazujące powiązania pomiędzy stronami WWW itd. Zazwyczaj,...
-
Integrated Sectors - Diversified Earnings: The (Missing) Impact of Offshoring on Wages and Wage Convergence in the EU27
PublicationThis paper assesses the impact of international outsourcing/offshoring practices on the process of wage equalization across manufacturing sectors in a sample of EU27 economies (1995-2009). We discriminate between heterogeneous wage effects on different skill categories of workers (low, medium and high skill). The main focus is on the labour market outcomes of vertical integration, so we augment a model of conditional wage convergence...
-
Missing Puzzle Pieces in Dementia Research: HCN Channels and Theta Oscillations
PublicationIncreasing evidence indicates a role of hyperpolarization activated cation (HCN) channels in controlling the resting membrane potential, pacemaker activity, memory formation, sleep, and arousal. Their disfunction may be associated with the development of epilepsy and age-related memory decline. Neuronal hyperexcitability involved in epileptogenesis and EEG desynchronization occur in the course of dementia in human Alzheimer’s Disease...
-
A Perspective on Missing Aspects in Ongoing Purification Research towards Melissa officinalis
PublicationMelissa officinalis L. is a medicinal plant used worldwide for ethno-medical purposes. Today, it is grown everywhere; while it is known to originate from Southern Europe, it is now found around the world, from North America to New Zealand. The biological properties of this medicinal plant are mainly related to its high content of phytochemical (bioactive) compounds, such as flavonoids, polyphenolic compounds, aldehydes, glycosides...
-
Data Analysis in Bridge of Data
PublicationThe chapter presents the data analysis aspects of the Bridge of Data project. The software framework used, Jupyter, and its configuration are presented. The solution’s architecture, including the TRYTON supercomputer as the underlying infrastructure, is described. The use case templates provided by the Stat-reducer application are presented, including data analysis related to spatial points’ cloud-, audio- and wind-related research.
-
Data librarian and data steward – new tasks and responsibilities of academic libraries in the context of Open Research Data implementation in Poland
PublicationThesis/Objective – The policy of Open Access (OA) for researching resources in Europe has been implemented for more than 10 years. The first recommendations concerning providing OA to scientific materials were defined during the implementation of the 7th Framework Programme. Introducing another set of recommendations concerning OA to research data was the next stage. The recommendations were transformed into obligations under the...
-
Harmony Search for Data Mining with Big Data
PublicationIn this paper, some harmony search algorithms have been proposed for data mining with big data. Three areas of big data processing have been studied to apply new metaheuristics. The first problem is related to MapReduce architecture that can be supported by a team of harmony search agents in grid infrastructure. The second dilemma involves development of harmony search in preprocessing of data series before data mining. Moreover,...
-
The molecular entities in linked data dataset
Publication -
Data governance: Organizing data for trustworthy Artificial Intelligence
PublicationThe rise of Big, Open and Linked Data (BOLD) enables Big Data Algorithmic Systems (BDAS) which are often based on machine learning, neural networks and other forms of Artificial Intelligence (AI). As such systems are increasingly requested to make decisions that are consequential to individuals, communities and society at large, their failures cannot be tolerated, and they are subject to stringent regulatory and ethical requirements....
-
DATA INTEROPERABILITY AND THE OPEN DATA ECOSYSTEM: ROLES AND RESEARCH AREAS
PublicationSustainability and value-creation are considered important parameters to measure the success of an open data system. Unfortunately, existing open data systems are not meeting their promises to achieve a sustainable and value-based open data system. Van Loenen et al. (2021) proposed a sustainable and value-creating open data ecosystem. According to their study, the open data ecosystem needs to be user-driven, inclusive, circular,...
-
Missing the sweet spot: one of the two N-glycans on human Gb3/CD77 synthase is expendable
Publication -
Asking Data in a Controlled Way with Ask Data Anything NQL
PublicationWhile to collect data, it is necessary to store it, to understand its structure it is necessary to do data-mining. Business Intelligence (BI) enables us to make intelligent, data-driven decisions by the mean of a set of tools that allows the creation of a potentially unlimited number of machine-generated, data-driven reports, which are calculated by a machine as a response to queries specified by humans. Natural Query Languages...
-
Processing of LiDAR and Multibeam Sonar Point Cloud Data for 3D Surface and Object Shape Reconstruction
PublicationUnorganised point cloud dataset, as a transitional data model in several applications, usually contains a considerable amount of undesirable irregularities, such as strong variability of local point density, missing data, overlapping points and noise caused by scattering characteristics of the environment. For these reasons, further processing of such data, e.g. for construction of higher order geometric models of the topography...
-
Methods for quality improvement of multibeam and LiDAR point cloud data in the context of 3D surface reconstruction
PublicationPoint cloud dataset is the transitional data model used in several marine and land remote-sensing applications. During further steps of processing, the transformation of point cloud spatial data to more complex models containing higher order geometric structures like edges and facets may be possible, if an appropriate quality level of input data is provided. Point cloud datasets usually contain a considerable amount of undesirable...
-
BIG PROBLEMS WITH BIG DATA
PublicationThe article presents an overview of the most important issues related to the phenomenon called big data. The characteristics of big data concerning the data itself and the data sources are presented. Then, the big data life cycle concept is formulated. The next sections focus on two big data technologies: MapReduce for big data processing and NoSQL databases for big data storage.
-
Musical Instrument Tagging Using Data Augmentation and Effective Noisy Data Processing
PublicationDeveloping signal processing methods to extract information automatically has potential in several applications, for example searching for multimedia based on its audio content, making context-aware mobile applications (e.g., tuning apps), or pre-processing for an automatic mixing system. However, the last-mentioned application needs a significant amount of research to reliably recognize real musical instruments in recordings....
-
Data on LEGO sets release dates and worldwide retail prices combined with aftermarket transaction prices in Poland between June 2018 and June 2023
PublicationThe dataset contains LEGO bricks sets item count and pricing history for AI-based set pricing prediction. The data spans the timeframe from June 2018 to June 2023. The data was obtained from three sources: Brickset.com (LEGO sets retail prices, release dates, and IDs), Lego.com official web page (ID number of each set that was released by Lego, its retail prices, the current status of the set) and promoklocki.pl web page (the retail...
-
Data on the identification of microsatellite markers in Eisenia fetida and Eisenia andrei
Publication -
Manufacturing Data Analysis in Internet of Things/Internet of Data (IoT/IoD) Scenario
PublicationComputer integrated manufacturing (CIM) has enormous benefits as it increases the rate of production, reduces errors and production waste, and streamlines manufacturing sub-systems. However, there are some new challenges related to CIM operating in the Internet of Things/Internet of Data (IoT/IoD) scenarios associated with Industry 4.0 and cyber-physical systems. The main challenge is to deal with the massive volume of data flowing...
-
Low-Level Aerial Photogrammetry as a Source of Supplementary Data for ALS Measurements
PublicationThe development of laser scanning technology ALS allows to make high-resolution measurements for large areas result-ing in significant reduction of costs. The main stakeholders at heights data received from the airborne laser scanning is mainly state administration. The state institutions appear among projects such as ISOK. Each point is classified in ac-cordance with the standard LAS 1.2, our research focuses on the class 6 -...
-
The Bridge of Data Project Objectives
PublicationOpen Research Data (ORD) is one of the emerging trends for researchers across the globe. However, it has to be stressed that the level of implementation and awareness of ORD varies between countries. Many initiatives have been created in Polish scientific institutions to support the process of opening publications. These are mainly Open Access (OA) repositories, implementing the so-called green road of OA. However, only a few universities...
-
Mono- and bimetallic (Pt/Cu) titanium(IV) oxide photocatalysts. Physicochemical and photocatalytic data of magnetic nanocomposites’ shell
PublicationSurface modification of titania with noble and semi-noble metals resulted in significant enhancement of photocatalytic activity. Presented data, showing the photocatalytic properties of TiO2-M (where M is Pt and/or Cu) photocatalysts were further used as Fe3O4@SiO2/TiO2-M magnetic nanocomposites shells in "Mono- and bimetallic (Pt/Cu) titanium(IV) oxide core-shell photocatalysts with Vis light activity and magnetic separability"...
-
Processing of Satellite Data in the Cloud
PublicationThe dynamic development of digital technologies, especially those dedicated to devices generating large data streams, such as all kinds of measurement equipment (temperature and humidity sensors, cameras, radio-telescopes and satellites – Internet of Things) enables more in-depth analysis of the surrounding reality, including better understanding of various natural phenomenon, starting from atomic level reactions, through macroscopic...
-
DATA JOURNALS AND DATA PAPERS IN VARIOUS RESEARCH AREAS AND SCIENTIFIC DISCIPLINES – BIBLIOMETRIC ANALYSIS BASED ON INCITES
PublicationThe main aim of this work is to provide insight into a bibliometric analysis of Data Journals and Data Papers in terms of research areas, disciplines, publication year and country. In particular, we calculated many bibliometric indicators, especially: the number of publications and citations. Furthermore, this work also investigated the top 20 journals in which scientists published the largest number of Data Papers. It was found...
-
Data Mining Applications and Methods in Medicine
PublicationIn this paper we describe the research area of data mining and its applications in medicine. The origins of data mining and its crucial features are shortly presented. We discuss the specificity of medicine as an application area for computer systems. Characteristic features of the medical data are investigated. Common problems in the area are also presented as well as the strengths and capabilities of the data mining methods....
-
Sharing research data across disciplines
PublicationThis monograph is a collection of experiences gathered by the team implementing the Bridge of Data project. However, it is not just a simple summary of the project implementation. It shows and systematizes the substantive and technical works performed by the teams and several issues related to data management itself in various disciplines, represented by members of the scientific team and other researchers from partner universities.The...
-
Radar data fusion in the STRADAR system
PublicationThe main task of the Polish Border Guard is protection of the country’s border which requires utilization of multimedia surveillance systems automatically gathering, processing and sharing various data. The paper presents such a system developed for the Maritime Division of the Polish Border Guard within the STRADAR project and the problem of fusion of radar data in this system. The system, apart from providing communication means,...
-
Application of the Heavy-Atom Effect for (Sub)microsecond Thermally Activated Delayed Fluorescence and an All-Organic Light-Emitting Device with Low-Efficiency Roll-off
PublicationThefeatureof abundantandenvironmentallyfriendlyheavyatoms(HAs)like bromineto acceleratespin-forbiddentransitionsin organicmoleculeshas beenknownforyears.In combinationwiththe easinessof incorporation,brominederivativesof organicemittersshowingthermallyactivateddelayedfluorescence(TADF)emergeas a cheapand efficientsolutionforthe slowreverseintersystemcrossing(rISC)problemin suchemittersand strongefficiencyroll-offof all-organiclight-emittingdiodes(OLEDs).Here,we...
-
Big Data in Regenerative Urban Design
PublicationWhy the use of Big Data in regenerative planning matters? The aim of this chapter is to study under what conditions Big Data can be integrated into regenerative design and sustainable planning? Authors seek to answer how – when related to the ecosystem and to human activities – Big Data can be used to: • both shape policies that support the development of regenerative human settlements, • support restorative design for practitioners...
-
The Use of Big Data in Regenerative Planning
PublicationWith the increasing significance of Big Data sources and their reliability for studying current urban development processes, new possibilities have appeared for analyzing the urban planning of contemporary cities. At the same time, the new urban development paradigm related to regenerative sustainability requires a new approach and hence a better understanding of the processes changing cities today, which will allow more efficient...
-
CPLFD-GDPT5: High-resolution gridded daily precipitation and temperature data set for two largest Polish river basins
PublicationThe CHASE-PL (Climate change impact assessment for selected sectors in Poland) Forcing Data–Gridded Daily Precipitation & Temperature Dataset–5 km (CPLFD-GDPT5) consists of 1951–2013 daily minimum and maximum air temperatures and precipitation totals interpolated onto a 5 km grid based on daily meteorological observations from the Institute of Meteorology and Water Management (IMGW-PIB; Polish stations), Deutscher Wetterdienst...
-
A Text as a Set of Research Data. A Number of Aspects of Data Acquisition and Creation of Datasets in Neo-Latin Studies
PublicationIn this paper, the authors, who specialise in part in neo-Latin studies and the his-tory of early modern education, share their experiences of collecting sources for Open Research Data sets under the Bridge of Data project. On the basis of inscription texts from St. Mary’s Church in Gdańsk, they created 29 Open Research Data sets. In turn, the text of the lectures of the Gdańsk scholar Michael Christoph Hanow, Praecepta de arte...
-
Enhanced uniform data sampling for constrained data‐driven modeling of antenna input characteristics
PublicationData-driven surrogates are the most popular replacement models utilized in many fields of engineering and science, including design of microwave and antenna structures. The primary practical issue is a curse of dimensionality which limits the number of independent parameters that can be accounted for in the modelling process. Recently, a performance-driven modelling technique has been proposed where the constrained domain of the...
-
Collaborative Data Acquisition and Learning Support
PublicationWith the constant development of neural networks, traditional algorithms relying on data structures lose their significance as more and more solutions are using AI rather than traditional algorithms. This in turn requires a lot of correctly annotated and informative data samples. In this paper, we propose a crowdsourcing based approach for data acquisition and tagging with support for Active Learning where the system acts as an...
-
Active Learning Based on Crowdsourced Data
PublicationThe paper proposes a crowdsourcing-based approach for annotated data acquisition and means to support Active Learning training approach. In the proposed solution, aimed at data engineers, the knowledge of the crowd serves as an oracle that is able to judge whether the given sample is informative or not. The proposed solution reduces the amount of work needed to annotate large sets of data. Furthermore, it allows a perpetual increase...
-
Streaming Real-time Data in Distributed Dispatcher and Teleinformation Systems for Visualization of Multimedia Data of the Border Guard
PublicationSurveillance of the sea borders is a very important task for the Border Guard. Monitoring of country maritime border is an important task of the Border Guard. This task can be facilitated with the use of the technology enabling gathering information from distributed sources and its supervision and visualization. This task can be accomplished using a technology that allows to collect information from distributed sensors of different...
-
On the impact of Big Data and Cloud Computing on a scalable multimedia archiving system
PublicationMultimedia Archiver (MA) is a system build upon the promise and fascination of the possibilities emerging from cloud computing and big data. We aim to present and describe how the Multimedia Archiving system works for us to record, put in context and allow a swift access to large amounts of data. We introduce the architecture, identified goals and needs taken into account while designing a system processing data with Big Data...
-
Big Data Analytics for ICT Monitoring and Development
PublicationThe expanded growth of information and communication technology has opened new era of digitization which is proving to be a great challenge for researchers and scientists around the globe. The utmost paradigm is to handle and process the explosion of data with minimal cost and discover relevant hidden information in the least amount of time. The buzz word “BIG DATA” is a widely anticipated term with the potential to handle heterogeneous,...
-
3D MODELLING OF CYLINDRICAL-SHAPED OBJECTS FROM LIDAR DATA - AN ASSESSMENT BASED ON THEORETICAL MODELLING AND EXPERIMENTAL DATA
PublicationDespite the increasing availability of measured laser scanning data and their widespread use, there is still the problem of rapid and correct numerical interpretation of results. This is due to the large number of observations that carry similar information. Therefore, it is necessary to extract from the results only the essential features of the modelled objects. Usually, it is based on a process using filtration, followed by...
-
Linking music data in executable documents
PublicationThis paper presents the application of Interactive Open Document Architecture (IODA) to music and video data. This architecture was design to create multilayer documents which consist of many files. The paper shows the method of creating media documents on the basis of IODA. These kind of documents were called IODA Media Documents (IMD). IMD have links that connect many different kinds of files containing music and video data....