Filters
total: 6167
filtered: 4885
-
Catalog
- Publications 4885 available results
- Journals 137 available results
- Conferences 79 available results
- People 165 available results
- Inventions 12 available results
- Projects 7 available results
- Laboratories 3 available results
- Research Teams 2 available results
- e-Learning Courses 212 available results
- Events 33 available results
- Open Research Data 632 available results
Chosen catalog filters
displaying 1000 best results Help
Search results for: data%20mining
-
Harmony Search for Data Mining with Big Data
PublicationIn this paper, some harmony search algorithms have been proposed for data mining with big data. Three areas of big data processing have been studied to apply new metaheuristics. The first problem is related to MapReduce architecture that can be supported by a team of harmony search agents in grid infrastructure. The second dilemma involves development of harmony search in preprocessing of data series before data mining. Moreover,...
-
DATA MINING IN CONSTRUCTION RESEARCH
Publication -
Data Reduction Algorithm for Machine Learning and Data Mining
Publication -
Ensemble Classifier for Mining Data Streams
Publication -
Data Mining Applications and Methods in Medicine
PublicationIn this paper we describe the research area of data mining and its applications in medicine. The origins of data mining and its crucial features are shortly presented. We discuss the specificity of medicine as an application area for computer systems. Characteristic features of the medical data are investigated. Common problems in the area are also presented as well as the strengths and capabilities of the data mining methods....
-
Multimedia data mining for e-Commerce.
PublicationPrzedstawiono studium porównawcze metod eksploracji danych dla e-Commerce.Skupiono się na studium przypadku aplikacji medycznych - wyszukiwania przypadków podobnych.
-
Application of decisional DNA in web data mining
PublicationPrzedstawiono pilotową koncepcję i aplikację integracji reprezentacji wiedzy opartej na decyzyjnym DNA oraz systemów pozyskiwania wiedzy i danych z Internetu. Wskazano na zalety proponowanej integracji oraz przedstawiono kierunki przyszłych badań w tym zakresie.
-
Frequent Sequence Mining in Web Log Data
Publication -
Frequent Sequence Mining in Web Log Data
PublicationThe amount of information available even on a single web server can be huge. On the other hand, the amount of visitors (users) can often reach a number of at least six digits. Users vary in gender, age and education, and in consequence their information needs are different. Moreover, they subconsciously expect to get more adequate content after visiting the first few pages. The scope of this kind of problem relates to the domain...
-
On a Certain Research Gap in Big Data Mining for Customer Insights
Publication -
Mining e-mail message sequences from log data
Publication -
Music Data Processing and Mining in Large Databases for Active Media
PublicationThe aim of this paper was to investigate the problem of music data processing and mining in large databases. Tests were performed on a large data-base that included approximately 30000 audio files divided into 11 classes cor-responding to music genres with different cardinalities. Every audio file was de-scribed by a 173-element feature vector. To reduce the dimensionality of data the Principal Component Analysis (PCA) with variable...
-
Machine Learning and data mining tools applied for databases of low number of records
Publication -
Choosing Exploration Process Path in Data Mining Processes for Complex Internet Objects
PublicationWe present an experimental case study of a novel and original framework for classifying aggregate objects, i.e. objects that consist of other objects. The features of the aggregated objects are converted into the features of aggregate ones, by use of aggregate functions. The choice of the functions, along with the specific method of classification can be automated by choosing of one of several process paths, and different paths...
-
Mapping of the Covid-19 Vaccine Uptake Determinants From Mining Twitter Data
PublicationOpinion polls on vaccine uptake clearly show that Covid-19 vaccine hesitancy is increasing worldwide. Thus, reaching herd immunity not only depends on the efficacy of the vaccine itself, but also on overcoming this hesitancy of uptake in the population. In this study, we revealed the determinants regarding vaccination directly from people’s opinions on Twitter, based on the framework of the 6As taxonomy. Covid-19 vaccine acceptance...
-
Choosing Exploration Process Path in Data Mining Processes for Complex Internet Objects
PublicationWe present an experimental case study of a novel and original framework for classifying aggregate objects, i.e. objects that consist of other objects. The features of the aggregated objects are converted into the features of aggregate ones, by use of aggregate functions. The choice of the functions, along with the specific method of classification can be automated by choosing of one of several process paths, and different paths...
-
Ensemble Online Classifier Based on the One-Class Base Classifiers for Mining Data Streams
Publication -
Comprehensive Comparison of a Few Variants of Cluster Analysis as Data Mining Tool in Supporting Environmental Management
PublicationA few variants of hierarchical cluster analysis (CA) as tool of assessment of multidimensional similarity in environmental dataset are compared. The dataset consisted of analytical results of determination of metals (Na, K, Ca, Sc, Fe, Co, Zn, As, Br, Rb, Mo, Sb, Cs, Ba, La, Ce, Sm, Hf and Th) in ambient air dried and kept alive, by the means of hydroponics, moss baskets collected in 12 locations on the area of Tricity (Poland)....
-
Researching Digital Society: Using Data-Mining to Identify Relevant Themes from an Open Access Journal
PublicationOpen Access scholarly literature is scientific output free from economic barriers and copyright restrictions. Using a case study approach, data mining methods and qualitative analysis, the scholarly output and the meta-data of the Open Access eJournal of e-Democracy and Open Government during the time interval 2009–2020 was analysed. Our study was able to identify the most prominent research topics (defined as thematic clusters)...
-
Researching Digital Society: Using Data-Mining to Identify Relevant Themes from an Open Access Journal
PublicationOpen Access scholarly literature is scientific output free from economic barriers and copyright restrictions. Using a case study approach, data mining methods and qualitative analysis, the scholarly output and the meta-data of the Open Access eJournal of e-Democracy and Open Government during the time interval 2009–2020 was analysed. Our study was able to identify the most prominent research topics (defined as thematic clusters)...
-
Estimating Water Retention in Post-mining Excavations Using LiDAR ALS Data for the Strzelin Quarry, in Lower Silesia
Publication -
Data Analysis in Bridge of Data
PublicationThe chapter presents the data analysis aspects of the Bridge of Data project. The software framework used, Jupyter, and its configuration are presented. The solution’s architecture, including the TRYTON supercomputer as the underlying infrastructure, is described. The use case templates provided by the Stat-reducer application are presented, including data analysis related to spatial points’ cloud-, audio- and wind-related research.
-
EUDEM2: The European Union in humanitarian demining. State of the art on humanitarian demining.
PublicationPrzedstawiono projekt 5 Programu Ramowego. Opisano stan technologii technik wykrywania min lądowych w krajach europejskich.
-
High-Speed Videoendoscopy Enhances the Objective Assessment of Glottic Organic Lesions: A Case-Control Study with Multivariable Data-Mining Model Development
Publication -
Data reduction and stacking for imbalanced data classification
Publication -
Data governance: Organizing data for trustworthy Artificial Intelligence
PublicationThe rise of Big, Open and Linked Data (BOLD) enables Big Data Algorithmic Systems (BDAS) which are often based on machine learning, neural networks and other forms of Artificial Intelligence (AI). As such systems are increasingly requested to make decisions that are consequential to individuals, communities and society at large, their failures cannot be tolerated, and they are subject to stringent regulatory and ethical requirements....
-
DATA INTEROPERABILITY AND THE OPEN DATA ECOSYSTEM: ROLES AND RESEARCH AREAS
PublicationSustainability and value-creation are considered important parameters to measure the success of an open data system. Unfortunately, existing open data systems are not meeting their promises to achieve a sustainable and value-based open data system. Van Loenen et al. (2021) proposed a sustainable and value-creating open data ecosystem. According to their study, the open data ecosystem needs to be user-driven, inclusive, circular,...
-
Asking Data in a Controlled Way with Ask Data Anything NQL
PublicationWhile to collect data, it is necessary to store it, to understand its structure it is necessary to do data-mining. Business Intelligence (BI) enables us to make intelligent, data-driven decisions by the mean of a set of tools that allows the creation of a potentially unlimited number of machine-generated, data-driven reports, which are calculated by a machine as a response to queries specified by humans. Natural Query Languages...
-
Data Extraction, Transformation and Loading process in data warehouse development.
PublicationOmówiono podstawowe elementy procesu projektowania hurtowni danych.
-
Text-mining Similarity Approximation Operators for Opinion Mining in BI tools
PublicationThe concept of the Text-mining Similarity Approximation Operators for Opinion Mining as extensions to Natural Language Interface Database is defined. The new operators: “keywords of” dimension; subsetting operator “about C is q”; aggregation operator “by similar C” are proposed. These operators are based on the Latent Semantic Analysis and Social Network Analysis
-
Data librarian and data steward – new tasks and responsibilities of academic libraries in the context of Open Research Data implementation in Poland
PublicationThesis/Objective – The policy of Open Access (OA) for researching resources in Europe has been implemented for more than 10 years. The first recommendations concerning providing OA to scientific materials were defined during the implementation of the 7th Framework Programme. Introducing another set of recommendations concerning OA to research data was the next stage. The recommendations were transformed into obligations under the...
-
Managing Data from Heterogeneous Data Sources Using Knowledge Layer
Publication -
Atmospheric emissions of POPs in Europe - a discussion of existing data and data need
PublicationZaproponowano, aby schematy inwentaryzacji i raportowania emisji w przyszłości były tak dobrane, aby zaspokajały potrzeby zarówno administracji odpowiedzialnej za strategię redukcji emisji jak i naukowców wykorzystujących inwentaryzację emisji na potrzeby dalszych badań naukowych. Stan obecny inwentaryzacji emisji POP oszacowany został jako niezadowalający a dalsze usprawnienia są konieczne dla poprawienia wiarygodności wyników...
-
Managing data from heterogeneous data sources using knowledge layer
PublicationW procesie integrowania danych przy użyciu ontologii, ważne jest aby zarządzać danymi przechowywanymi w zewnętrznych źródłach, analogicznie jak tymi przechowywanymi w Bazie Wiedzy. Zaprezentowana w poprzednich pracach metoda kartograficznej reprezentacji wiedzy pozwala na wnioskowanie z danych przechowywanych w Bazie wiedzy. Rozwiązanie zaprezentowane w tej pracy umożliwia wykorzystanie metody kartograficznej do wnioskowania z...
-
Big Data i 5V – nowe wyzwania w świecie danych (Big Data and 5V – New Challenges in the World of Data)
PublicationRodzaje danych, składające się na zbiory typu Big Data, to m.in. dane generowane przez użytkowników portali internetowych, dane opisujące transakcje dokonywane poprzez Internet, dane naukowe (biologiczne, astronomiczne, pomiary fizyczne itp.), dane generowane przez roboty w wyniku automatycznego przeszukiwania przez nie Internetu (Web mining, Web crawling), dane grafowe obrazujące powiązania pomiędzy stronami WWW itd. Zazwyczaj,...
-
Cytokine TGFβ Gene Polymorphism in Asthma: TGF-Related SNP Analysis Enhances the Prediction of Disease Diagnosis (A Case-Control Study With Multivariable Data-Mining Model Development)
Publication -
Musical Instrument Tagging Using Data Augmentation and Effective Noisy Data Processing
PublicationDeveloping signal processing methods to extract information automatically has potential in several applications, for example searching for multimedia based on its audio content, making context-aware mobile applications (e.g., tuning apps), or pre-processing for an automatic mixing system. However, the last-mentioned application needs a significant amount of research to reliably recognize real musical instruments in recordings....
-
Manufacturing Data Analysis in Internet of Things/Internet of Data (IoT/IoD) Scenario
PublicationComputer integrated manufacturing (CIM) has enormous benefits as it increases the rate of production, reduces errors and production waste, and streamlines manufacturing sub-systems. However, there are some new challenges related to CIM operating in the Internet of Things/Internet of Data (IoT/IoD) scenarios associated with Industry 4.0 and cyber-physical systems. The main challenge is to deal with the massive volume of data flowing...
-
Synteza algorytmu detekcji pęknięcia szyby metodą ''data fission - data fusion''
PublicationPrzedstawiono założenia projektowe oraz proces syntezy algorytmu detekcyjnego akustycznego detektora pęknięcia szyby. W konstrukcji algorytmu użyto techniki rozszczepiania i syntezy danych. Przedstawiono użyte narzędzia badawcze, opracowany model pęknięcia szyby oraz wynki testowania finalnego algorytmu detekcyjnego. Metoda znalazła zastosowanie w konstrukcji akustycznego detektora pęknięcia szyby stosowanego w systemach alarmowych.
-
Enhanced uniform data sampling for constrained data‐driven modeling of antenna input characteristics
PublicationData-driven surrogates are the most popular replacement models utilized in many fields of engineering and science, including design of microwave and antenna structures. The primary practical issue is a curse of dimensionality which limits the number of independent parameters that can be accounted for in the modelling process. Recently, a performance-driven modelling technique has been proposed where the constrained domain of the...
-
Schema mining in XML documents.
PublicationW artykule przedstawiono algorytm COBWEB S+T służący do wywodzenia schematów z kolekcji dokumentów XML. Algorytm wykorzystuje model danych semistrukturalnych oraz alorytm COBWEB służący do grupowania koncepcyjnego. W artykule zaprezentowano również wyniki testów działania algorytmu.
-
Differential analysis of impedance data
Publication -
Distributed Learning with Data Reduction
Publication -
Processing of Satellite Data in the Cloud
PublicationThe dynamic development of digital technologies, especially those dedicated to devices generating large data streams, such as all kinds of measurement equipment (temperature and humidity sensors, cameras, radio-telescopes and satellites – Internet of Things) enables more in-depth analysis of the surrounding reality, including better understanding of various natural phenomenon, starting from atomic level reactions, through macroscopic...
-
The Bridge of Data Project Objectives
PublicationOpen Research Data (ORD) is one of the emerging trends for researchers across the globe. However, it has to be stressed that the level of implementation and awareness of ORD varies between countries. Many initiatives have been created in Polish scientific institutions to support the process of opening publications. These are mainly Open Access (OA) repositories, implementing the so-called green road of OA. However, only a few universities...
-
BIG PROBLEMS WITH BIG DATA
PublicationThe article presents an overview of the most important issues related to the phenomenon called big data. The characteristics of big data concerning the data itself and the data sources are presented. Then, the big data life cycle concept is formulated. The next sections focus on two big data technologies: MapReduce for big data processing and NoSQL databases for big data storage.
-
Client-side versus server-side geographic data processing performance comparison: Data and code
PublicationThe data and code presented in this article are related to the research article entitled “Analysis of Server-side and Client-side Web-GIS data processing methods on the example of JTS and JSTS using open data from OSM and Geoportal” (Kulawiak et al., 2019). The provided 12 datasets include multi-point and multi-polygon data of different scales and volumes, representing real-world geographic features. The datasets cover the area...
-
A Text as a Set of Research Data. A Number of Aspects of Data Acquisition and Creation of Datasets in Neo-Latin Studies
PublicationIn this paper, the authors, who specialise in part in neo-Latin studies and the his-tory of early modern education, share their experiences of collecting sources for Open Research Data sets under the Bridge of Data project. On the basis of inscription texts from St. Mary’s Church in Gdańsk, they created 29 Open Research Data sets. In turn, the text of the lectures of the Gdańsk scholar Michael Christoph Hanow, Praecepta de arte...
-
RPC communication layer and introduction to data protection for embedded PC based control and data acquisition module
Publication -
A survey of medical researchers indicates poor awareness of research data management processes and a role for data librarians
Publication