Search results for: analysis of large data set
-
Deep Data Analysis of a Large Microarray Collection for Leukemia Biomarker Identification
Publication -
Comprehensive Analysis of MILE Gene Expression Data Set Advances Discovery of Leukaemia Type and Subtype Biomarkers
Publication -
Analysis of results of large-scale multimodal biometric identity verification experiment
Publication: An analysis of a large set of biometric data obtained during the enrolment and the verification phase in an experimental biometric system installed in bank branches is presented. Subjective opinions of bank clients and of bank tellers were also surveyed concerning the studied biometric methods in order to discover and to explore relations emerging from the obtained multimodal dataset. First, data acquisition and identity verification...
-
Accelerated large scale test set-up design in natural corrosion marine environment
Publication: The standards for conducting small-scale specimen tests are well developed, but there is a lack of direct guidelines for conducting corrosion tests for large-scale specimens. The objective here is to develop a methodology which may be used in designing an accelerated corrosion test of large-scale structural components subjected to a natural corrosion marine environment. Different factors influencing corrosion degradation of steel...
-
Krzysztof Goczyła prof. dr hab. inż.
People: Krzysztof Goczyła, full professor at Gdańsk University of Technology, computer scientist, a specialist in software engineering, knowledge engineering and databases. He graduated from the Faculty of Electronics, Technical University of Gdansk, in 1976 with a degree in electronic engineering, specializing in automation. Since then he has been working at Gdańsk University of Technology. In 1982 he obtained a doctorate in computer science...
-
APPLICATION OF CHEMOMETRIC ANALYSIS TO THE STUDY OF SNOW AT THE SUDETY MOUNTAINS, POLAND
Publication: Snow samples were collected during winter 2011/2012 at three posts in the Western Sudety Mountains (Poland) in 3 consecutive phases of snow cover development, i.e. stabilisation (Feb 1st), growth (Mar 15th) and ablation (Mar 27th). To maintain a fixed number of samples, each snow profile was divided into six layers, but hydrochemical determinations were made for each 10 cm section of core. The complete data set was subjected...
-
Processing of Satellite Data in the Cloud
Publication: The dynamic development of digital technologies, especially those dedicated to devices generating large data streams, such as all kinds of measurement equipment (temperature and humidity sensors, cameras, radio-telescopes and satellites – Internet of Things), enables more in-depth analysis of the surrounding reality, including better understanding of various natural phenomena, starting from atomic level reactions, through macroscopic...
-
Simultaneous determination of hydrophobicity and dissociation constant for a large set of compounds by gradient reverse phase high performance liquid chromatography–mass spectrometry technique
Publication -
Uncertainty quantification of modal parameter estimates obtained from subspace identification: An experimental validation on a laboratory test of a large-scale wind turbine blade
Publication: The uncertainty afflicting modal parameter estimates stems from, e.g., the finite data length, unknown or partly measured inputs, and the choice of the identification algorithm. Quantification of the related errors with the statistical Delta method is a recent tool, useful in many modern modal analysis applications, e.g., damage diagnosis, reliability analysis, model calibration. In this paper, the Delta method-based uncertainty...
-
Trade differentiation and the characteristics of new imported and exported products - international panel data analysis
Publication: Drawing on a large panel of international economies, we have shown how the set of imported and exported products evolves in the process of economic growth. Strong activity at the extensive margin, manifested through the rise in the number of active export and import lines, is typical for early stages of development. Trade diversification tendency, typical for a predominant mass of observations in our panel, is associated with changes in...
-
Segmentation Quality Refinement in Large-Scale Medical Image Dataset with Crowd-Sourced Annotations
Publication: Deployment of different techniques of deep learning, including Convolutional Neural Networks (CNN), in image classification systems has accomplished outstanding results. However, the advantages and potential impact of such a system can be completely negated if it does not reach a target accuracy. To achieve high classification accuracy with low variance in a medical image classification system, there is needed the large size of the...
-
Text Mining Algorithms for Extracting Brand Knowledge; The fashion Industry Case
Publication: Brand knowledge is determined by customer knowledge. The opportunity to develop brands based on customer knowledge management has never been greater. Social media as a set of leading communication platforms enable peer to peer interplays between customers and brands. A large stream of such interactions is a great source of information which, when thoroughly analyzed, can become a source of innovation and lead to competitive advantage....
-
Numerical Study of the Impinging Jets Formed by an Injector with Different Nozzle Diameters
Open Research Data: The data set contains the simulation files related to the research paper “Numerical Study of the Impinging Jets Formed by an Injector with Different Nozzle Diameters”, https://doi.org/10.4271/2022-01-1080.
-
Stochastic intervals for the family of quadratic maps
Open Research Data: Numerical analysis of chaotic dynamics is a challenging task. The one-parameter families of logistic maps and closely related quadratic maps f_a(x)=a-x^2 are well-known examples of such dynamical systems. Determining parameter values that yield stochastic-like dynamics is especially difficult, because although this set has positive Lebesgue measure,...
-
Compressive Sensing Approach to Harmonics Detection in the Ship Electrical Network
Publication: The contribution of this paper is to show the opportunities for using the compressive sensing (CS) technique for detecting harmonics in a frequency sparse signal. The signal in a ship’s electrical network, polluted by harmonic distortions, can be modeled as a superposition of a small number of sinusoids and the discrete Fourier transform (DFT) basis forms its sparse domain. According to the theory of CS, a signal may be reconstructed...
-
Energy efficiency of electric multiple units in suburban operation
Publication: This thesis presents an approach to the analysis of the energy efficiency of a suburban rail network, using novel models developed in Matlab/Simulink. Necessary features and requirements for such models were determined through an in-depth review of the source literature in all applicable fields: electrified transportation systems, electric multiple unit construction, vehicle drivetrains and, finally, existing simulation methods. Existing...
-
Finite element models used in diagnostics of transverse cracks in bridge approach pavement
Open Research Data: Transverse cracks in the asphalt pavement were observed on bridge structures next to single-module expansion joints with a 5 meter approach slab set at a depth of 1 m. Finite element (FE) models of the approach pavement were created to investigate the reasons for premature cracking and the crack initiation mechanism over the back edge of the abutment...
-
Magdalena Szuflita-Żurawska
People: Head of the Scientific and Technical Information Services at the Gdansk University of Technology Library and the Leader of the Open Science Competence Center. She is also a Plenipotentiary of the Rector of the Gdańsk University of Technology for open science. She is a PhD Candidate. Her main areas of research and interests include research productivity, motivation, management of HEs, Open Access, Open Research Data, information...
-
SYNAT Music Genre Parameters PCA 19
Open Research Data: The dataset contains feature vectors obtained after performing Principal Component Analysis (PCA); it covers 11 music genres and 19-element vectors derived from music excerpts. Originally, a feature vector containing 173 elements was conceived in earlier research studies carried out by the team of authors [1-6]. A collection of 52532 music excerpts described...
-
SYNAT_PCA_48
Open Research Data: This dataset is part of a series of datasets containing feature vectors derived from music tracks. It contains 51582 music tracks (22 music genres) with feature vectors obtained after performing Principal Component Analysis (PCA), giving 48-element vectors derived from music excerpts. Originally, a feature vector containing 173 elements was conceived in earlier...
-
SYNAT_PCA_11
Open Research Data: The dataset contains 51582 music tracks (22 music genres) with feature vectors obtained after performing Principal Component Analysis (PCA), giving 11-element vectors derived from music excerpts. Originally, a feature vector containing 173 elements was conceived in earlier research studies carried out by the team of authors [1-6]. A collection of more than...
-
Qualitative data analysis methods
e-Learning Courses: This is the continuation of the Qualitative Data Analysis Methods course, provided online.
-
Data Analysis 2023/24
e-Learning Courses: Data Analysis, dr inż. Karol Flisikowski, prof. PG, winter semester 2023/24
-
Lengthening the transition curve in the analytical design method (Wydłużanie krzywej przejściowej w analitycznej metodzie projektowania)
Publication: The paper presents the problem of lengthening transition curves using the analytical design method. The analysis was based on numerical calculations carried out for a wide set of parameters characterising a standard geometric layout. After formulating the relevant theoretical relationships, the paper considers the significance of the circular arc radius and the route turning angle for the obtained...
-
Current trends in the certification of normative management systems in the food industry (Aktualne trendy w zakresie certyfikacji normatywnych systemów zarządzania w branży spożywczej)
Publication: The study discusses current trends in the certification of quality management systems in the agri-food sector. Based on data obtained from a large, reputable certification body, it presents a comparative analysis of the number of certificates issued for general management systems and for industry-specific systems dedicated to the agri-food sector, broken down by enterprise size.
-
The KASKADA Platform as a system for ensuring security through mass real-time analysis of multimedia streams (Platforma KASKADA jako system zapewniania bezpieczeństwa poprzez masową analizę strumieni multimedialnych w czasie rzeczywistym)
Publication: The article presents the KASKADA Platform, understood as a system for processing digital data and multimedia streams and as an offering of services supporting public security, the assessment of medical examinations and the protection of intellectual property. The aim of the work was to create an innovative system enabling efficient, mass analysis of digital documents and multimedia streams in real time...
-
Deep neural networks for data analysis
e-Learning Courses: The aim of the course is to familiarize students with the methods of deep learning for advanced data analysis. Typical areas of application of these types of methods include: image classification, speech recognition and natural language understanding.
-
SET-VALUED ANALYSIS
Journals -
Music Data Processing and Mining in Large Databases for Active Media
Publication: The aim of this paper was to investigate the problem of music data processing and mining in large databases. Tests were performed on a large database that included approximately 30000 audio files divided into 11 classes corresponding to music genres with different cardinalities. Every audio file was described by a 173-element feature vector. To reduce the dimensionality of data the Principal Component Analysis (PCA) with variable...
-
Spatial analysis of the land transport industry in Poland (Analiza przestrzenna branży transportu lądowego w Polsce)
Publication: The aim of the study was to determine the variation in the spatial distribution of the land transport industry in Poland and the factors affecting its size. The analysis was carried out at the voivodeship and county (powiat) levels, based on data for 2009 and 2012. Location analysis was used, spatial autocorrelation was calculated and interpreted, and a spatial regression model was constructed and estimated. It was found that at the level...
-
Data Analysis in Bridge of Data
Publication: The chapter presents the data analysis aspects of the Bridge of Data project. The software framework used, Jupyter, and its configuration are presented. The solution’s architecture, including the TRYTON supercomputer as the underlying infrastructure, is described. The use case templates provided by the Stat-reducer application are presented, including data analysis related to spatial points’ cloud-, audio- and wind-related research.
-
Visual Dimensions of Modeling Languages in Interdisciplinary Perspective
Publication: The usefulness of visual modeling languages depends on their notation. A notation can be seen as a set of visual components that act in a particular way on the human eye and the human brain. The paper presents an interdisciplinary analysis carried out to better understand the visual dimensions of modeling languages. The visual dimensions derive from theories describing visual perception, data visualization and cognitive representations....
-
BEARING SYSTEMS OF WIND TURBINES – MAINTENANCE PROBLEMS
Publication: In the operation of wind turbines of traditional (geared) design, a high failure rate of gearboxes is observed, especially of the rolling bearings of the high-speed gearbox shafts. In Poland, at the request of the owner of a large wind farm, Energa Wytwarzanie S.A., Gdańsk University of Technology, in cooperation with Poznań University of Technology and the AGH University of Science and Technology, undertook research aimed at diagnosing the causes of damage...
-
Application of Web-GIS for Dissemination and 3D Visualization of Large-Volume LiDAR Data
Publication: The increasing number of digital data sources, which allow for semi-automatic collection and storage of information regarding various aspects of life, has recently brought a considerable rise in popularity to the term “Big data”. As far as geospatial data is concerned, one of the major sources of Big data is Light Detection And Ranging (LiDAR) scanners, which produce high resolution three-dimensional data on a local scale. The...
-
Set-Valued and Variational Analysis
Journals -
Qualitative evaluation of distributed clinical systems supporting research teams working on large-scale data
Publication: In this paper, five contemporary scalable systems to support medical research teams are presented. Their functionalities extend from heterogeneous unstructured data acquisition through large-scale data storing, to on-the-fly analyzing by using robust methods. Such kinds of systems can be useful in the development of new medical procedures and recommendation rules for decision support systems. A short description of each of them is provided....
-
Edge detection algorithms in images (Algorytmy wykrywania krawędzi w obrazie)
Publication: Edge detection is the first stage in digital image processing. The operation consists in removing information such as colour or brightness and leaving only the edges. The result is a significant reduction in the amount of data for further analysis. This makes it possible to apply, in subsequent stages, more complex algorithms for recognising objects based on shape. The article presents the application of algorithms...
-
Using Principal Component Analysis and Canonical Discriminant Analysis for multibeam seafloor characterisation data
Publication: The paper presents the seafloor characterisation based on multibeam sonar data. It relies on using the integrated model and description of three types of multibeam data obtained during seafloor sensing: 1) the grey-level sonar images (echograms) of seabed, 2) the 3D model of the seabed surface which consists of bathymetric data, 3) the set of time domain bottom echo envelopes received in the consecutive sonar beams. The classification...
-
Simulation of parallel similarity measure computations for large data sets
Publication: The paper presents our approach to implementation of similarity measure for big data analysis in a parallel environment. We describe the algorithm for parallelisation of the computations. We provide results from a real MPI application for computations of similarity measures as well as results achieved with our simulation software. The simulation environment allows us to model parallel systems of various sizes with various components...
-
Strength analysis of a large-size supporting structure for an offshore wind turbine
Publication: The offshore wind power industry is the branch of electric energy production from renewable sources which is most intensively developed in EU countries. At present, there is a tendency to install larger-power wind turbines at larger distances from the seashore, on relatively deep waters. Consequently, technological solutions for new supporting structures intended for deeper water regions are undergoing rapid development now. Various...
-
Automated Valuation Model based on fuzzy and rough set theory for real estate market with insufficient source data
Publication: Objective monitoring of the real estate value is a requirement to maintain balance, increase security and minimize the risk of a crisis in the financial and economic sector of every country. The valuation of real estate is usually considered from two points of view, i.e. individual valuation and mass appraisal. It is commonly believed that Automated Valuation Models (AVM) should be devoted to mass appraisal, which requires a large...
-
Applied Set-Valued Analysis and Optimization
Journals -
Intelligent Data Analysis
Journals -
LIFETIME DATA ANALYSIS
Journals -
Qualitative Data Analysis Methods - Summer 2022
e-Learning Courses -
Using LSTM networks to predict engine condition on large scale data processing framework
Publication: As Internet of Things technology develops rapidly, companies are able to observe the health of engine components and constructed systems by collecting signals from sensors. Based on the output of IoT sensors, companies can build systems to predict the condition of components. In practice, components need to be maintained or replaced before the end of their life in performing their assigned task. Predicting...
-
Large deformation finite element analysis of undrained pile installation
Publication: In this paper, a numerical undrained analysis of pile jacking into the subsoil using the Abaqus software suite is presented. Two different approaches, including the traditional Finite Element Method (FEM) and the Arbitrary Lagrangian–Eulerian (ALE) formulation, were tested. In the first method, the soil was modelled as a two-phase medium and effective stress analysis was performed. In the second one (ALE), a single-phase medium was assumed...
-
SYNAT_MUSIC_GENRE_FV_173
Open Research Data: This is the original dataset containing 51582 music tracks (22 music genres) and a 173-element feature vector [1-6,9]. A collection of more than 50000 music excerpts described with a set of descriptors obtained through the analysis of 30-second mp3 recordings was gathered in a database called SYNAT. The SYNAT database was realized by the Gdansk University...
-
GIS Solution for Weather Forecast Data Analysis
Publication: In this paper the authors present a GIS system for the analysis of numerical weather prediction data. This kind of data has a multidimensional character (three dimensions and time) and its analysis should consider all the available factors. The proposed GIS system consists of a RASDAMAN application with an implemented OLAP cube mechanism, which enables the user to process data in the spatial-time domain. It also simplifies the meteorological...
-
How ethics combine with big data: a bibliometric analysis
Publication: The term Big Data is becoming increasingly widespread throughout the world, and its use is no longer limited to the IT industry, quantitative scientific research, and entrepreneurship, but has also entered everyday media and conversations. The prevalence of Big Data is simply a result of its usefulness in searching, downloading, collecting and processing massive datasets. It is therefore not surprising that the number of scientific...