Filtry
wszystkich: 3690
wybranych: 2509
wyświetlamy 1000 najlepszych wyników Pomoc
Wyniki wyszukiwania dla: analysis of large data set
-
Deep Data Analysis of a Large Microarray Collection for Leukemia Biomarker Identification
Publikacja -
Comprehensive Analysis of MILE Gene Expression Data Set Advances Discovery of Leukaemia Type and Subtype Biomarkers
Publikacja -
Analysis of results of large-scale multimodal biometric identity verification experiment
PublikacjaAn analysis of a large set of biometric data obtained during the enrolment and the verification phase in an experimental biometric system installed in bank branches is presented. Subjective opinions of bank clients and of bank tellers were also surveyed concerning the studied biometric methods in order to discover and to explore relations emerging from the obtained multimodal dataset. First, data acquisition and identity verification...
-
Accelerated large scale test set-up design in natural corrosion marine environment
PublikacjaThe standards for conducting small-scale specimen tests are well developed, but there is a lack of direct guidelines for conducting corrosion tests for large-scale specimens. The objective here is to develop a methodology which may be used in designing an accelerated corrosion test of large-scale structural components subjected to a natural corrosion marine environment. Different factors influencing corrosion degradation of steel...
-
APPLICATION OF CHEMOMETRIC ANALYSIS TO THE STUDY OF SNOW AT THE SUDETY MOUNTAINS, POLAND
PublikacjaSnow samples were collected during winter 2011/2012 in three posts in the Western Sudety Mountains (Poland) in 3 consecutive phases of snow cover development, i.e. stabilisation (Feb 1st), growth (Mar 15th) and its ablation (Mar 27th). To maintain a fixed number of samples, each snow profile has been divided into six layers, but hydrochemical indications were made for each 10 cm section of core. The complete data set was subjected...
-
Processing of Satellite Data in the Cloud
PublikacjaThe dynamic development of digital technologies, especially those dedicated to devices generating large data streams, such as all kinds of measurement equipment (temperature and humidity sensors, cameras, radio-telescopes and satellites – Internet of Things) enables more in-depth analysis of the surrounding reality, including better understanding of various natural phenomenon, starting from atomic level reactions, through macroscopic...
-
Simultaneous determination of hydrophobicity and dissociation constant for a large set of compounds by gradient reverse phase high performance liquid chromatography–mass spectrometry technique
Publikacja -
Uncertainty quantification of modal parameter estimates obtained from subspace identification: An experimental validation on a laboratory test of a large-scale wind turbine blade
PublikacjaThe uncertainty afflicting modal parameter estimates stems from e.g., the finite data length, unknown, or partly measured inputs and the choice of the identification algorithm. Quantification of the related errors with the statistical Delta method is a recent tool, useful in many modern modal analysis applications e.g., damage diagnosis, reliability analysis, model calibration. In this paper, the Delta method-based uncertainty...
-
Trade differentiation and the characteristics of new imported and exported products - international panel data analysis
PublikacjaDrawing on o large panel of international economies we have shown how the set of imported and exported products evolves in economic growth process. Strong activity at the extensive margin, manifested through the rise in the number of active export and import lines, is typical for early stages of development. Trade diversification tendency, typical for a predominant mass of observations in our panel, is associated with changes in...
-
Segmentation Quality Refinement in Large-Scale Medical Image Dataset with Crowd-Sourced Annotations
PublikacjaDeployment of different techniques of deep learning including Convolutional Neural Networks (CNN) in image classification systems has accomplished outstanding results. However, the advantages and potential impact of such a system can be completely negated if it does not reach a target accuracy. To achieve high classification accuracy with low variance in medical image classification system, there is needed the large size of the...
-
Text Mining Algorithms for Extracting Brand Knowledge; The fashion Industry Case
PublikacjaBrand knowledge is determined by customer knowledge. The opportunity to develop brands based on customer knowledge management has never been greater. Social media as a set of leading communication platforms enable peer to peer interplays between customers and brands. A large stream of such interactions is a great source of information which, when thoroughly analyzed, can become a source of innovation and lead to competitive advantage....
-
Compressive Sensing Approach to Harmonics Detection in the Ship Electrical Network
PublikacjaThe contribution of this paper is to show the opportunities for using the compressive sensing (CS) technique for detecting harmonics in a frequency sparse signal. The signal in a ship’s electrical network, polluted by harmonic distortions, can be modeled as a superposition of a small number of sinusoids and the discrete Fourier transform (DFT) basis forms its sparse domain. According to the theory of CS, a signal may be reconstructed...
-
Energy efficiency of electric multiple units in suburban operation
PublikacjaThis thesis presents approach to analysis of energy efficiency of a suburban rail network, using novel models developed on the Matlab/Simulink basis. Necessary features and requirements for such models were determined thru in-depth review of the source literature in all applicable fields: electrified transportation systems, electric multiple units construction, vehicle drivetrains and finally, existing simulation methods. Existing...
-
Wydłużanie krzywej przejściowej w analitycznej metodzie projektowania
PublikacjaW pracy przedstawiono problematykę wydłużania krzywych przejściowych, wykorzystując do tego celu analityczną metodę projektowania. Podstawę analizy stanowiły obliczenia numeryczne przeprowadzone dla szerokiego zestawu parametrów charakteryzujących standardowy układu geometryczny. Po sformułowaniu odpowiednich zależności teoretycznych rozpatrzono kwestie znaczenia wielkości promienia łuku kołowego i kąta zwrotu trasy na uzyskane...
-
Aktualne trendy w zakresie certyfikacji normatywnych systemów zarządzania w branży spożywczej
PublikacjaW opracowaniu omówiono aktualne trendy dotyczące certyfikacji systemu zarządzania jakością w sektorze rolno-spożywczym. Na podstawie danych uzyskanych z dużej, renomowanej jednostki certyfikującej przedstawiono analizę porównawczą dotyczącą liczby wydanych certyfikatów dla ogólnych systemów zarządzania oraz dla systemów branżowych dedykowanych sektorowi rolno-spożywczemu w odniesieniu do przedsiębiorstw różnych wielkości.
-
Platforma KASKADA jako system zapewniania bezpieczeństwa poprzez masową analizę strumieni multimedialnych w czasie rzeczywistym
PublikacjaW artykule przedstawiono Platformę KASKADA rozumianą jako system przetwarzania danych cyfrowych i strumieni multimedialnych oraz stanowiącą ofertę usług wspomagających zapewnienie bezpieczeństwa publicznego, ocenę badań medycznych i ochronę własności intelektualnej. celem prowadzonych prac było stworzenie innowacyjnego systemu umozliwiajacego wydajną i masową analizę dokumentów cyfrowych i strumieni multimedialnych w czasie rzeczywistym...
-
Music Data Processing and Mining in Large Databases for Active Media
PublikacjaThe aim of this paper was to investigate the problem of music data processing and mining in large databases. Tests were performed on a large data-base that included approximately 30000 audio files divided into 11 classes cor-responding to music genres with different cardinalities. Every audio file was de-scribed by a 173-element feature vector. To reduce the dimensionality of data the Principal Component Analysis (PCA) with variable...
-
Analiza przestrzenna branży transportu lądowego w Polsce
PublikacjaCelem opracowania było określenie zróżnicowania rozkładu przestrzennego branży transportu lądowego w Polsce oraz czynników wpływających na jej rozmiary. Analizę przeprowadzono na szczeblu wojewódzkim oraz powiatowym, na podstawie danych za lata 2009 i 2012. Wykorzystano analizę lokalizacji, obliczono i zinterpretowano autokorelację przestrzenną, skontruowano i oszacowano także model regresji przestrzennej. Stwierdzono, że na szczeblu...
-
Data Analysis in Bridge of Data
PublikacjaThe chapter presents the data analysis aspects of the Bridge of Data project. The software framework used, Jupyter, and its configuration are presented. The solution’s architecture, including the TRYTON supercomputer as the underlying infrastructure, is described. The use case templates provided by the Stat-reducer application are presented, including data analysis related to spatial points’ cloud-, audio- and wind-related research.
-
Visual Dimensions of Modeling Languages in Interdisciplinary Perspective
PublikacjaUżyteczność języków modelowania wizualnego zależy od notacji. Notacja może być postrzegana jako zestaw wizualnych komponentów, które w określony sposób oddziałują na ludzkie oko i ludzki mózg. Referat przedstawia analizę interdyscyplinarną wykonaną w celu lepszego zrozumienia wizualnych wymiarów języków modelowania. Wizualne wymiary pochodzą z teorii opisujących percepcję wzrokową, wizualizację danych oraz reprezentacje poznawcze....
-
BEARING SYSTEMS OF WIND TURBINES – MAINTENANCE PROBLEMS
PublikacjaW praktyce eksploatacyjnej turbin wiatrowych o tradycyjnej konstrukcji (przekładniowych) obserwowana jest duża awaryjność przekładni, a zwłaszcza łożysk tocznych szybkoobrotowych wałów przekładni. W Polsce na zlecenie właściciela dużej farmy wiatrowej, firmy Energa Wytwarzanie S.A., Politechnika Gdańska we współpracy z Politechniką Poznańską i Akademią Górniczo Hutniczą podjęły badania mające na celu diagnozowanie przyczyn uszkodzeń...
-
Application of Web-GIS for Dissemination and 3D Visualization of Large-Volume LiDAR Data
PublikacjaThe increasing number of digital data sources, which allow for semi-automatic collection and storage of information regarding various aspects of life has recently granted a considerable rise in popularity to the term “Big data”. As far as geospatial data is concerned, one of the major sources of Big data are Light Detection And Ranging (LiDAR) scanners, which produce high resolution three-dimensional data on a local scale. The...
-
Qualitative evaluation of distributed clinical systems supporting research teams working on large-scale data
PublikacjaInthispaper,fivecontemporaryscalablesystemstosupportmedicalresearchteams are presented. Their functionalities extend from heterogeneous unstructured data acquisition through large-scale data storing, to on-the-fly analyzing by using robust methods. Such kinds of systems can be useful in the development of new medical procedures and recommendation rules for decision support systems. A short description of each of them is provided....
-
Algorytmy wykrywania krawędzi w obrazie
PublikacjaWykrywanie krawędzi jest pierwszym etapem w cyfrowym przetwarzaniu obrazów. Operacja ta polega na usunięciu informacji takich jak kolor czy też jasność, a pozostawieniu jedynie krawędzi. Efektem tej operacji jest znaczna redukcja ilości danych do dalszej analizy. Pozwala to na zastosowanie w następnych etapach bardziej złożonych algorytmów rozpoznawania obiektów na podstawie kształtu. W artykule zaprezentowano zastosowanie algorytmów...
-
Using Principal Component Analysis and Canonical Discriminant Analysis for multibeam seafloor characterisation data
PublikacjaThe paper presents the seafloor characterisation based on multibeam sonar data. It relies on using the integrated model and description of three types of multibeam data obtained during seafloor sensing: 1) the grey-level sonar images (echograms) of seabed, 2) the 3D model of the seabed surface which consists of bathymetric data, 3) the set of time domain bottom echo envelopes received in the consecutive sonar beams. The classification...
-
Simulation of parallel similarity measure computations for large data sets
PublikacjaThe paper presents our approach to implementation of similarity measure for big data analysis in a parallel environment. We describe the algorithm for parallelisation of the computations. We provide results from a real MPI application for computations of similarity measures as well as results achieved with our simulation software. The simulation environment allows us to model parallel systems of various sizes with various components...
-
Strength analysis of a large-size supporting structure for an offshore wind turbine
PublikacjaThe offshore wind power industry is the branch of electric energy production from renewable sources which is most intensively developed in EU countries. At present, there is a tendency to install larger-power wind turbines at larger distances from the seashore, on relatively deep waters. Consequently, technological solutions for new supporting structures intended for deeper water regions are undergoing rapid development now. Various...
-
Automated Valuation Model based on fuzzy and rough set theory for real estate market with insufficient source data
PublikacjaObjective monitoring of the real estate value is a requirement to maintain balance, increase security and minimize the risk of a crisis in the financial and economic sector of every country. The valuation of real estate is usually considered from two points of view, i.e. individual valuation and mass appraisal. It is commonly believed that Automated Valuation Models (AVM) should be devoted to mass appraisal, which requires a large...
-
Using LSTM networks to predict engine condition on large scale data processing framework
PublikacjaAs the Internet of Things technology is developing rapidly, companies have an ability to observe the health of engine components and constructed systems through collecting signals from sensors. According to output of IoT sensors, companies can build systems to predict the conditions of components. Practically the components are required to be maintained or replaced before the end of life in performing their assigned task. Predicting...
-
Large deformation finite element analysis of undrained pile installation
PublikacjaIn this paper, a numerical undrained analysis of pile jacking into the subsoil using Abaqus software suit has been presented. Two different approaches, including traditional Finite Element Method (FEM) and Arbitrary Lagrangian–Eulerian (ALE) formulation, were tested. In the first method, the soil was modelled as a two-phase medium and effective stress analysis was performed. In the second one (ALE), a single-phase medium was assumed...
-
GIS Solution for Weather Forecast Data Analysis
PublikacjaIn this paper authors present the GIS system for the analysis of the numerical weather prediction data. This kind of data has multidimensional character (three dimensions and time) and its analysis should consider all the available factors. Proposed GIS system consists of RASDAMAN application with implemented OLAP cube mechanism, which enables the user to process data in the spatial-time domain. It also simplifies the meteorological...
-
How ethics combine with big data: a bibliometric analysis
PublikacjaThe term Big Data is becoming increasingly widespread throughout the world, and its use is no longer limited to the IT industry, quantitative scientific research, and entrepreneurship, but entered as well everyday media and conversations. The prevalence of Big Data is simply a result of its usefulness in searching, downloading, collecting and processing massive datasets. It is therefore not surprising that the number of scientific...
-
Metoda obliczania skutków wdrożenia strategii zarządzania popytem na energię elektryczną (DSM/DSR) w systemach elektroenergetycznych
PublikacjaW niniejszej rozprawie poruszono zagadnienie strategii zarządzania popytem na energię elektryczną (DSM/DSR) i sposobów obliczania efektów ich wdrożenia. W związku z tym opisano oczekiwane efekty wdrożenia tych rozwiązań oraz ich zalety i wady. Zaprezentowano i przeanalizowano istniejące już metody obliczania skutków wdrożenia DSM/DSR. Zaproponowano nową metodę, która poprzez formę algorytmu uporządkowuje proces obliczania i oceny...
-
Improving Effectiveness of SVM Classifier for Large Scale Data
PublikacjaThe paper presents our approach to SVM implementation in parallel environment. We describe how classification learning and prediction phases were pararellised. We also propose a method for limiting the number of necessary computations during classifier construction. Our method, named one-vs-near, is an extension of typical one-vs-all approach that is used for binary classifiers to work with multiclass problems. We perform experiments...
-
A Text as a Set of Research Data. A Number of Aspects of Data Acquisition and Creation of Datasets in Neo-Latin Studies
PublikacjaIn this paper, the authors, who specialise in part in neo-Latin studies and the his-tory of early modern education, share their experiences of collecting sources for Open Research Data sets under the Bridge of Data project. On the basis of inscription texts from St. Mary’s Church in Gdańsk, they created 29 Open Research Data sets. In turn, the text of the lectures of the Gdańsk scholar Michael Christoph Hanow, Praecepta de arte...
-
Manufacturing Data Analysis in Internet of Things/Internet of Data (IoT/IoD) Scenario
PublikacjaComputer integrated manufacturing (CIM) has enormous benefits as it increases the rate of production, reduces errors and production waste, and streamlines manufacturing sub-systems. However, there are some new challenges related to CIM operating in the Internet of Things/Internet of Data (IoT/IoD) scenarios associated with Industry 4.0 and cyber-physical systems. The main challenge is to deal with the massive volume of data flowing...
-
Analysis of the Suitability of Selected Data Tranmission Systems in RSMAD
PublikacjaThis paper analyses the suitability of the selected radio communication systems currently used for data transmission, or usable in the future, in Radio System for Monitoring and Acquisition of Data from Traffic Enforcement Cameras (in short RSMAD). The paper also presents the advantages and disadvantages of each systems, paying particular attention to features that directly affect the suitability of the solution in the RSMAD system....
-
CPLFD-GDPT5: High-resolution gridded daily precipitation and temperature data set for two largest Polish river basins
PublikacjaThe CHASE-PL (Climate change impact assessment for selected sectors in Poland) Forcing Data–Gridded Daily Precipitation & Temperature Dataset–5 km (CPLFD-GDPT5) consists of 1951–2013 daily minimum and maximum air temperatures and precipitation totals interpolated onto a 5 km grid based on daily meteorological observations from the Institute of Meteorology and Water Management (IMGW-PIB; Polish stations), Deutscher Wetterdienst...
-
PERFORMANCE OF ENDOSCOPIC IMAGE ANALYSIS ALGORITHMS IN LARGE BOWEL VIDEOS PROCESSING
PublikacjaComputer-assisted endoscopy is a rapidly developing eld of study. Many image anal- ysis algorithms exist, achieving very high rates of eciency at processing single endoscopic images. However, most of them were never tested in processing real-life endoscopic videos. In the article such tests of 16 endoscopy image analysis algorithms are presented and dis- cussed. Tests were performed on two real-life endoscopic videos of a human...
-
Robustness Analysis of a Distributed MPC Control System of a Turbo-Generator Set of a Nuclear Plant – Disturbance Issues
PublikacjaTypically, there are two main control loops with PI controllers operating at each turbo-generator set. In this paper, a distributed model predictive controller with local quadratic model predictive controllers for the turbine generator is proposed instead of a set of classical PI controllers. The local quadratic predictive controllers utilize step-response models for the controlled system components. The parameters of these models...
-
Tryton Supercomputer Capabilities for Analysis of Massive Data Streams
PublikacjaThe recently deployed supercomputer Tryton, located in the Academic Computer Center of Gdansk University of Technology, provides great means for massive parallel processing. Moreover, the status of the Center as one of the main network nodes in the PIONIER network enables the fast and reliable transfer of data produced by miscellaneous devices scattered in the area of the whole country. The typical examples of such data are streams...
-
DATA JOURNALS AND DATA PAPERS IN VARIOUS RESEARCH AREAS AND SCIENTIFIC DISCIPLINES – BIBLIOMETRIC ANALYSIS BASED ON INCITES
PublikacjaThe main aim of this work is to provide insight into a bibliometric analysis of Data Journals and Data Papers in terms of research areas, disciplines, publication year and country. In particular, we calculated many bibliometric indicators, especially: the number of publications and citations. Furthermore, this work also investigated the top 20 journals in which scientists published the largest number of Data Papers. It was found...
-
Analysis of server-side and client-side Web-GIS data processing methods on the example of JTS and JSTS using open data from OSM and geoportal
PublikacjaThe last decade has seen a rapid evolution of processing, analysis and visualization of freely available geographic data using Open Source Web-GIS. In the beginning, Web-based Geographic Information Systems employed a thick-client approach which required installation of platform-specific browser plugins. Later on, research focus shifted to platform-independent thin client solutions in which data processing and analysis was performed...
-
Ranking metrics in gene set enrichment analysis: do they matter?
Publikacja -
Export diversification and economic development: a dynamic spatial data analysis
PublikacjaThis paper contributes to the empirical literature on the relationship between ‘export variety’ (export diversification) and economic development by relaxing the assumption of cross-country independence and allowing for spatial diffusion of shocks in observed and unobserved factors. Export variety is measured for a balanced panel of 114 countries (1992-2012) using very detailed information on their exports (HS 6-digit product...
-
Analysis of High Resolution Clouds of Points as a Source of Biometric Data
PublikacjaThe article presents the analysis devoted to human face data obtained by means of precise photographic scanners. Collected point clouds were used to make high precision meshes of human face. The essence of these studies is the comparison of relative features as well as the comparison of absolute models which require as precisely as possible matching of face models. The article focuses on the analysis of various parts of the human...
-
Distance learning trends: introducing new solutions to data analysis courses
PublikacjaNowadays data analysis of any kind becomes a piece of art. The same happens with the teaching processes of statistics, econometrics and other related courses. This is not only because we are facing (and are forced to) teach online or in a hybrid mode. Students expect to see not only the theoretical part of the study and solve some practical examples together with the instructor. They are waiting to see a variety of tools, tutorials,...
-
Rediscovering Automatic Detection of Stuttering and Its Subclasses through Machine Learning—The Impact of Changing Deep Model Architecture and Amount of Data in the Training Set
PublikacjaThis work deals with automatically detecting stuttering and its subclasses. An effective classification of stuttering along with its subclasses could find wide application in determining the severity of stuttering by speech therapists, preliminary patient diagnosis, and enabling communication with the previously mentioned voice assistants. The first part of this work provides an overview of examples of classical and deep learning...
-
The use of weight in motion data to heavy vehicles - pavement interaction analysis
PublikacjaWeight in motion (WIM) is the element of modern intelligent transport systems created to control commercial vehicles and to detect overloaded items. Because all of the vehicles crossing the WIM station are weighed and recognized, given data can be used to vehicle-pavement interaction analysis. The paper presents the results of analysis based on data from seven WIM stations localized in Motorway and National Roads in Poland. The...
-
Export diversification and economic development: A dynamic spatial data analysis
PublikacjaThis paper contributes to the empirical literature on the relationship between “export variety” (export diversification) and economic development by relaxing the assumption of cross-country independence and allowing for spatial diffusion of shocks in observed and unobserved factors. Export variety is measured for a balanced panel of 114 countries (1992–2012) using very detailed information on their exports (HS 6-digit product level)....