Search results for: clustering

Categorization of Wikipedia articles with spectral clustering

Publication

J. Szymański

- LECTURE NOTES IN COMPUTER SCIENCE - Year 2011

Abstract. The article reports application of clustering algorithms for creating hierarchical groups withinWikipedia articles.We evaluate three spectral clustering algorithms based on datasets constructed with usage ofWikipedia categories. Selected algorithm has been implemented in the system that categorize Wikipedia search results in the fly.

Categorization of Cloud Workload Types with Clustering

Publication

- Year 2017

The paper presents a new classification schema of IaaS cloud workloads types, based on the functional characteristics. We show the results of an experiment of automatic categorization performed with different benchmarks that represent particular workload types. Monitoring of resource utilization allowed us to construct workload models that can be processed with machine learning algorithms. The direct connection between the functional...

Full text to download in external service

Clustering Context Items into User Trust Levels

Publication

- Advances in Intelligent Systems and Computing - Year 2016

An innovative trust-based security model for Internet systems is proposed. The TCoRBAC model operates on user profiles built on the history of user with system interaction in conjunction with multi-dimensional context information. There is proposed a method of transforming the high number of possible context value variants into several user trust levels. The transformation implements Hierarchical Agglomerative Clustering strategy....

Full text available to download

Weighted Clustering for Bees Detection on Video Images

Publication

- Year 2020

This work describes a bee detection system to monitor bee colony conditions. The detection process on video images has been divided into 3 stages: determining the regions of interest (ROI) for a given frame, scanning the frame in ROI areas using the DNN-CNN classifier, in order to obtain a confidence of bee occurrence in each window in any position and any scale, and form one detection window from a cloud of windows provided by...

Full text available to download

Clustering Bathymetric Data for Electronic Navigational Charts

Publication

M. Wlodarczyk–Sielicka
A. Stateczny

- JOURNAL OF NAVIGATION - Year 2016

Full text to download in external service

Agent-Based Non-distributed and Distributed Clustering

Publication

I. Czarnowski
P. Jȩdrzejowicz

- Year 2009

Full text to download in external service

Information Retrieval with the Use of Music Clustering by Directions Algorithm

Publication

A. Kaczmarek

- Year 2013

This paper introduces the Music Clustering by Directions (MCBD) algorithm. The algorithm is designed to support users of query by humming systems in formulating queries. This kind of systems makes it possible to retrieve songs and tunes on the basis of a melody recorded by the user. The Music Clustering by Directions algorithm is a kind of an interactive query expansion method. On the basis of query, the algorithm provides suggestions...

Full text to download in external service

Development and Research of the Text Messages Semantic Clustering Methodology

Publication

N. Rizun
P. Kapłański
Y. Taranenko

- Year 2016

The methodology of semantic clustering analysis of customer’s text-opinions collection is developed. The author's version of the mathematical models of formalization and practical realization of short textual messages semantic clustering procedure is proposed, based on the customer’s text-opinions collection Latent Semantic Analysis knowledge extracting method. An algorithm for semantic clustering of the text-opinions is developed,...

Full text available to download

External Validation Measures for Nested Clustering of Text Documents

Publication

- Year 2011

Abstract. This article handles the problem of validating the results of nested (as opposed to "flat") clusterings. It shows that standard external validation indices used for partitioning clustering validation, like Rand statistics, Hubert Γ statistic or F-measure are not applicable in nested clustering cases. Additionally to the work, where F-measure was adopted to hierarchical classification as hF-measure, here some methods to...

Spectral Clustering Wikipedia Keyword-Based search Results

Publication

- FRONTIERS IN ROBOTICS AND AI - Year 2017

The paper summarizes our research in the area of unsupervised categorization of Wikipedia articles. As a practical result of our research, we present an application of spectral clustering algorithm used for grouping Wikipedia search results. The main contribution of the paper is a representation method for Wikipedia articles that has been based on combination of words and links and used for categoriation of search results in this...

Full text available to download

Interactive Query Expansion with the Use of Clustering by Directions Algorithm

Publication

A. Kaczmarek

- IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS - Year 2011

This paper concerns Clustering by Directions algorithm. The algorithm introduces a novel approach to interactive query expansion. It is designed to support users of search engines in forming web search queries. When a user executes a query, the algorithm shows potential directions in which the search can be continued. This paper describes the algorithm and it presents an enhancement which reduces the computational complexity of...

A Clustering-Based Methodology for Selection of Fault Tolerance Techniques

Publication

P. Kaczmarek
M. Roman

- LECTURE NOTES IN COMPUTER SCIENCE - Year 2012

Development of dependable applications requires selection of appropriate fault tolerance techniques that balance efficiency in fault handling and resulting consequences, such as increased development cost or performance degradation. This paper describes an advisory system that recommends fault tolerance techniques considering specified development and runtime application attributes. In the selection process, we use the K-means...

Ontology clustering by directions algorithm to expand ontology queries

Publication

A. Kaczmarek

- Year 2009

This paper concerns formulating ontology queries. It describes existing languages in which ontologies can be queried. It focuses on languages which are intended to be easily understood by users who are willing to retrieve information from ontologies. Such a language can be, for example, a type of controlled natural language (CNL). In this paper a novel algorithm called Ontology Clustering by Directions is presented. The algorithm...

K-means clustering for SAT-AIS data analysis

Publication

M. Mieczyńska
I. Czarnowski

- WMU Journal of Maritime Affairs - Year 2021

Full text to download in external service

0-step K-means for clustering Wikipedia search results

Publication

J. Szymański

- Year 2011

This article describes an improvement for K-means algorithm and its application in the form of a system that clusters search results retrieved from Wikipedia. The proposed algorithm eliminates K-means isadvantages and allows one to create a cluster hierarchy. The main contributions of this paper include the ollowing: (1) The concept of an improved K-means algorithm and its application for hierarchical clustering....

Self-Organizing Map representation for clustering Wikipedia search results

Publication

J. Szymański

- LECTURE NOTES IN COMPUTER SCIENCE - Year 2011

The article presents an approach to automated organization of textual data. The experiments have been performed on selected sub-set of Wikipedia. The Vector Space Model representation based on terms has been used to build groups of similar articles extracted from Kohonen Self-Organizing Maps with DBSCAN clustering. To warrant efficiency of the data processing, we performed linear dimensionality reduction of raw data using Principal...

Wyszukiwanie informacji z wykorzystaniem algorytmu Ontology Clustering by Directions

Publication

A. Kaczmarek

- Year 2009

Artykuł opisuje algorytm Ontology Clustering by Directions. Algorytm ten ma na celu wspieranie użytkowników w formułowaniu ontologicznych zapytań. Ontologiczne zapytania służą do wydobywania informacji sformułowanych za pomocą ontologii opisanych np. językiem OWL. Artykuł przedstawia rodzaje języków wykorzystywanych do formułowania ontologicznych zapytań. W szczególności opisuje języki, które mają być przyjazne użytkownikom. Na...

Self–Organizing Map representation for clustering Wikipedia search results

Publication

J. Szymański

- Year 2011

The article presents an approach to automated organization of textual data. The experiments have been performed on selected sub-set of Wikipedia. The Vector Space Model representation based on terms has been used to build groups of similar articles extracted from Kohonen Self-Organizing Maps with DBSCAN clustering. To warrant efficiency of the data processing, we performed linear dimensionality reduction of raw data using Principal...

Full text to download in external service

Automatic Clustering of EEG-Based Data Associated with Brain Activity

Publication

- Year 2018

The aim of this paper is to present a system for automatic assigning electroencephalographic (EEG) signals to appropriate classes associated with brain activity. The EEG signals are acquired from a headset consisting of 14 electrodes placed on skull. Data gathered are first processed by the Independent Component Analysis algorithm to obtain estimates of signals generated by primary sources reflecting the activity of the brain....

Full text to download in external service

Molecular-dynamics simulation of clustering processes in sea-ice floes

Publication

A. Herman

- PHYSICAL REVIEW E - Year 2011

Full text to download in external service

Method for Clustering of Brain Activity Data Derived from EEG Signals

Publication

- FUNDAMENTA INFORMATICAE - Year 2019

A method for assessing separability of EEG signals associated with three classes of brain activity is proposed. The EEG signals are acquired from 23 subjects, gathered from a headset consisting of 14 electrodes. Data are processed by applying Discrete Wavelet Transform (DWT) for the signal analysis and an autoencoder neural network for the brain activity separation. Processing involves 74 wavelets from 3 DWT families: Coiflets,...

Full text available to download

Breast Cancer Heterogeneity Investigation: Multiple k-Means Clustering Approach

Publication

J. Tobiasz
C. Hatzis
J. Polanska

- Year 2019

Full text to download in external service

Impact of Clustering on a Synthetic Instance Generation in Imbalanced Data Streams Classification

Publication

I. Czarnowski
D. Martins

- Year 2022

Full text to download in external service

An Approach for Journal Summarization Using Clustering Based Micro-Summary Generation

Publication

H. Mojeed
U. Sanoh
S. Salihu
A. Balogun
A. Bajeh
A. Akintola
M. Mabayoje
F. Usman-Hamzah
H. A. Mojeed

- Year 2020

Full text to download in external service

Pre-selection and assessment of green organic solvents by clustering chemometric tools

Publication

M. Tobiszewski
M. Nedyalkova
S. Madurga
F. Pena-Pereira
J. Namieśnik
V. Simeonov

- ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY - Year 2018

The study presents the result of the application of chemometric tools for selection of physicochemical parameters of solvents for predicting missing variables – bioconcentration factors, water-octanol and octanol-air partitioning constants. EPI Suite software was successfully applied to predict missing values for solvents commonly considered as “green”. Values for logBCF, logKOW and logKOA were modelled for 43 rather nonpolar solvents...

Full text to download in external service

Fuzzy Divisive Hierarchical Clustering of Solvents According to Their Experimentally and Theoretically Predicted Descriptors

Publication

M. Nedyalkova
C. Sarbu
M. Tobiszewski
V. Simeonov

- Symmetry-Basel - Year 2020

The present study describes a simple procedure to separate into patterns of similarity a large group of solvents, 259 in total, presented by 15 specific descriptors (experimentally found and theoretically predicted physicochemical parameters). Solvent data is usually characterized by its high variability, dierent molecular symmetry, and spatial orientation. Methods of chemometrics can usefully be used to extract and explore accurately...

Full text available to download

Evaluation of Machine Learning Methods for the Experimental Classification and Clustering of Higher Education Institutions

Publication

Ł. Brzezicki
J. Maślankowski

- Year 2022

Higher education institutions have a big impact on the future of skills supplied on the labour market. It means that depending on the changes in labour market, higher education institutions are making changes to fields of study or adding new ones to fulfil the demand on labour market. The significant changes on labour market caused by digital transformation, resulted in new jobs and new skills. Because of the necessity of computer...

Kernel-Based Fuzzy C-Means Clustering Algorithm for RBF Network Initialization

Publication

I. Czarnowski
P. Jędrzejowicz

- Year 2016

Full text to download in external service

The adaptive spatio-temporal clustering method in classifying direct labor costs for the manufacturing industry

Publication

M. Kalinowski
J. Baran
P. Weichbroth

- Year 2021

Employee productivity is critical to the profitability of not only the manufacturing industry. By capturing employee locations using recent advanced tracking devices, one can analyze and evaluate the time spent during a workday of each individual. However, over time, the quantity of the collected data becomes a burden, and decreases the capabilities of efficient classification of direct labor costs. However, the results obtained...

Full text available to download

Interfejs do algorytmu Clustering by Directions ułatwiający formułowanie zapytań w wyszukiwarkach internetowych

Publication

A. Kaczmarek

- Year 2009

Rozdział dotyczy tworzenia zapytań w wyszukiwarkach internetowych. Opisuje sposoby wspierania użytkowników wyszukiwarek w formułowaniu zapytań. Ponadto opisuje zasadę działania opracowanego przez autora algorytmu Clustering by Directions. Algorytm ten przeznaczony jest do wskazywania użytkownikom potencjalnych kierunków, w których mogą kontynuować wyszukiwanie. Kierunki są reprezentowane przez wyrazy, które użytkownik może dodawać...

Increasing K-Means Clustering Algorithm Effectivity for Using in Source Code Plagiarism Detection

Publication

P. Hrkút
M. Ďuračík
M. Mikušová
M. Callejas-cuervo
J. Żukowska

- Year 2019

The problem of plagiarism is becoming increasingly more significant with the growth of Internet technologies and the availability of information resources. Many tools have been successfully developed to detect plagiarisms in textual documents, but the situation is more complicated in the field of plagiarism of source codes, where the problem is equally serious. At present, there are no complex tools available to detect plagiarism...

Impact of the Time Window Length on the Ship Trajectory Reconstruction Based on AIS Data Clustering

Publication

M. Mieczyńska
I. Czarnowski

- Year 2021

Full text to download in external service

Dynamic Re-Clustering Leach-Based (Dr-Leach) Protocol for Wireless Sensor Networks

Publication

A. Ijjeh
A. Ijjeh
H. Al-Issa
S. Thuneibat

- International Journal of Computer Networks & Communications (IJCNC) - Year 2015

Full text to download in external service

Designing RBFNs Structure Using Similarity-Based and Kernel-Based Fuzzy C-Means Clustering Algorithms

Publication

I. Czarnowski
J. Jedrzejowicz
P. Jedrzejowicz

- IEEE Access - Year 2021

Full text to download in external service

Comparison of selected clustering algorithms of raw data obtained by interferometric methods using artificial neural networks

Publication

M. Wlodarczyk-Sielicka
J. Lubczonek
A. Stateczny

- Year 2016

Full text to download in external service

Social learning and knowledge flows in cluster initiatives, In: Sanz S.C., Blanco F.P., Urzelai B. (Eds). Human and Relational Resources (pp. 44-45). the 4th International Conference on Clusters and Industrial Districts CLUSTERING, University of Valencia, Spain, May 23–24 (ISBN: 978-84-09-11926-4).

Publication

M. Rozkwitalska
A. Lis

- Year 2019

Purpose – The purpose of the paper is to explore how learning manifests and knowledge flows in cluster initiatives (CIs) due to interactions undertaken by their members. The paper addresses the research question of how social learning occurs and knowledge flows in CIs. Design/methodology/approach – The qualitative study of four cluster initiatives helped to identify various symptoms of social learning and knowledge flows in...

Web search results clusterization with background knowledge

Publication

J. Szymański

- Year 2009

Clusterization of web pages is an attractive wayfor presenting web resources. Arranging pages into groups ofsimilar topics simplifies and shorten the search process. Thispaper concerns the problem of clustering web pages and presentsour approach to this issue. Our solution is focused on findingsimilarities between documents delivered by different web searchengines. This process was accomplished by applying WordNetdictionary.

Towards Effective Processing of Large Text Collections

Publication

- Year 2012

In the article we describe the approach to parallelimplementation of elementary operations for textual data categorization.In the experiments we evaluate parallel computations ofsimilarity matrices and k-means algorithm. The test datasets havebeen prepared as graphs created from Wikipedia articles relatedwith links. When we create the clustering data packages, wecompute pairs of eigenvectors and eigenvalues for visualizationsof...

Identification, Assessment and Automated Classification of Requirements Engineering Techniques

Publication

- Year 2019

Selection of suitable techniques to be used in requirements engineering or business analysis activities is not easy, especially considering the large number of new proposals that emerged in recent years. This paper provides a summary of techniques recommended by major sources recognized by the industry. A universal attribute structure for the description of techniques is proposed and used to describe 33 techniques most frequently...

Full text available to download

Evaluation of Path Based Methods for Conceptual Representation of the Text

Publication

- Year 2014

Typical text clustering methods use the bag of words (BoW) representation to describe content of documents. However, this method is known to have several limitations. Employing Wikipedia as the lexical knowledge base has shown an improvement of the text representation for data-mining purposes. Promising extensions of that trend employ hierarchical organization of Wikipedia category system. In this paper we propose three path-based...

Full text to download in external service

Factors that strengthen and weaken the identity of the cluster structures

Publication

A. Lis
A. Lis

- Year 2012

The main aim of this paper is the application of "identity" to the issues related to "clustering process" and particularly - to the cooperation in the clusters and the cluster initiatives. The authors distinguish these factors that have the greatest influence on the formation and maintenance of identity in mentioned networks of cooperation.

Full text to download in external service

Analysis and evaluation of grouping methods for effective cutting tool operation

Publication

- Journal of Machine Construction and Maintenance - Year 2018

This article presents the possibilities for using cluster analysis in the assignment of machine tools in automated manufacturing systems. Based on the similarity of manufacturing processes in the system, cutting tools have been grouped. The objective was to obtain groups of similar objects, which could potentially ensure the reduction of the frequency and time of setups, optimizing the maintenance of tool resources and improving...

Full text available to download

General concept of reduction process for big data obtained by interferometric methods

Publication

M. Wlodarczyk-Sielicka
A. Stateczny

- Year 2017

Interferometric sonar systems apply the phase content of the sonar signal to measure the angle of a wave front returned from the seafloor or from a target. It collect a big data – datasets that are so large or complex that traditional data processing application software is inadequate to deal with them. The recording a large number of data is associated with the difficulty of their efficient use. So data have to be reduced. The main...

Full text to download in external service

Image Segmentation of MRI image for Brain Tumor Detection

Publication

K. Ullah

- Year 2020

this research work presents a new technique for brain tumor detection by the combination of Watershed algorithm with Fuzzy K-means and Fuzzy C-means (KIFCM) clustering. The MATLAB based proposed simulation model is used to improve the computational simplicity, noise sensitivities, and accuracy rate of segmentation, detection and extraction from MR...

Komputerowa weryfikacja układów cyfrowych CMOS utworzonych z podukładów zasilanych ze źródeł o różnych wartościach napięcia

Publication

- Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki Politechniki Gdańskiej - Year 2007

W pracy zaprezentowano wyniki komputerowej weryfikacji cyfrowego układu CMOS utworzonego z klastrów, z których każdy jest zasilany odpowiednio malejącymi wartościami napięć. Zbiór klastrów został utworzony przy pomocy algorytmu ECA (Evolutionary Clustering Algorithm) dla potrzeb redukcji mocy pobieranej ze źródła zasilającego. Otrzymane rozwiązanie, charakteryzujące się zmniejszeniem zapotrzebowania na moc, nie powoduje pogorszenia...

Full text available to download

Interactive Information Search in Text Data Collections

Publication

- Year 2013

This article presents a new idea for retrieving in text repositories, as well as it describes general infrastructure of a system created to implement and test those ideas. The implemented system differs from today’s standard search engine by introducing process of interactive search with users and data clustering. We present the basic algorithms behind our system and measures we used for results evaluation. The achieved results...

Full text to download in external service

Retrieval with Semantic Sieve

Publication

- Year 2013

The article presents an algorithm we called Semantic Sieve applied for refining search results in text documents repository. The algorithm calculates socalled conceptual directions that enables interaction with the user and allows to narrow the set of results to the most relevant ones. We present the system where the algorithm has been implemented. The system also offers in the presentation layer clustering of the results into...

Full text to download in external service

System for tracking multiple trains on a test railway track

Publication

- Advances in Intelligent Systems and Computing - Year 2017

Several problems may arise when multiple trains are to be tracked using two IP camera streams. In this work, real-life conditions are simulated using a railway track model based on the Pomeranian Metropolitan Railway (PKM). Application of automatic clustering of optical flow is investigated. A complete tracking solution is developed using background subtraction, blob analysis, Kalman filtering, and a Hungarian algorithm. In total,...

Full text to download in external service

System for tracking multiple trains on a test railway track

Publication

- Year 2017

Several problems may arise when multiple trains are to be tracked using two IP camera streams. In this work, real-life conditions are simulated using a railway track model based on the Pomeranian Metropolitan Railway (PKM). Application of automatic clustering of optical flow is investigated. A complete tracking solution is developed using background subtraction, blob analysis, Kalman filtering, and a Hungarian algorithm. In total,...

Full text to download in external service

Microbial diversity of inflamed and noninflamed gut biopsy tissues in inflammatory bowel disease.

Publication

S. Shadi
R. Kotlowski
C. Bernstein
D. Krause

- INFLAMMATORY BOWEL DISEASES - Year 2016

BACKGROUND: Inflammatory bowel disease (IBD) is a chronic gastrointestinal condition without any known cause or cure. An imbalance in normal gut biota has been identified as an important factor in the inflammatory process. METHODS: Fifty-eight biopsies from Crohn's disease (CD, n = 10), ulcerative colitis (UC, n = 15), and healthy controls (n = 16) were taken from a population-based case-control study. Automated ribosomal intergenic...

Full text to download in external service

Search

Filters

Catalog

Category

Year

Options