Filters
total: 49
Search results for: K NEAREST NEIGHBORS (KNN)
-
Things You Might Not Know about the k-Nearest Neighbors Algorithm
PublicationRecommender Systems aim at suggesting potentially interesting items to a user. The most common kind of Recommender Systems is Collaborative Filtering which follows an intuition that users who liked the same things in the past, are more likely to be interested in the same things in the future. One of Collaborative Filtering methods is the k Nearest Neighbors algorithm which finds k users who are the most similar to an active user...
-
When Neural Networks Meet Decisional DNA: A Promising New Perspective for Knowledge Representation and Sharing
PublicationABSTRACT In this article, we introduce a novel concept combining neural network technology and Decisional DNA for knowledge representation and sharing. Instead of using traditional machine learning and knowledge discovery methods, this approach explores the way of knowledge extraction through deep learning processes based on a domain’s past decisional events captured by Decisional DNA. We compare our approach with kNN (k-nearest...
-
Music Data Processing and Mining in Large Databases for Active Media
PublicationThe aim of this paper was to investigate the problem of music data processing and mining in large databases. Tests were performed on a large data-base that included approximately 30000 audio files divided into 11 classes cor-responding to music genres with different cardinalities. Every audio file was de-scribed by a 173-element feature vector. To reduce the dimensionality of data the Principal Component Analysis (PCA) with variable...
-
Analyzing the Effectiveness of the Brain–Computer Interface for Task Discerning Based on Machine Learning
PublicationThe aim of the study is to compare electroencephalographic (EEG) signal feature extraction methods in the context of the effectiveness of the classification of brain activities. For classification, electroencephalographic signals were obtained using an EEG device from 17 subjects in three mental states (relaxation, excitation, and solving logical task). Blind source separation employing independent component analysis (ICA) was...
-
Two Stage SVM and kNN Text Documents Classifier
PublicationThe paper presents an approach to the large scale text documents classification problem in parallel environments. A two stage classifier is proposed, based on a combination of k-nearest neighbors and support vector machines classification methods. The details of the classifier and the parallelisation of classification, learning and prediction phases are described. The classifier makes use of our method named one-vs-near. It is...
-
News that Moves the Market: DSEX-News Dataset for Forecasting DSE Using BERT
PublicationStock market is a complex and dynamic industry that has always presented challenges for stakeholders and investors due to its unpredictable nature. This unpredictability motivates the need for more accurate prediction models. Traditional prediction models have limitations in handling the dynamic nature of the stock market. Additionally, previous methods have used less relevant data, leading to suboptimal performance. This study...
-
Discrimination of Apple Liqueurs (Nalewka) Using a Voltammetric Electronic Tongue, UV-Vis and Raman Spectroscopy
PublicationThe capability of a phthalocyanine-based voltammetric electronic tongue to analyze strong alcoholic beverages has been evaluated and compared with the performance of spectroscopic techniques coupled to chemometrics. Nalewka Polish liqueurs prepared from five apple varieties have been used as a model of strong liqueurs. Principal Component Analysis has demonstrated that the best discrimination between liqueurs prepared from different...
-
Evaluating the risk of endometriosis based on patients’ self-assessment questionnaires
PublicationBackground Endometriosis is a condition that significantly affects the quality of life of about 10 % of reproductive-aged women. It is characterized by the presence of tissue similar to the uterine lining (endometrium) outside the uterus, which can lead lead scarring, adhesions, pain, and fertility issues. While numerous factors associated with endometriosis are documented, a wide range of symptoms may still be undiscovered. Methods In...
-
A Study on Influence of Normalization Methods on Music Genre Classification Results Employing kNN Algorithms
PublicationThis paper presents a comparison of different normalization methods applied to the set of feature vectors of music pieces. Test results show the influence of min-nlax and Zero-Mean normalization methods, employing different distance functions (Euclidean, Manhattan, Chebyshev, Minkowski) as a pre-processing for genre classification, on k-Nearest Neighbor (kNN) algorithm classification results.
-
Komputerowo wspomagana klasyfikacja wybranych sygnałów elektromiografii powierzchniowej
PublicationWykorzystywanie sygnałów elektromiografii powierzchniowej (ang. Surface Electromyography, SEMG) w procesach sterowania systemami rehabilitacyjnymi stanowi obecnie standardową procedurę. Popularność SEMG wynika z nieinwazyjności metody oraz możliwości szybkiej i precyzyjnej identyfikacji funkcji mięśniowej. W przypadku osób małoletnich proces klasyfikacji sygnałów jest utrudniony ze względu na mniejsze rozmiary i wyższą dynamikę...
-
Music Genre Recognition in the Rough Set-Based Environment
PublicationThe aim of this paper is to investigate music genre recognition in the rough set-based environment. Experiments involve a parameterized music data-base containing 1100 music excerpts. The database is divided into 11 classes cor-responding to music genres. Tests are conducted using the Rough Set Exploration System (RSES), a toolset for analyzing data with the use of methods based on the rough set theory. Classification effectiveness...
-
Voice command recognition using hybrid genetic algorithm
PublicationAbstract: Speech recognition is a process of converting the acoustic signal into a set of words, whereas voice command recognition consists in the correct identification of voice commands, usually single words. Voice command recognition systems are widely used in the military, control systems, electronic devices, such as cellular phones, or by people with disabilities (e.g., for controlling a wheelchair or operating a computer...
-
Vehicle classification based on soft computing algorithms
PublicationExperiments and results regarding vehicle type classification are presented. Three classes of vehicles are recognized: sedans, vans and trucks. The system uses a non-calibrated traffic camera, therefore no direct vehicle dimensions are used. Various vehicle descriptors are tested, including those based on vehicle mask only and those based on vehicle images. The latter ones employ Speeded Up Robust Features (SURF) and gradient images...
-
Investigation of Air Quality beside a Municipal Landfill: The Fate of Malodour Compounds as a Model VOC
PublicationThis paper presents the results of an investigation on ambient air odour quality in the vicinity of a municipal landfill. The investigations were carried out during the spring–winter and the spring seasons using two types of the electronic nose instrument. The field olfactometers were employed to determine the mean odour concentration, which was from 2.1 to 32.2 ou/m3 depending on the measurement site and season of the year. In...
-
From Linear Classifier to Convolutional Neural Network for Hand Pose Recognition
PublicationRecently gathered image datasets and the new capabilities of high-performance computing systems have allowed developing new artificial neural network models and training algorithms. Using the new machine learning models, computer vision tasks can be accomplished based on the raw values of image pixels instead of specific features. The principle of operation of deep neural networks resembles more and more what we believe to be happening...
-
MACHINE LEARNING–BASED ANALYSIS OF ENGLISH LATERAL ALLOPHONES
PublicationAutomatic classification methods, such as artificial neural networks (ANNs), the k-nearest neighbor (kNN) and selforganizing maps (SOMs), are applied to allophone analysis based on recorded speech. A list of 650 words was created for that purpose, containing positionally and/or contextually conditioned allophones. For each word, a group of 16 native and non-native speakers were audio-video recorded, from which seven native speakers’...
-
Highlighting interlanguage phoneme differences based on similarity matrices and convolutional neural network
PublicationThe goal of this research is to find a way of highlighting the acoustic differences between consonant phonemes of the Polish and Lithuanian languages. For this purpose, similarity matrices are employed based on speech acoustic parameters combined with a convolutional neural network (CNN). In the first experiment, we compare the effectiveness of the similarity matrices applied to discerning acoustic differences between consonant...
-
Heavy Duty Vehicle Fuel Consumption Modelling Based on Exploitation Data by Using Artificial Neural Networks
PublicationOne of the ways to improve the fuel economy of heavy duty trucks is to operate the combustion engine in its most efficient operating points. To do that, a mathematical model of the engine is required, which shows the relations between engine speed, torque and fuel consumption in transient states. In this paper, easy accessible exploitation data collected via CAN bus of the heavy duty truck were used to obtain a model of a diesel...
-
Determination of toxic gases based on the responses of a single electrocatalytic sensor and pattern recognition techniques
PublicationA response from an electrocatalytic gas sensor contains fingerprint information about the type of gas and its concentration. As a result, a single gas sensor can be used for the determination of different gases. However, information about the type of gas and its concentration is hidden in the unique shape of the current–voltage response and it is quite difficult to explore. One of the ways to get precise information about the measured...
-
INVESTIGATION OF THE LOMBARD EFFECT BASED ON A MACHINE LEARNING APPROACH
PublicationThe Lombard effect is an involuntary increase in the speaker’s pitch, intensity, and duration in the presence of noise. It makes it possible to communicate in noisy environments more effectively. This study aims to investigate an efficient method for detecting the Lombard effect in uttered speech. The influence of interfering noise, room type, and the gender of the person on the detection process is examined. First, acoustic parameters...
-
Automatic labeling of traffic sound recordings using autoencoder-derived features
PublicationAn approach to detection of events occurring in road traffic using autoencoders is presented. Extensions of existing algorithms of acoustic road events detection employing Mel Frequency Cepstral Coefficients combined with classifiers based on k nearest neighbors, Support Vector Machines, and random forests are used. In our research, the acoustic signal gathered from the microphone placed near the road is split into frames and converted...
-
Virtual spring damper method for nonholonomic robotic swarm self-organization and leader following
PublicationIn this paper, we demonstrate a method for self-organization and leader following of nonholonomic robotic swarm based on spring damper mesh. By self-organization of swarm robots we mean the emergence of order in a swarm as the result of interactions among the single robots. In other words the self-organization of swarm robots mimics some natural behavior of social animals like ants among others. The dynamics of two-wheel robot...
-
Music genre classification applied to bass enhancement for mobile technology
PublicationThe aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) algorithms applied to portable computers. The proposed algorithm is related to intelligent, rule-based setting of synthesis parameters according to music genre of an audio excerpt. The classification of music genres is automatically executed employing MPEG 7 parameters and the Principal Component Analysis method applied to reduce information...
-
Employing Subjective Tests and Deep Learning for Discovering the Relationship between Personality Types and Preferred Music Genres
PublicationThe purpose of this research is two-fold: (a) to explore the relationship between the listeners’ personality trait, i.e., extraverts and introverts and their preferred music genres, and (b) to predict the personality trait of potential listeners on the basis of a musical excerpt by employing several classification algorithms. We assume that this may help match songs according to the listener’s personality in social music networks....
-
Application of autoencoder to traffic noise analysis
PublicationThe aim of an autoencoder neural network is to transform the input data into a lower-dimensional code and then to reconstruct the output from this code representation. Applications of autoencoders to classifying sound events in the road traffic have not been found in the literature. The presented research aims to determine whether such an unsupervised learning method may be used for deploying classification algorithms applied to...
-
Evaluation of aspiration problems in L2 English pronunciation employing machine learning
PublicationThe approach proposed in this study includes methods specifically dedicated to the detection of allophonic variation in English. This study aims to find an efficient method for automatic evaluation of aspiration in the case of Polish second-language (L2) English speakers’ pronunciation when whole words are analyzed instead of particular allophones extracted from words. Sample words including aspirated and unaspirated allophones...
-
Comparison of Classification Methods for EEG Signals of Real and Imaginary Motion
PublicationThe classification of EEG signals provides an important element of brain-computer interface (BCI) applications, underlying an efficient interaction between a human and a computer application. The BCI applications can be especially useful for people with disabilities. Numerous experiments aim at recognition of motion intent of left or right hand being useful for locked-in-state or paralyzed subjects in controlling computer applications....
-
Sampling-based novel heterogeneous multi-layer stacking ensemble method for telecom customer churn prediction
PublicationIn recent times, customer churn has become one of the most significant issues in business-oriented sectors with telecommunication being no exception. Maintaining current customers is particularly valuable due to the high degree of rivalry among telecommunication companies and the costs of acquiring new ones. The early prediction of churned customers may help telecommunication companies to identify the causes of churn and design...
-
Empirical analysis of tree-based classification models for customer churn prediction
PublicationCustomer churn is a vital and reoccurring problem facing most business industries, particularly the telecommunications industry. Considering the fierce competition among telecommunications firms and the high expenses of attracting and gaining new subscribers, keeping existing loyal subscribers becomes crucial. Early prediction of disgruntled subscribers can assist telecommunications firms in identifying the reasons for churn and...
-
Influence of accelerometer signal pre-processing and classification method on human activity recognition
PublicationA study of data pre-processing influence on accelerometer-based human activity recognition algorithms is presented. The frequency band used to filter-out the accelerometer signals and the number of accelerometers involved were considered in terms of their influence on the recognition accuracy. In the test four methods of classification were used: support vector machine, decision trees, neural network, k-nearest neighbor.
-
Spatial pattern of ASG-EUPOS sites
PublicationThe article presents the spatial pattern analysis of the ASG-EUPOS permanent GNSS stations in Poland. Using different methods and tools (nearest neighbour, Riplay’s K-function, morphology of Thiessen polygons) we proved that the station distribution model changes within scales. At short distances up to 65 km, which are typical lengths in the network, stations are irregularly dispersed. Increasing this distance to 130 km and over...
-
Comparison of the measurement techniques employed for evaluation of ambient air odour quality
PublicationThe paper presents the results of investigation on ambient air odour quality in a vicinity of the industrial sewage treatment plant being a part of the crude oil processing plant. The investigation was performed during spring-winter season using a prototype of electronic nose and the Nasal Ranger field olfactometers. The prototype was equipped with a set of six semiconductor sensors by FIGARO Co. and one PID-type sensor. The field...
-
Basic evaluation of limb exercises based on electromyography and classification methods
PublicationSymptoms caused by cerebral palsy or stroke deprive a person partially or even completely of his ability to move. Nowadays we can observe more technologically advanced rehabilitation devices which incorporate biofeedback into the process of rehabilitation of such people. However, there is still a lack of devices that would analyse, assess, and control (independently or with limited support) specialised movement exercises. Here...
-
In uence of Low-Level Features Extracted from Rhythmic and Harmonic Sections on Music Genre Classi cation
PublicationWe present a comprehensive evaluation of the infuence of 'harmonic' and rhythmic sections contained in an audio file on automatic music genre classi cation. The study is performed using the ISMIS database composed of music files, which are represented by vectors of acoustic parameters describing low-level music features. Non-negative Matrix Factorization serves for blind separation of instrument components. Rhythmic components...
-
Mining Knowledge of Respiratory Rate Quantification and Abnormal Pattern Prediction
PublicationThe described application of granular computing is motivated because cardiovascular disease (CVD) remains a major killer globally. There is increasing evidence that abnormal respiratory patterns might contribute to the development and progression of CVD. Consequently, a method that would support a physician in respiratory pattern evaluation should be developed. Group decision-making, tri-way reasoning, and rough set–based analysis...
-
Fast collaborative graph exploration
PublicationWe study the following scenario of online graph exploration. A team of k agents is initially located at a distinguished vertex r of an undirected graph. At every time step, each agent can traverse an edge of the graph. All vertices have unique identifiers, and upon entering a vertex, an agent obtains the list of identifiers of all its neighbors. We ask how many time steps are required to complete exploration, i.e., to make sure...
-
Fast Collaborative Graph Exploration
PublicationWe study the following scenario of online graph exploration. A team of k agents is initially located at a distinguished vertex r of an undirected graph. At every time step, each agent can traverse an edge of the graph. All vertices have unique identifiers, and upon entering a vertex, an agent obtains the list of identifiers of all its neighbors. We ask how many time steps are required to complete exploration, i.e., to make sure...
-
Quasi-one-dimensional exchange interactions and short-range magnetic correlations in CuTeO4
PublicationCuTeO4 has been proposed as a crystallographically distinct, yet electronic structure analog, of the superconducting cuprates. Here, we present a detailed characterization of the physical properties of CuTeO4 to address this proposal. Fitting of magnetic susceptibility data indicates unexpected quasi-one-dimensional, antiferromagnetic correlations at high temperature, with a nearest-neighbor Heisenberg exchange of 1=164(5) K....
-
Rediscovering Automatic Detection of Stuttering and Its Subclasses through Machine Learning—The Impact of Changing Deep Model Architecture and Amount of Data in the Training Set
PublicationThis work deals with automatically detecting stuttering and its subclasses. An effective classification of stuttering along with its subclasses could find wide application in determining the severity of stuttering by speech therapists, preliminary patient diagnosis, and enabling communication with the previously mentioned voice assistants. The first part of this work provides an overview of examples of classical and deep learning...
-
Solvation of ionic liquids based on N-alkyl-N-methylmorpholinium cations in N,N-dimethylformamide and dimethyl sulfoxide – A volumetric and acoustic study
PublicationApparent molar volumes and molar isentropic compressibilities for N-ethyl-N-methylmorpholinium and N-decyl-N-methylmorpholinium tetrafluoroborates in N,N-dimethylformamide (DMF) and dimethyl sulfoxide (DMSO) have been determined from density and speed of sound measurements over the temperature range T = (298.15, 303.15, 308.15, 313.15, 318.15 and 328.15) K at experimental pressure p = 0.1 MPa. This data has been used to calculate...
-
Bass Enhancement Settings in Portable Devices Based on Music Genre Recognition
PublicationThe paper presents a novel approach to the Virtual Bass Synthesis (VBS) applied to mobile devices, called Smart VBS (SVBS). The proposed algorithm uses an intelligent, rule-based setting of bass synthesis parameters adjusted to the particular music genre. Harmonic generation is based on a nonlinear device (NLD) method with the intelligent controlling system adapting to the recognized music genre. To automatically classify music...
-
Feasibility Study for Food Intake Tasks Recognition Based on Smart Glasses
PublicationIn this exploratory study 13 adult test subjects have performed different food intake tasks while wearing a three axis accelerometer mounted at a temple of glasses. Two different algorithms for task recognition have been applied and compared. The retrospective data processing leads to better task recognition results when the frequency range of 50 Hz to 100 Hz is analysed within accelerometer signal recordings. A straightforward...
-
Smart Virtual Bass Synthesis Algorithm Based on Music Genre Classification
PublicationThe aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) algorithms applied to portable computers. The proposed algorithm employed automatic music genre recognition to determine the optimum parameters for the synthesis of additional frequencies. The synthesis was carried out using the non-linear device (NLD) and phase vocoder (PV) methods depending on the music excerpt genre. Classification of musical...
-
Evaluation of a Novel Approach to Virtual Bass Synthesis Strategy
PublicationThe aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) strategy applied to portable computers. The developed algorithms involve intelligent, rule-based settings of bass synthesis parameters with regard to music genre of an audio excerpt and the type of a portable device in use. The Smart VBS algorithm performs the synthesis based on a nonlinear device (NLD) with artificial controlling synthesis...
-
TF-IDF weighted bag-of-words preprocessed text documents from Simple English Wikipedia
Open Research DataThe SimpleWiki2K-scores dataset contains TF-IDF weighted bag-of-words preprocessed text documents (raw strings are not available) [feature matrix] and their multi-label assignments [label-matrix]. Label scores for each document are also provided for an enhanced multi-label KNN [1] and LEML [2] classifiers. The aim of the dataset is to establish a benchmark...
-
A new method for real-time monitoring of volatiles in frying fumes using proton transfer reaction mass spectrometry with time-of-flight analyse
PublicationTo safeguard the consumers’ well-being, it is necessary to develop novel methods for determination of carcinogens in food, including volatiles generated during frying. The currently used procedures for analysis of volatile fraction of vegetable oils are not based on real-time measurements and thus do not enable the determination of carcinogenic compounds in frying fumes; instead, only the headspace or liquid fraction is sampled....
-
Spatial variability of throughfall in a stand of Scots pine (Pinus sylvestris L.) with deciduous admixture as influenced by canopy cover and stem distance
PublicationVegetation cover affects the amount of precipitation, its chemical composition and its spatial distribution, and this may have implications for the distribution of water, nutrients and contaminants in the subsurface soil layer. The aim of this study was a detailed diagnosis of the spatio-temporal variability in the amount of throughfall (TF) and its chemical components in a 72-year-old pine stand with an admixture of oak and birch....
-
Topology recognition and leader election in colored networks
PublicationTopology recognition and leader election are fundamental tasks in distributed computing in networks. The first of them requires each node to find a labeled isomorphic copy of the network, while the result of the second one consists in a single node adopting the label 1 (leader), with all other nodes adopting the label 0 and learning a path to the leader. We consider both these problems in networks whose nodes are equipped with...
-
The effect of boron concentration on the electrical, morphological and optical properties of boron-doped nanocrystalline diamond sheets: Tuning the diamond-on-graphene vertical junction
PublicationIn this paper, the effect of boron doping on the electrical, morphological and structural properties of free-standing nanocrystalline diamond sheets (thickness ~ 1 μm) was investigated. For this purpose, we used diamond films delaminated from a mirror-polished tantalum substrate following a microwave plasma-assisted chemical vapor deposition process, each grown with a different [B]/[C] ratio (up to 20,000 ppm) in the gas phase....