Filtry
wszystkich: 24
Wyniki wyszukiwania dla: knn
-
Two Stage SVM and kNN Text Documents Classifier
PublikacjaThe paper presents an approach to the large scale text documents classification problem in parallel environments. A two stage classifier is proposed, based on a combination of k-nearest neighbors and support vector machines classification methods. The details of the classifier and the parallelisation of classification, learning and prediction phases are described. The classifier makes use of our method named one-vs-near. It is...
-
Improving css-KNN Classification Performance by Shifts in Training Data
PublikacjaThis paper presents a new approach to improve the performance of a css-k-NN classifier for categorization of text documents. The css-k-NN classifier (i.e., a threshold-based variation of a standard k-NN classifier we proposed in [1]) is a lazy-learning instance-based classifier. It does not have parameters associated with features and/or classes of objects, that would be optimized during off-line learning. In this paper we propose...
-
A Study on Influence of Normalization Methods on Music Genre Classification Results Employing kNN Algorithms
PublikacjaThis paper presents a comparison of different normalization methods applied to the set of feature vectors of music pieces. Test results show the influence of min-nlax and Zero-Mean normalization methods, employing different distance functions (Euclidean, Manhattan, Chebyshev, Minkowski) as a pre-processing for genre classification, on k-Nearest Neighbor (kNN) algorithm classification results.
-
Voice command recognition using hybrid genetic algorithm
PublikacjaAbstract: Speech recognition is a process of converting the acoustic signal into a set of words, whereas voice command recognition consists in the correct identification of voice commands, usually single words. Voice command recognition systems are widely used in the military, control systems, electronic devices, such as cellular phones, or by people with disabilities (e.g., for controlling a wheelchair or operating a computer...
-
Investigation of Air Quality beside a Municipal Landfill: The Fate of Malodour Compounds as a Model VOC
PublikacjaThis paper presents the results of an investigation on ambient air odour quality in the vicinity of a municipal landfill. The investigations were carried out during the spring–winter and the spring seasons using two types of the electronic nose instrument. The field olfactometers were employed to determine the mean odour concentration, which was from 2.1 to 32.2 ou/m3 depending on the measurement site and season of the year. In...
-
MACHINE LEARNING–BASED ANALYSIS OF ENGLISH LATERAL ALLOPHONES
PublikacjaAutomatic classification methods, such as artificial neural networks (ANNs), the k-nearest neighbor (kNN) and selforganizing maps (SOMs), are applied to allophone analysis based on recorded speech. A list of 650 words was created for that purpose, containing positionally and/or contextually conditioned allophones. For each word, a group of 16 native and non-native speakers were audio-video recorded, from which seven native speakers’...
-
When Neural Networks Meet Decisional DNA: A Promising New Perspective for Knowledge Representation and Sharing
PublikacjaABSTRACT In this article, we introduce a novel concept combining neural network technology and Decisional DNA for knowledge representation and sharing. Instead of using traditional machine learning and knowledge discovery methods, this approach explores the way of knowledge extraction through deep learning processes based on a domain’s past decisional events captured by Decisional DNA. We compare our approach with kNN (k-nearest...
-
Heavy Duty Vehicle Fuel Consumption Modelling Based on Exploitation Data by Using Artificial Neural Networks
PublikacjaOne of the ways to improve the fuel economy of heavy duty trucks is to operate the combustion engine in its most efficient operating points. To do that, a mathematical model of the engine is required, which shows the relations between engine speed, torque and fuel consumption in transient states. In this paper, easy accessible exploitation data collected via CAN bus of the heavy duty truck were used to obtain a model of a diesel...
-
Music Data Processing and Mining in Large Databases for Active Media
PublikacjaThe aim of this paper was to investigate the problem of music data processing and mining in large databases. Tests were performed on a large data-base that included approximately 30000 audio files divided into 11 classes cor-responding to music genres with different cardinalities. Every audio file was de-scribed by a 173-element feature vector. To reduce the dimensionality of data the Principal Component Analysis (PCA) with variable...
-
Usuwanie tła w video nagraniach pochodzących z monitorowania basenu pływackiego
PublikacjaAutomatyczna obróbka obrazu w czasie rzeczywistym jest kluczowa dla wielu rozwiązań monitoringu wykorzystywanych m.in. w celach bezpieczeństwa. Często jednym z ważniejszych etapów obróbki jest oddzielenie tła od obiektów na pierwszym planie, tak aby wykluczyć wszystkie nieistotne informacje z obrazu. Celem niniejszej pracy jest podsumowanie doświadczenia zdobytego podczas śledzenia pływaków oraz pokazanie możliwości skutecznego...
-
Comparative analysis of various transformation techniques for voiceless consonants modeling
PublikacjaIn this paper, a comparison of various transformation techniques, namely Discrete Fourier Transform (DFT), Discrete Cosine Transform (DCT) and Discrete Walsh Hadamard Transform (DWHT) are performed in the context of their application to voiceless consonant modeling. Speech features based on these transformation techniques are extracted. These features are mean and derivative values of cepstrum coefficients, derived from each transformation....
-
Analyzing the Effectiveness of the Brain–Computer Interface for Task Discerning Based on Machine Learning
PublikacjaThe aim of the study is to compare electroencephalographic (EEG) signal feature extraction methods in the context of the effectiveness of the classification of brain activities. For classification, electroencephalographic signals were obtained using an EEG device from 17 subjects in three mental states (relaxation, excitation, and solving logical task). Blind source separation employing independent component analysis (ICA) was...
-
Monitoring of odour nuisance in the Tricity Agglomeration
PublikacjaThe paper describes a principle of operation of odour nuisance monitoring network, which is being designed in the Tricity Agglomeration. Moreover, it presents the preliminary results of an investigation on ambient air quality with respect to odour nuisance in a vicinity of the municipal landfill. The investigation was performed during spring-winter season using a prototype of electronic nose and the Nasal Ranger field olfactometers. The...
-
Comparison of the measurement techniques employed for evaluation of ambient air odour quality
PublikacjaThe paper presents the results of investigation on ambient air odour quality in a vicinity of the industrial sewage treatment plant being a part of the crude oil processing plant. The investigation was performed during spring-winter season using a prototype of electronic nose and the Nasal Ranger field olfactometers. The prototype was equipped with a set of six semiconductor sensors by FIGARO Co. and one PID-type sensor. The field...
-
The Hough transform in the classification process of inland ships
PublikacjaThis article presents an analysis of the possibilities of using image processing methods for feature extraction that allows kNN classification based on a ship’s image delivered from an on-water video surveillance system. The subject of the analysis is the Hough transform which enables the detection of straight lines in an image. The recognized straight lines and the information about them serve as features in the classification...
-
An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics
PublikacjaThe speech with the Lombard effect has been extensively studied in the context of speech recognition or speech enhancement. However, few studies have investigated the Lombard effect in the context of speech synthesis. The aim of this paper is to create a mathematical model that allows for retaining the Lombard effect. These models could be used as a basis of a formant speech synthesizer. The proposed models are based on dividing...
-
TF-IDF weighted bag-of-words preprocessed text documents from Simple English Wikipedia
Dane BadawczeThe SimpleWiki2K-scores dataset contains TF-IDF weighted bag-of-words preprocessed text documents (raw strings are not available) [feature matrix] and their multi-label assignments [label-matrix]. Label scores for each document are also provided for an enhanced multi-label KNN [1] and LEML [2] classifiers. The aim of the dataset is to establish a benchmark...
-
News that Moves the Market: DSEX-News Dataset for Forecasting DSE Using BERT
PublikacjaStock market is a complex and dynamic industry that has always presented challenges for stakeholders and investors due to its unpredictable nature. This unpredictability motivates the need for more accurate prediction models. Traditional prediction models have limitations in handling the dynamic nature of the stock market. Additionally, previous methods have used less relevant data, leading to suboptimal performance. This study...
-
Evaluating the risk of endometriosis based on patients’ self-assessment questionnaires
PublikacjaBackground Endometriosis is a condition that significantly affects the quality of life of about 10 % of reproductive-aged women. It is characterized by the presence of tissue similar to the uterine lining (endometrium) outside the uterus, which can lead lead scarring, adhesions, pain, and fertility issues. While numerous factors associated with endometriosis are documented, a wide range of symptoms may still be undiscovered. Methods In...
-
Discrimination of Apple Liqueurs (Nalewka) Using a Voltammetric Electronic Tongue, UV-Vis and Raman Spectroscopy
PublikacjaThe capability of a phthalocyanine-based voltammetric electronic tongue to analyze strong alcoholic beverages has been evaluated and compared with the performance of spectroscopic techniques coupled to chemometrics. Nalewka Polish liqueurs prepared from five apple varieties have been used as a model of strong liqueurs. Principal Component Analysis has demonstrated that the best discrimination between liqueurs prepared from different...
-
Empirical analysis of tree-based classification models for customer churn prediction
PublikacjaCustomer churn is a vital and reoccurring problem facing most business industries, particularly the telecommunications industry. Considering the fierce competition among telecommunications firms and the high expenses of attracting and gaining new subscribers, keeping existing loyal subscribers becomes crucial. Early prediction of disgruntled subscribers can assist telecommunications firms in identifying the reasons for churn and...
-
Sampling-based novel heterogeneous multi-layer stacking ensemble method for telecom customer churn prediction
PublikacjaIn recent times, customer churn has become one of the most significant issues in business-oriented sectors with telecommunication being no exception. Maintaining current customers is particularly valuable due to the high degree of rivalry among telecommunication companies and the costs of acquiring new ones. The early prediction of churned customers may help telecommunication companies to identify the causes of churn and design...
-
In uence of Low-Level Features Extracted from Rhythmic and Harmonic Sections on Music Genre Classi cation
PublikacjaWe present a comprehensive evaluation of the infuence of 'harmonic' and rhythmic sections contained in an audio file on automatic music genre classi cation. The study is performed using the ISMIS database composed of music files, which are represented by vectors of acoustic parameters describing low-level music features. Non-negative Matrix Factorization serves for blind separation of instrument components. Rhythmic components...
-
Inteligentna Synteza Niskich Częstotliwości w urządzeniach mobilnych
PublikacjaW pracy przedstawiono algorytm inteligentnej adaptacji parametrów syntezy niskich częstotliwości w urządzeniach przenośnych w zależności od odtwarzanego gatunku muzycznego (Smart VBS). Proponowany algorytm wykorzystuje metody generacji harmonicznych oparte na generatorze funkcji nieliniowych (NLD) i wokoderze fazowym (PV). Dla znalezienia optymalnych parametrów syntezy przeprowadzono testy subiektywne sprawdzające powiązanie parametrów...