displaying 1000 best results Help
Search results for: AUTOMATIC GENRE CLASSIFICATION
-
Classifying type of vehicles on the basis of data extracted from audio signal characteristics
PublicationThe aim of this study is to find and optimize a feature vector for an automatic recognition of the type of vehicles, extracted form an audio signal. First, the influence of weather-based conditions of road surface on spectral characteristic of the audio signal recorded from a passing vehicle in close proximity to the road is discussed. Next, parameterization of the recorded audio signal is performed. For that purpose, the MIRtoolbox,...
-
Automatic Singing Voice Recognition EmployingNeural Networks and Rough Sets
PublicationCelem badań jest automatyczne rozpoznawanie głosów śpiewaczych w kategorii rodzaju i jakości technicznej śpiewu. W artykule opisano stworzoną bazę danych głosów, która zawiera próbki głosu śpiewaków profesjonalnych i amatorskich. W dalszej części opisano parametry zdefiniowane w oparciu o zjawiska biomechaniczne w narządzie głosu podczas śpiewania. W oparciu o stworzone macierze parametrów wytrenowano i porównano automatyczne klasyfikatory...
-
Computed aided system for separation and classification of the abnormal erythrocytes in human blood
PublicationThe human peripheral blood consists of cells (red cells, white cells, and platelets) suspended in plasma. In the following research the team assessed an influence of nanodiamond particles on blood elements over various periods of time. The material used in the study consisted of samples taken from ten healthy humans of various age, different blood types and both sexes. The markings were leaded by adding to the blood unmodified...
-
Efficiency of Artificial Intelligence Methods for Hearing Loss Type Classification: an Evaluation
PublicationThe evaluation of hearing loss is primarily conducted by pure tone audiometry testing, which is often regarded as golden standard for assessing auditory function. If the presence of hearing loss is determined, it is possible to differentiate between three types of hearing loss: sensorineural, conductive, and mixed. This study presents a comprehensive comparison of a variety of AI classification models, performed on 4007 pure tone...
-
Deep neural networks approach to skin lesions classification — A comparative analysis
PublicationThe paper presents the results of research on the use of Deep Neural Networks (DNN) for automatic classification of the skin lesions. The authors have focused on the most effective kind of DNNs for image processing, namely Convolutional Neural Networks (CNN). In particular, three kinds of CNN were analyzed: VGG19, Residual Networks (ResNet) and the hybrid of VGG19 CNN with the Support Vector Machine (SVM). The research was carried...
-
Evaluation of a Novel Approach to Virtual Bass Synthesis Strategy
PublicationThe aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) strategy applied to portable computers. The developed algorithms involve intelligent, rule-based settings of bass synthesis parameters with regard to music genre of an audio excerpt and the type of a portable device in use. The Smart VBS algorithm performs the synthesis based on a nonlinear device (NLD) with artificial controlling synthesis...
-
Automatic Clustering of EEG-Based Data Associated with Brain Activity
PublicationThe aim of this paper is to present a system for automatic assigning electroencephalographic (EEG) signals to appropriate classes associated with brain activity. The EEG signals are acquired from a headset consisting of 14 electrodes placed on skull. Data gathered are first processed by the Independent Component Analysis algorithm to obtain estimates of signals generated by primary sources reflecting the activity of the brain....
-
Selection of an artificial pre-training neural network for the classification of inland vessels based on their images
PublicationArtificial neural networks (ANN) are the most commonly used algorithms for image classification problems. An image classifier takes an image or video as input and classifies it into one of the possible categories that it was trained to identify. They are applied in various areas such as security, defense, healthcare, biology, forensics, communication, etc. There is no need to create one’s own ANN because there are several pre-trained...
-
Dangerous sound event recognition using Support Vector Machine classifiers
PublicationA method of recognizing events connected to danger based on their acoustic representation through Support Vector Machine classification is presented. The method proposed is particularly useful in an automatic surveillance system. The set of 28 parameters used in the classifier consists of dedicated parameters and MPEG-7 features. Methods for parameter calculation are presented, as well as a design of SVM model used for classification....
-
Specification-Oriented Automatic Design of Topologically Agnostic Antenna Structure
PublicationDesign of antennas for modern applications is a challenging task that combines cognition-driven development of topology intertwined with tuning of its parameters using rigorous numerical optimization. However, the process can be streamlined by neglecting the engineering insight in favor of automatic de-termination of structure geometry. In this work, a specification-oriented design of topologically agnostic antenna is considered....
-
Mask Detection and Classification in Thermal Face Images
PublicationFace masks are recommended to reduce the transmission of many viruses, especially SARS-CoV-2. Therefore, the automatic detection of whether there is a mask on the face, what type of mask is worn, and how it is worn is an important research topic. In this work, the use of thermal imaging was considered to analyze the possibility of detecting (localizing) a mask on the face, as well as to check whether it is possible to classify...
-
SYNAT_PCA_48
Open Research DataThere is a series of datasets containing feature vectors derived from music tracks. The dataset contains 51582 music tracks (22 music genres) and feature vector after Principal Component Analysis (PCA) performing, so there are 48-element vectors derived from music excerpts. Originally, a feature vector containing 173 elements was conceived in earlier...
-
SYNAT_PCA_11
Open Research DataThe dataset contains 51582 music tracks (22 music genres) and feature vector after Principal Component Analysis (PCA) performing, so there are 11-element vectors derived from music excerpts. Originally, a feature vector containing 173 elements was conceived in earlier research studies carried out by the team of authors [1-6]. A collection of more than...
-
ReFlexeNN - the Wearable EMG Interface with Neural Network Based Gesture Classification
PublicationThe electromyographic activity of muscles was measured using a wireless biofeedback device. The aim of the study was to examine the possibility of creating an automatic muscle tension classifier. Several measurement series were conducted and the participant performed simple physical exercises - forcing the muscle to increase its activity accordingly to the selected scale. A small wireless device was attached to the electrodes placed...
-
An Approach to Bass Enhancement in Portable Computers Employing Smart Virtual Bass Synthesis Algorithms
PublicationThe aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) algorithms applied to portable computers. The developed algorithms are related to intelligent, rule-based setting of synthesis parameters according to music genre of an audio excerpt and to the type of a portable device in use. To find optimum synthesis parameters of the VBS algorithms, subjective listening tests based on a parametric procedure...
-
MACHINE LEARNING–BASED ANALYSIS OF ENGLISH LATERAL ALLOPHONES
PublicationAutomatic classification methods, such as artificial neural networks (ANNs), the k-nearest neighbor (kNN) and selforganizing maps (SOMs), are applied to allophone analysis based on recorded speech. A list of 650 words was created for that purpose, containing positionally and/or contextually conditioned allophones. For each word, a group of 16 native and non-native speakers were audio-video recorded, from which seven native speakers’...
-
SYNAT_MUSIC_GENRE_FV_173
Open Research DataThis is the original dataset containing 51582 music tracks (22 music genres) and 173 element-feature vector [1-6,9]. A collection of more than 50000 music excerpts described with a set of descriptors obtained through the analysis of 30-second mp3 recordings was gathered in a database called SYNAT. The SYNAT database was realized by the Gdansk University...
-
Estimation of object size in the calibrated camera image = Estymacja rozmiaru obiektów w obrazach ze skalibrowanej kamery
PublicationIn the paper, a method of estimation of the physical sizes of the objects tracked by the camera is presented. First, the camera is calibrated, then the proposed algorithm is used to estimate the real width and height of the tracked moving objects. The results of size estimation are then used for classification of the moving objects. Two methods of camera calibration are compared, test results are presented and discussed. The proposed...
-
Further developments of parameterization methods of audio stream analysis for secuirty purposes
PublicationThe paper presents an automatic sound recognition algorithm intended for application in an audiovisual security monitoring system. A distributed character of security systems does not allow for simultaneous observation of multiple multimedia streams, thus an automatic recognition algorithm must be introduced. In the paper, a module for the parameterization and automatic detection of audio events is described. The spectral analyses...
-
MACHINE LEARNING SYSTEM FOR AUTOMATED BLOOD SMEAR ANALYSIS
PublicationIn this paper the authors propose a decision support system for automatic blood smear analysis based on microscopic images. The images are pre-processed in order to remove irrelevant elements and to enhance the most important ones - the healthy blood cells (erythrocytes) and the pathologic (echinocytes). The separated blood cells are analyzed in terms of their most important features by the eigenfaces method. The features are the...
-
Audio-visual surveillance system for application in bank operating room
PublicationAn audio-visual surveillance system able to detect, classify and to localize acoustic events in a bank operating room is presented. Algorithms for detection and classification of abnormal acoustic events, such as screams or gunshots are introduced. Two types of detectors are employed to detect impulsive sounds and vocal activity. A Support Vector Machine (SVM) classifier is used to discern between the different classes of acoustic...
-
Processing of acoustical data in a multimodal bank operating room surveillance system
PublicationAn automatic surveillance system capable of detecting, classifying and localizing acoustic events in a bank operating room is presented. Algorithms for detection and classification of abnormal acoustic events, such as screams or gunshots are introduced. Two types of detectors are employed to detect impulsive sounds and vocal activity. A Support Vector Machine (SVM) classifier is used to discern between the different classes of...
-
Automatic localization and continous tracking of mobile sound source using passive acoustic radar
PublicationA concept, practical realization and applications of the passive acoustic radar for localization and continuous tracking of fixed and mobile sound sources such as: cars, trucks, aircrafts and sources of shooting, explosions were presented in the paper. The device consists of the new kind of multi-channel miniature three dimensional sound intensity sensors invented by the Microflown company and a group of digital signal processing...
-
Offshore benthic habitat mapping based on object-based image analysis and geomorphometric approach. A case study from the Slupsk Bank, Southern Baltic Sea
PublicationBenthic habitat mapping is a rapidly growing field of underwater remote sensing studies. This study provides the first insight for high-resolution hydroacoustic surveys in the Slupsk Bank Natura 2000 site, one of the most valuable sites in the Polish Exclusive Zone of the Southern Baltic. This study developed a quick and transparent, automatic classification workflow based on multibeam echosounder and side-scan sonar surveys to...
-
Speech Analytics Based on Machine Learning
PublicationIn this chapter, the process of speech data preparation for machine learning is discussed in detail. Examples of speech analytics methods applied to phonemes and allophones are shown. Further, an approach to automatic phoneme recognition involving optimized parametrization and a classifier belonging to machine learning algorithms is discussed. Feature vectors are built on the basis of descriptors coming from the music information...
-
Categorization of Cloud Workload Types with Clustering
PublicationThe paper presents a new classification schema of IaaS cloud workloads types, based on the functional characteristics. We show the results of an experiment of automatic categorization performed with different benchmarks that represent particular workload types. Monitoring of resource utilization allowed us to construct workload models that can be processed with machine learning algorithms. The direct connection between the functional...
-
Fault diagnosis of marine 4-stroke diesel engines using a one-vs-one extreme learning ensemble
PublicationThis paper proposes a novel approach for intelligent fault diagnosis for stroke Diesel marine engines, which are commonly used in on-road and marine transportation. The safety and reliability of a ship's work rely strongly on the performance of such an engine; therefore, early detection of any type of failure that affects the engine is of crucial importance. Automatic diagnostic systems are of special importance because they can...
-
Generowanie modeli symulacyjnych na potrzeby systemu ekspertowego wspomagającego projektowanie układów automatyki statku
PublicationOmówiono automatyczne generowanie modeli symulacyjnych na potrzeby systemu ekspertowego wspomagającego projektowanie układów automatyki statków. Na podstawie przyjętych założeń projektowych system ekspertowy zleca badania wybranych struktur podsystemów elektroenergetycznych statków. Aplikacja symulacyjna pobiera z biblioteki modele matematyczne elementów składowych struktur, a następnie zestawia modele symulacyjne, wykonuje badania...
-
Visual Features for Improving Endoscopic Bleeding Detection Using Convolutional Neural Networks
PublicationThe presented paper investigates the problem of endoscopic bleeding detection in endoscopic videos in the form of a binary image classification task. A set of definitions of high-level visual features of endoscopic bleeding is introduced, which incorporates domain knowledge from the field. The high-level features are coupled with respective feature descriptors, enabling automatic capture of the features using image processing methods....
-
POTENCJALNE MOŻLIWOŚCI APLIKACJ TECHNIKI E-NOS W DIAGNOSTYCE MEDYCZNEJ=APPLICATION POTENTIALITIES OF E-NOSE TECHNIQUE IN MEDICAL DIAGNOSTICS
PublicationW pracy przedstawiono i omówiono zasadę działania instrumentu analitycznego - elektronicznego nosa (e-nos) zdolnego rozróżnić i sklasyfikować intensywność zapachu. Urządzenia te służą do automatycznej analizy i rozróżniania próbek zapachowych o złożonym składzie, do rozpoznawania ich charakterystycznych właściwości i najczęściej przeznaczone są do szybkiej analizy jakościowej. Dzięki unikatowym właściwościom technika ta znalazła...
-
Multi-Stage Video Analysis Framework
PublicationThe chapter is organized as follows. Section 2 presents the general structure of the proposed framework and a method of data exchange between system elements. Section 3 is describing the low-level analysis modules for detection and tracking of moving objects. In Section 4 we present the object classification module. Sections 5 and 6 describe specialized modules for detection and recognition of faces and license plates, respectively....
-
Klasyfikacja sygnału EKG przy użyciu konwolucyjnych sieci neuronowych
PublicationAutomation and improvement of diagnostic process is a vital element of medicine development and patient’s condition self-control. For a long time different ECG signal classification methods exist and are successfully applied, nevertheless their accuracy is not always satisfying enough. The lack of identification of an existing abnormality, which is very similar to a normal heartbeat is the biggest issue - for example premature...
-
Klasyfikacja sygnału EKG przy użyciu konwolucyjnych sieci neuronowych
PublicationAutomation and improvement of diagnostic process is a vital element of medicine development and patient’s condition self-control. For a long time different ECG signal classification methods exist and are successfully applied, nevertheless their accuracy is not always satisfying enough. The lack of identification of an existing abnormality, which is very similar to a normal heartbeat is the biggest issue - for example premature...
-
Comparison of Lithuanian and Polish Consonant Phonemes Based on Acoustic Analysis – Preliminary Results
PublicationThe goal of this research is to find a set of acoustic parameters that are related to differences between Polish and Lithuanian language consonants. In order to identify these differences, an acoustic analysis is performed, and the phoneme sounds are described as the vectors of acoustic parameters. Parameters known from the speech domain as well as those from the music information retrieval area are employed. These parameters are...
-
Predictions of cervical cancer identification by photonic method combined with machine learning
PublicationCervical cancer is one of the most commonly appearing cancers, which early diagnosis is of greatest importance. Unfortunately, many diagnoses are based on subjective opinions of doctors—to date, there is no general measurement method with a calibrated standard. The problem can be solved with the measurement system being a fusion of an optoelectronic sensor and machine learning algorithm to provide reliable assistance for doctors...
-
Relationship between album cover design and music genres.
PublicationThe aim of the study is to find out whether there exists a relationship between typographic, compositional and coloristic elements of the music album cover design and music contained in the album. The research study involves basic statistical analysis of the manually extracted data coming from the worldwide album covers. The samples represent 34 different music genres, coming from nine countries from around the world. There are...
-
Badanie stanu nawierzchni drogowej z wykorzystaniem uczenia maszynowego
PublicationW artykule opisano budowę systemu informowania o stanie nawierzchni drogowej z wykorzystaniem metod cyfrowego przetwarzania obrazów oraz uczenia maszynowego. Efektem wykonanych prac badawczych jest eksperymentalna platforma, pozwalająca na rejestrację uszkodzeń na drogach, system do analizy, przetwarzania i klasyfikacji danych oraz webowa aplikacja użytkownika do przeglądu stanu nawierzchni w wybranej lokalizacji.
-
Evaluation of aspiration problems in L2 English pronunciation employing machine learning
PublicationThe approach proposed in this study includes methods specifically dedicated to the detection of allophonic variation in English. This study aims to find an efficient method for automatic evaluation of aspiration in the case of Polish second-language (L2) English speakers’ pronunciation when whole words are analyzed instead of particular allophones extracted from words. Sample words including aspirated and unaspirated allophones...
-
The aggregation of objects representing Gdańsk district buildings - scale 1:10000
Open Research DataThe process of automatic generalization is one of the elements of spatial data preparation for the purpose of creating digital cartographic studies. The presented data include a part of the process of generalization of building groups obtained from the national geodesy and cartography resource from BDOT10k (10k topographic database) [1].
-
The aggregation of objects representing buildings in the Kartuzy district - scale 1:10000
Open Research DataThe process of automatic generalization is one of the elements of spatial data preparation for the purpose of creating digital cartographic studies. The presented data include a part of the process of generalization of building groups obtained from the national geodesy and cartography resource from BDOT10k (10k topographic database) [1].
-
The aggregation of objects representing Gdańsk district buildings - scale 1:25000
Open Research DataThe process of automatic generalization is one of the elements of spatial data preparation for the purpose of creating digital cartographic studies. The presented data include a part of the process of generalization of building groups obtained from the national geodesy and cartography resource from BDOT10k (10k topographic database) [1].
-
The aggregation of objects representing buildings in the Kartuzy district - scale 1:25000
Open Research DataThe process of automatic generalization is one of the elements of spatial data preparation for the purpose of creating digital cartographic studies. The presented data include a part of the process of generalization of building groups obtained from the national geodesy and cartography resource from BDOT10k (10k topographic database) [1].
-
Deep learning-based waste detection in natural and urban environments
PublicationWaste pollution is one of the most significant environmental issues in the modern world. The importance of recycling is well known, both for economic and ecological reasons, and the industry demands high efficiency. Current studies towards automatic waste detection are hardly comparable due to the lack of benchmarks and widely accepted standards regarding the used metrics and data. Those problems are addressed in this article by...
-
Behavior Analysis and Dynamic Crowd Management in Video Surveillance System
PublicationA concept and practical implementation of a crowd management system which acquires input data by the set of monitoring cameras is presented. Two leading threads are considered. First concerns the crowd behavior analysis. Second thread focuses on detection of a hold-ups in the doorway. The optical flow combined with soft computing methods (neural network) is employed to evaluate the type of crowd behavior, and fuzzy logic aids detection...
-
Sleep Apnea Detection by Means of Analyzing Electrocardiographic Signal
PublicationObstructive sleep apnea (OSA) is a condition of cyclic, periodic ob-struction (stenosis) of the upper respiratory tract. OSA could be associated with serious cardiovascular problems, such as hypertension, arrhythmias, hearth failure or peripheral vascular disease. Understanding the way of connection between OSA and cardiovascular diseases is important to choose proper treatment strategy. In this paper, we present a method for integrated...
-
Application of Wavelet Transform and Fractal Analysis for Esophageal pH-Metry to Determine a New Method to Diagnose Gastroesophageal Reflux Disease
PublicationIn this paper, a new method for analysing gastroesophageal reflux disease (GERD) is shown. This novel method uses wavelet transform (WT) and wavelet-based fractal analysis (WBFA) on esophageal pH-metry measurements. The esophageal pH-metry is an important diagnostic tool supporting the physician’s work in diagnosing some forms of reflux diseases. Interpreting the results of 24-h pH-metry monitoring is time-consuming, and the conclusions...
-
The aggregation of objects representing Katowice district buildings - scale 1:25000
Open Research DataThe process of automatic generalization is one of the elements of spatial data preparation for the purpose of creating digital cartographic studies. The presented data include a part of the process of generalization of building groups obtained from the national geodesy and cartography resource from BDOT10k (10k topographic database) [1].
-
The aggregation of objects representing Katowice district buildings - scale 1:10000
Open Research DataThe process of automatic generalization is one of the elements of spatial data preparation for the purpose of creating digital cartographic studies. The presented data include a part of the process of generalization of building groups obtained from the national geodesy and cartography resource from BDOT10k (10k topographic database) [1].
-
Distributed Framework for Visual Event Detection in Parking Lot Area
PublicationThe paper presents the framework for automatic detection of various events occurring in a parking lot basing on multiple camera video analysis. The framework is massively distributed, both in the logical and physical sense. It consists of several entities called node stations that use XMPP protocol for internal communication and SRTP protocol with Jingle extension for video streaming. Recognized events include detecting parking...
-
Playback detection using machine learning with spectrogram features approach
PublicationThis paper presents 2D image processing approach to playback detection in automatic speaker verification (ASV) systems using spectrograms as speech signal representation. Three feature extraction and classification methods: histograms of oriented gradients (HOG) with support vector machines (SVM), HAAR wavelets with AdaBoost classifier and deep convolutional neural networks (CNN) were compared on different data partitions in respect...