Search results for: training set
-
Creating neural models using an adaptive algorithm for optimal size of neural network and training set.
PublicationZaprezentowano adaptacyjny algorytm generujący modele neuronowe liniowych układów mikrofalowych, zdolny do oszacowania optymalnego rozmiaru zbiory uczącego i sieci neuronowej. Stworzono kilka modeli nieciągłości falowodowych i mokropaskowych, a następnie zweryfikowano ich poprawność porównując wyniki analiz metodą dopasowania rodzajów i metodą momentów filtrów pasmowo-przepustowych.
-
Rediscovering Automatic Detection of Stuttering and Its Subclasses through Machine Learning—The Impact of Changing Deep Model Architecture and Amount of Data in the Training Set
PublicationThis work deals with automatically detecting stuttering and its subclasses. An effective classification of stuttering along with its subclasses could find wide application in determining the severity of stuttering by speech therapists, preliminary patient diagnosis, and enabling communication with the previously mentioned voice assistants. The first part of this work provides an overview of examples of classical and deep learning...
-
Minimizing Distribution and Data Loading Overheads in Parallel Training of DNN Acoustic Models with Frequent Parameter Averaging
PublicationIn the paper we investigate the performance of parallel deep neural network training with parameter averaging for acoustic modeling in Kaldi, a popular automatic speech recognition toolkit. We describe experiments based on training a recurrent neural network with 4 layers of 800 LSTM hidden states on a 100-hour corpora of annotated Polish speech data. We propose a MPI-based modification of the training program which minimizes the...
-
Real and Virtual Instruments in Machine Learning – Training and Comparison of Classification Results
PublicationThe continuous growth of the computing power of processors, as well as the fact that computational clusters can be created from combined machines, allows for increasing the complexity of algorithms that can be trained. The process, however, requires expanding the basis of the training sets. One of the main obstacles in music classification is the lack of high-quality, real-life recording database for every instrument with a variety...
-
Vehicle detector training with minimal supervision
PublicationRecently many efficient object detectors based on convolutional neural networks (CNN) have been developed and they achieved impressive performance on many computer vision tasks. However, in order to achieve practical results, CNNs require really large annotated datasets for training. While many such databases are available, many of them can only be used for research purposes. Also some problems exist where such datasets are not...
-
Color-based Detection of Bleeding in Endoscopic Images
PublicationIn this paper a color descriptor designed for bleeding detection in endoscopic images is proposed. The development of the algorithm was carried out on a representative training set of 36 images of bleeding and 25 clear images. Another 38 bleeding and 26 normal images were used in the final stage as a test set. All of the considered images were extracted from separate endoscopic examinations. The experiments include color distribution...
-
Active Learning Based on Crowdsourced Data
PublicationThe paper proposes a crowdsourcing-based approach for annotated data acquisition and means to support Active Learning training approach. In the proposed solution, aimed at data engineers, the knowledge of the crowd serves as an oracle that is able to judge whether the given sample is informative or not. The proposed solution reduces the amount of work needed to annotate large sets of data. Furthermore, it allows a perpetual increase...
-
Semantic segmentation training using imperfect annotations and loss masking
PublicationOne of the most significant factors affecting supervised neural network training is the precision of the annotations. Also, in a case of expert group, the problem of inconsistent data annotations is an integral part of real-world supervised learning processes, well-known to researchers. One practical example is a weak ground truth delineation for medical image segmentation. In this paper, we have developed a new method of accurate...
-
LEGO bricks for training classification network
Open Research DataThe data set contains images of 447 different classes of LEGO bricks used for training LEGO bricks classification network. The dataset contains two types of images: photos (10%) and renders (90%) aggregated into respective directories. Each directory (photos and renders) contains 447 directories labeled as the official brick type number. The images...
-
Frequency response spectra applied to assess efficiency of the training techniques
PublicationThe purpose of the research is to assess the increase of the muscle strength and power. Movement of the human body when the moving one impacts a stationary or moving body is taken under consideration. The waveform produced by an impact is transformed into frequency domain. The acceleration record is transformed as a complex spectrum, by the use of a Discrete Fourier Transformation. In this paper the applications of the discrete...
-
Biometric identity verification
PublicationThis chapter discusses methods which are capable of protecting automatic speaker verification systems (ASV) from playback attacks. Additionally, it presents a new approach, which uses computer vision techniques, such as the texture feature extraction based on Local Ternary Patterns (LTP), to identify spoofed recordings. We show that in this case training the system with large amounts of spectrogram patches may be difficult, and...
-
The impact of the AC922 Architecture on Performance of Deep Neural Network Training
PublicationPractical deep learning applications require more and more computing power. New computing architectures emerge, specifically designed for the artificial intelligence applications, including the IBM Power System AC922. In this paper we confront an AC922 (8335-GTG) server equipped with 4 NVIDIA Volta V100 GPUs with selected deep neural network training applications, including four convolutional and one recurrent model. We report...
-
Texture Features for the Detection of Playback Attacks: Towards a Robust Solution
PublicationThis paper describes the new version of a method that is capable of protecting automatic speaker verification (ASV) systems from playback attacks. The presented approach uses computer vision techniques, such as the texture feature extraction based on Local Ternary Patterns (LTP), to identify spoofed recordings. Our goal is to make the algorithm independent from the contents of the training set as much as possible; we look for the...
-
AITP - AI Thermal Pedestrians Dataset
Open Research DataAITP is a pedestrian detection dataset consisting of 9178 annotated thermal images. The training set contains 7801 images on which15448 pedestrians were labeled. The test set has 1377 images on which 2731 objects were marked. All images are in PNG file format (120x160) captured with FLIR Lepton Thermal Camera on the streets of Gdańsk, Poland. All pedestrians...
-
Performance improvement of NN based RTLS by customization of NN structure - heuristic approach
PublicationThe purpose of this research is to improve performance of the Hybrid Scene Analysis – Neural Network indoor localization algorithm applied in Real-time Locating System, RTLS. A properly customized structure of Neural Network and training algorithms for specific operating environment will enhance the system’s performance in terms of localization accuracy and precision. Due to nonlinearity and model complexity, a heuristic analysis...
-
Musical Instrument Identification Using Deep Learning Approach
PublicationThe work aims to propose a novel approach for automatically identifying all instruments present in an audio excerpt using sets of individual convolutional neural networks (CNNs) per tested instrument. The paper starts with a review of tasks related to musical instrument identification. It focuses on tasks performed, input type, algorithms employed, and metrics used. The paper starts with the background presentation, i.e., metadata...
-
Horizon Europe proposals - Administrative Part
Open Research DataThe dataset contains data collected during the HE National Contact Point training on Oct. 12, 2022, reg. the administrative part of Horizon Europe grant proposals. The data set includes presentations concerning administrative forms of 2022 proposals and their content, including participant data; information about abstract writing, keyword choice and...
-
Active Annotation in Evaluating the Credibility of Web-Based Medical Information: Guidelines for Creating Training Data Sets for Machine Learning
PublicationMethods Results Discussion References Abbreviations Copyright Abstract Background: The spread of false medical information on the web is rapidly accelerating. Establishing the credibility of web-based medical information has become a pressing necessity. Machine learning offers a solution that, when properly deployed, can be an effective tool in fighting medical misinformation on the web. Objective: The aim of this study is to...
-
FEEDB: A multimodal database of facial expressions and emotions
PublicationIn this paper a first version of a multimodal FEEDB database of facial expressions and emotions is presented. The database contains labeled RGB-D recordings of people expressing a specific set of expressions that have been recorded using Microsoft Kinect sensor. Such a database can be used for classifier training and testing in face recognition as well as in recognition of facial expressions and human emotions. Also initial experiences...
-
Emotion Recognition and Its Applications
PublicationThe paper proposes a set of research scenarios to be applied in four domains: software engineering, website customization, education and gaming. The goal of applying the scenarios is to assess the possibility of using emotion recognition methods in these areas. It also points out the problems of defining sets of emotions to be recognized in different applications, representing the defined emotional states, gathering the data and...
-
Low-cost data-driven modelling of microwave components using domain confinement and PCA-based dimensionality reduction
PublicationFast data-driven surrogate models can be employed as replacements of computationally demanding full-wave electromagnetic simulations to facilitate the microwave design procedures. Unfortunately, practical application of surrogate modelling is often hindered by the curse of dimensionality and/or considerable nonlinearity of the component characteristics. This paper proposes a simple yet reliable approach to cost-efficient modelling...
-
Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention
PublicationThis paper describes two novel complementary techniques that improve the detection of lexical stress errors in non-native (L2) English speech: attention-based feature extraction and data augmentation based on Neural Text-To-Speech (TTS). In a classical approach, audio features are usually extracted from fixed regions of speech such as the syllable nucleus. We propose an attention-based deep learning model that automatically de...
-
Kamila Kokot-Kanikuła mgr
PeopleKamila Kokot-Kanikuła is a digital media senior librarian at Gdańsk University of Technology (GUT) Library. She works in Digital Archive and Multimedia Creation Department and her main areas of interests include early printed books, digital libraries, Open Access and Open Science. In the Pomeranian Digital Library (PDL) Project she is responsible for creating annual digital plans, transferring files on digital platform, and promoting...
-
Creating new voices using normalizing flows
PublicationCreating realistic and natural-sounding synthetic speech remains a big challenge for voice identities unseen during training. As there is growing interest in synthesizing voices of new speakers, here we investigate the ability of normalizing flows in text-to-speech (TTS) and voice conversion (VC) modes to extrapolate from speakers observed during training to create unseen speaker identities. Firstly, we create an approach for TTS...
-
Cost‐efficient performance‐driven modelling of multi‐band antennas by variable‐fidelity electromagnetic simulations and customized space mapping
PublicationElectromagnetic (EM) simulations have become an indispensable tool in the design of contemporary antennas. EM‐driven tasks, for example, parametric optimization, entail considerable computational efforts, which may be reduced by employing surrogate models. Yet, data‐driven modelling of antenna characteristics is largely hindered by the curse of dimensionality. This may be addressed using the recently reported domain‐confinement...
-
Towards Scalable Simulation of Federated Learning
PublicationFederated learning (FL) allows to train models on decentralized data while maintaining data privacy, which unlocks the availability of large and diverse datasets for many practical applications. The ongoing development of aggregation algorithms, distribution architectures and software implementations aims for enabling federated setups employing thousands of distributed devices, selected from millions. Since the availability of...
-
Tagged images with LEGO bricks - Technic Bricks
Open Research DataThe set contains images of LEGO bricks (from Technic Bricks category). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.
-
Tagged images with LEGO bricks - Transportation - Sea and Air
Open Research DataThe set contains images of LEGO bricks (from Transportation - Sea and Air category). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.
-
Tagged images with LEGO bricks - Bionicle Hero Factory and Constraction
Open Research DataThe set contains images of LEGO bricks (from Bionicle Hero Factory and Constraction category). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.
-
Tagged images with LEGO bricks - Windows and Doors
Open Research DataThe set contains images of LEGO bricks (from Windows and Doors category). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.
-
Local Texture Pattern Selection for Efficient Face Recognition and Tracking
PublicationThis paper describes the research aimed at finding the optimal configuration of the face recognition algorithm based on local texture descriptors (binary and ternary patterns). Since the identification module was supposed to be a part of the face tracking system developed for interactive wearable computer, proper feature selection, allowing for real-time operation, became particularly important. Our experiments showed that it is...
-
Evaluating Performance and Accuracy Improvements for Attention-OCR
PublicationIn this paper we evaluated a set of potential improvements to the successful Attention-OCR architecture, designed to predict multiline text from unconstrained scenes in real-world images. We investigated the impact of several optimizations on model’s accuracy, including employing dynamic RNNs (Recurrent Neural Networks), scheduled sampling, BiLSTM (Bidirectional Long Short-Term Memory) and a modified attention model. BiLSTM was...
-
Multiobjective Aerodynamic Optimization by Variable-Fidelity Models and Response Surface Surrogates
PublicationA computationally efficient procedure for multiobjective design optimization with variable-fidelity models and response surface surrogates is presented. The proposed approach uses the multiobjective evolutionary algorithm that works with a fast surrogate model, obtained with kriging interpolation of the low-fidelity model data enhanced by space-mapping correction exploiting a few high-fidelity training points. The initial Pareto...
-
Automated Classifier Development Process for Recognizing Book Pages from Video Frames
PublicationOne of the latest developments made by publishing companies is introducing mixed and augmented reality to their printed media (e.g. to produce augmented books). An important computer vision problem that they are facing is classification of book pages from video frames. The problem is non-trivial, especially considering that typical training data is limited to only one digital original per book page, while the trained classifier...
-
Images of LEGO bricks
Open Research DataThe set contains images of LEGO bricks (from multiple categories). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.
-
Accurate Modeling of Antenna Structures by Means of Domain Confinement and Pyramidal Deep Neural Networks
PublicationThe importance of surrogate modeling techniques has been gradually increasing in the design of antenna structures over the recent years. Perhaps the most important reason is a high cost of full-wave electromagnetic (EM) analysis of antenna systems. Although imperative in ensuring evaluation reliability, it entails considerable computational expenses. These are especially pronounced when carrying out EM-driven design tasks such...
-
Domain segmentation for low-cost surrogate-assisted multi-objective design optimisation of antennas
PublicationAbstract: Information regarding the best possible design trade-offs of an antenna structure can be obtained through multiobjective optimisation (MO). Unfortunately, MO is extremely challenging if full-wave electromagnetic (EM) simulation models are used for performance evaluation. Yet, for the majority of contemporary antennas, EM analysis is the only tool that ensures reliability. This study introduces a procedure for accelerated...
-
Recognition of hazardous acoustic events employing parallel processing on a supercomputing cluster . Rozpoznawanie niebezpiecznych zdarzeń dźwiękowych z wykorzystaniem równoległego przetwarzania na klastrze superkomputerowym
PublicationA method for automatic recognition of hazardous acoustic events operating on a super computing cluster is introduced. The methods employed for detecting and classifying the acoustic events are outlined. The evaluation of the recognition engine is provided: both on the training set and using real-life signals. The algorithms yield sufficient performance in practical conditions to be employed in security surveillance systems. The...
-
Low-Cost Multi-Objective Optimization Yagi-Uda Antenna in Multi-Dimensional Parameter Space
PublicationA surrogate-based technique for fast multi-objective optimization of a multi-parameter planar Yagi-Uda antenna structure is presented. The proposed method utilizes response surface approximation (RSA) models constructed using training samples obtained from evaluation of the low-fidelity antenna model. Utilization of the RSA models allowsfor fast determination of the best possible trade-offs between conflicting objectives in multi-objective...
-
Pose classification in the gesture recognition using the linear optical sensor
PublicationGesture sensors for mobile devices, which have a capability of distinguishing hand poses, require efficient and accurate classifiers in order to recognize gestures based on the sequences of primitives. Two methods of poses recognition for the optical linear sensor were proposed and validated. The Gaussian distribution fitting and Artificial Neural Network based methods represent two kinds of classification approaches. Three types...
-
Tagged images with bees
Open Research DataImages taken from bee hive with tagged bees. The images are prepared for training yolo5 deep neural network (supplied with the data).
-
Video of LEGO bricks on conveyor belt - flags and signs
Open Research DataThe set contains videos of LEGO bricks (flags and signs) moving on a white conveyor belt. The images were prepared for training neural network for recognition of LEGO bricks. The bricks were separated as much as possible and in most cases they should not overlap. The images were taken from different sides by stationary camera located over the final...
-
Tagged images with LEGO bricks - Bricks Sloped
Open Research DataThe set contains images of LEGO bricks (from Bricks Sloped category). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.
-
Tagged images with LEGO bricks - Tiles
Open Research DataThe set contains images of LEGO bricks (from Tiles category). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.
-
Tagged images with LEGO bricks - Windscreens and Fuselage
Open Research DataThe set contains images of LEGO bricks (from Windscreens and Fuselage category). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.
-
Tagged images with LEGO bricks - Bricks Special
Open Research DataThe set contains images of LEGO bricks (from Bricks Special category). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.
-
Tagged images with LEGO bricks - Bricks
Open Research DataThe set contains images of LEGO bricks (from Bricks category). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.
-
Tagged images with LEGO bricks - Technic Beams
Open Research DataThe set contains images of LEGO bricks (from Technic Beams category). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.
-
Tagged images with LEGO bricks - Technic Pins
Open Research DataThe set contains images of LEGO bricks (from Technic Pins category). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.
-
Tagged images with LEGO bricks - Minifig Accessories
Open Research DataThe set contains images of LEGO bricks (from Minifig Accessories category). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.