Search results for: RECONSTRUCTION OF SPEECH SIGNALS
-
The accuracy of pulse rate estimation from the sequence of face images
Publication: The goal of this paper is to analyze the accuracy of pulse rate estimation from the sequence of face images. Simulated and real signals were used to evaluate two pulse rate estimators: one operating in the frequency domain and the other in the time domain, using the autocorrelation function. The results show that the mean difference between the reference measurements and the estimated pulse rate values is about 2 bpm. In the analysis of short...
-
Application of Maximum Length Sequence in Silent Sonar
Publication: Silent sonars are designed to reduce the distance over which their sounding pulses can be detected by intercept sonars. To meet this objective, we can use periodic sounding signals that have low power, a very long duration and a wide spectrum. If used in the silent sonar's receiver, matched filtering ensures very good detection of motionless or slow-moving targets. However, it is more difficult to detect echo signals...
-
A Multi-Antenna Scheme for Early Detection and Mitigation of Intermediate GNSS Spoofing
Publication: This article presents a method for detecting and mitigating intermediate GNSS spoofing. In this type of attack, at its early stage, a spoofer transmits counterfeit signals which have slight time offsets compared to true signals arriving from satellites. The anti-spoofing method proposed in this article fuses antenna array processing techniques with a multipath detection algorithm. The latter is necessary to separate highly correlated...
-
Automatic Clustering of EEG-Based Data Associated with Brain Activity
Publication: The aim of this paper is to present a system for automatically assigning electroencephalographic (EEG) signals to appropriate classes associated with brain activity. The EEG signals are acquired from a headset consisting of 14 electrodes placed on the skull. The gathered data are first processed by the Independent Component Analysis algorithm to obtain estimates of signals generated by primary sources reflecting the activity of the brain....
-
Playback detection using machine learning with spectrogram features approach
Publication: This paper presents a 2D image processing approach to playback detection in automatic speaker verification (ASV) systems, using spectrograms as the speech signal representation. Three feature extraction and classification methods: histograms of oriented gradients (HOG) with support vector machines (SVM), Haar wavelets with an AdaBoost classifier, and deep convolutional neural networks (CNN) were compared on different data partitions in respect...
-
Application of Shape From Shading Technique for Side Scan Sonar Images
Publication: Side scan sonar (SSS) is one of the most widely used imaging systems in the underwater environment. It is relatively cheap and easy to deploy in comparison with more powerful sensors such as a multibeam echosounder or synthetic aperture sonar. Nevertheless, the SSS does not directly provide seafloor bathymetry measurements. Its outputs are usually in the form of grey-level acoustic images of the seafloor. However, the analysis of such images...
-
MATCHED FILTER APPROACH FOR MICROSEISMIC SIGNAL PROCESSING OF REAL DATA FROM EAST POMERANIA SHALE GAS
Publication: Microseismic monitoring is a method of monitoring fracture propagation during the hydraulic fracturing (HF) process. An array of several hundred geophones is placed on the surface to record small ground tremors induced by the fracturing process. Filtering and summation of the geophone signals are essential to identify and locate fracturing events underground. The authors propose a method of matched filtering, that is usually...
-
Direct spectrum detection based on Bayesian approach
Publication: The paper investigates the performance of the Bayesian framework for direct detection of spectrum parameters from compressive measurements. The signal reconstruction stage is eliminated by the Bayesian Compressive Sensing algorithm, which greatly reduces the computational complexity and processing time. The computational efficiency of the presented procedure is significantly...
-
Using Different Information Channels for Affect-Aware Video Games - A Case Study
Publication: This paper presents the problem of creating affect-aware video games that use different information channels, such as image, video, physiological signals, input devices, and player’s behaviour, for emotion recognition. The presented case studies of three affect-aware games show certain conditions and limitations for using specific signals to recognize emotions and lead to interesting conclusions.
-
Verification of Satellite Railway Track Position Measurements Making Use of Standard Co-Ordinate Determination Techniques
Publication: The article presents the results of satellite railway track position measurements performed by a multidisciplinary research team whose members represented Gdansk University of Technology and Gdynia Maritime University. The measuring methods used for reconstructing the railway track axis position and diagnosing railway track geometry deformations are described. In addition, the description of the novel method...
-
Detection and Mitigation of GPS Spoofing Based on Antenna Array Processing
Publication: In this article, the authors present an application of spatial processing methods for GPS spoofing detection and mitigation. In the first part of the article, a spoofing detection method based on phase delay measurements is proposed. The accuracy and precision of phase delay estimation are assessed for various qualities of the received signal. Spoofing detection thresholds are determined. The efficiency of this method is evaluated in terms of...
-
Experimental Extraction of Secure Correlations from a Noisy Private State
Publication: We report experimental generation of a noisy entangled four-photon state that exhibits a separation between the secure key contents and distillable entanglement, a hallmark feature of the recently established quantum theory of private states. The privacy analysis, based on the full tomographic reconstruction of the prepared state, is utilized in a proof-of-principle key generation. The inferiority of distillation-based strategies...
-
Application of ANN and PCA to two-phase flow evaluation using radioisotopes
Publication: In two-phase flow measurements, a method involving the absorption of gamma radiation can be applied, among others. Analysis of the signals from the scintillation probes can be used to determine a number of flow parameters and to recognize the flow structure. Three types of flow regime, namely plug, bubble, and transitional plug-bubble flows, were considered in this work. The article shows how features of the signals in the time and...
-
Portable Raman spectrometer with two excitation wavelengths
Publication: Selected problems in the development of a portable Raman spectrometer with small size, reduced power consumption and robust construction are shown. The device, intended for semiskilled personnel, uses two lasers: 785 nm and 355 nm. Results of preliminary tests are presented. Data processing procedures, problems related to acquiring Raman signals through packages, and the influence of interfering signals are also discussed.
-
Evaluation Criteria for Affect-Annotated Databases
Publication: In this paper, a set of comprehensive evaluation criteria for affect-annotated databases is proposed. These criteria can be used to evaluate the quality of a database at the stage of its creation, as well as to evaluate and compare existing databases. The usefulness of these criteria is demonstrated on several databases selected from the affective computing domain. The databases contain different kinds of data: video or still...
-
Intelligent multimedia solutions supporting special education needs.
Publication: The role of computers in school education is briefly discussed. The development history of multimodal interfaces is briefly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including an interactive electronic whiteboard based on video image analysis, an application for controlling computers with facial expressions, and a speech-stretching audio interface representing the audio modality....
-
Intelligent video and audio applications for learning enhancement
Publication: The role of computers in school education is briefly discussed. The development history of multimodal interfaces is briefly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including an interactive electronic whiteboard based on video image analysis, an application for controlling computers with facial expressions, and a speech-stretching audio interface representing the audio modality....
-
Wavelet-based denoising method for real phonocardiography signal recorded by mobile devices in noisy environment
Publication: The main obstacle in the development of intelligent autodiagnosis medical systems based on the analysis of phonocardiography (PCG) signals is noise. The noise can be caused by digestive and respiratory sounds, movements, or even signals from the surrounding environment, and it is characterized by a wide frequency and intensity spectrum. This spectrum overlaps the spectrum of the heart tones, which makes the problem of filtering PCG signals complex....
-
Time Delay Estimation in Two-Phase Flow Investigation Using the γ-Ray Attenuation Technique.
Publication: Time delay estimation is an important research question with many applications in a range of technologies. Measurement of a two-phase flow in a pipeline or an open channel using radioisotopes is an example of such an application. For instance, the determination of the velocity of the dispersed phase in that case is based on estimation of the time delay between two stochastic signals provided by scintillation probes. The proper analysis...
-
The post-war chapter in the history of the architecture and development of the port of Gdynia (Powojenny rozdział w historii architektury i zagospodarowania portu gdyńskiego)
Publication: The port of Gdynia, considered one of the largest civil construction projects of the 20th century in Europe, was built at a rapid pace over only a dozen or so years, from the mid-1920s to the end of the 1930s. Unusual port engineering solutions from the construction period, as well as outstanding architectural works in the industrial area, quickly became architectural symbols of economic growth and the general rebirth...
-
Projects of noise level on railway lines
Open Research Data: Projects of acoustic signals from railway lines for further analysis in the BK Connect program.
-
projects of noise level on railway lines
Open Research Data: Projects of acoustic signals from railway lines for further analysis in the BK Connect software.
-
On Radar DoA Estimation and Tilted Rotating Electronically Scanned Arrays
Publication: We consider DoA estimation in a monopulse radar system employing a tilted rotating array. We investigate the case of nonzero steering angles, in which case the mapping between the target’s azimuth and elevation in the global coordinate system and their counterparts in the array local coordinate system becomes increasingly nonlinear and coupled. Since estimating the azimuth using coherently integrated signals might be difficult because...
-
Bimodal deep learning model for subjectively enhanced emotion classification in films
Publication: This research delves into the concept of color grading in film, focusing on how color influences the emotional response of the audience. The study commenced by recalling state-of-the-art works that process audio-video signals and associated emotions by machine learning. Then, assumptions of subjective tests for refining and validating an emotion model for assigning specific emotional labels to selected film excerpts were presented....
-
Problems in estimation of hand grip force based on EMG signal
Publication: There has recently been a significant increase in the number of publications on, and applications of, bioelectric signals for diagnostic purposes. While the use of ECG (electrocardiography) is not surprising, signals recorded from brain activity (EEG) and muscle activity (EMG) are still finding new applications in various fields. The authors focus on the use of EMG signals for estimating hand grip force. Currently,...
-
Research on and adaptation possibilities for the ruins of the church in Wocławy (Badania i możliwości adaptacji ruin kościoła we Wocławach)
Publication: The article describes the results of research on the ruins of the 14th-century church in Wocławy. It also presents the problem of adapting the ruins of sacred buildings, the possibilities for their development, and possible variants for rebuilding the ruins. After the Thirteen Years' War, the church was extended with side aisles and a tower. The layout of the church was reduced during reconstruction after a fire in the 18th century. During World War II the church was destroyed. It has not been rebuilt to this day...
-
Deep neural networks for data analysis
e-Learning Courses: The aim of the course is to familiarize students with the methods of deep learning for advanced data analysis. Typical areas of application of these types of methods include image classification, speech recognition and natural language understanding.
-
Analyzing sets of phylogenetic trees obtained from Bayesian MCMC process using topology metrics
Publication: The reconstruction of evolutionary trees is one of the primary objectives in phylogenetics. Such a tree represents the historical evolutionary relationships between different species or organisms. Tree comparisons are used for multiple purposes, from unveiling the history of species to deciphering evolutionary associations among organisms and geographical areas. In the paper, we describe a general method for comparing phylogenetic trees....
-
Szymon Andrzejewski dr
People: Master's degree from the University of Gdańsk in 2008, majoring in political systems and self-government. Postgraduate studies at the Gdańsk University of Technology ("Management and evaluation of projects financed from EU funds") and at AGH University of Science and Technology (protection against noise and vibration). PhD student in sociology at the University of Gdańsk since 2016. The research scope is democracy and institutions...
-
Experimental and numerical analysis of wave propagation in ground anchors
Publication: The article focuses on the wave propagation phenomenon in ground anchors. The main aim of the investigation is the non-destructive diagnostics and assessment of the state of ground anchors using the guided wave propagation method. Laboratory models of anchors with different lengths of the anchor body were tested, and voltage signals of propagating waves were registered at several locations. For all tested specimens corresponding...
-
Analysis-by-synthesis paradigm evolved into a new concept
Publication: This work aims to show how the well-known analysis-by-synthesis paradigm has recently evolved into a new concept. However, in contrast to the original idea stating that the created sound should not fail to pass the foolproof synthesis test, the recent development is a consequence of the need to create new data. Deep learning models are greedy algorithms requiring a vast amount of data that, in addition, should be correctly...
-
Minimizing Distribution and Data Loading Overheads in Parallel Training of DNN Acoustic Models with Frequent Parameter Averaging
Publication: In the paper, we investigate the performance of parallel deep neural network training with parameter averaging for acoustic modeling in Kaldi, a popular automatic speech recognition toolkit. We describe experiments based on training a recurrent neural network with 4 layers of 800 LSTM hidden states on a 100-hour corpus of annotated Polish speech data. We propose an MPI-based modification of the training program which minimizes the...
-
MODALITY corpus - SPEAKER 35 - COMMANDS C1
Open Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 21 - SEQUENCE S6
Open Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 21 - COMMANDS C5
Open Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 21 - SEQUENCE S4
Open Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 10 - SEQUENCE S1
Open Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 01 - SEQUENCE S2
Open Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 39 - COMMANDS C1
Open Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 01 - SEQUENCE S3
Open Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 01 - COMMANDS C3
Open Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 21 - SEQUENCE S2
Open Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 33 - SEQUENCE S1
Open Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 01 - COMMANDS C2
Open Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 21 - COMMANDS C3
Open Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 01 - SEQUENCE S4
Open Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 01 - SEQUENCE S6
Open Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 21 - SEQUENCE S5
Open Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 01 - COMMANDS C4
Open Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 21 - COMMANDS C4
Open Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...