displaying 1000 best results Help
Search results for: RECONSTRUCTION OF SPEECH SIGNALS
-
Virtual Keyboard controlled by eye gaze employing speech synthesis
PublicationThe article presents the speech synthesis integrated into the eye gaze tracking system. This approach can significantly improve the quality of life of physically disabled people who are unable to communicate. The virtual keyboard (QWERTY) is an interface which allows for entering the text for the speech synthesizer. First, this article describes a methodology of determining the fixation point on a computer screen. Then it presents...
-
Employing flowgraphs for forward route reconstruction in video surveillance system
PublicationPawlak’s flowgraphs were utilized as a base idea and knowledge container for prediction and decision making algorithms applied to experimental video surveillance system. The system is used for tracking people inside buildings in order to obtain information about their appearance and movement. The fields of view of the cameras did not overlap. Therefore, when an object was moving through unsupervised areas, prediction was needed...
-
An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics
PublicationThe speech with the Lombard effect has been extensively studied in the context of speech recognition or speech enhancement. However, few studies have investigated the Lombard effect in the context of speech synthesis. The aim of this paper is to create a mathematical model that allows for retaining the Lombard effect. These models could be used as a basis of a formant speech synthesizer. The proposed models are based on dividing...
-
Detection Range of Intercept Sonar for CWFM Signals
PublicationStealth in military sonars applications may be ensured through the use of low power signals making them difficult to intercept by the enemy. In recent years, silent sonar design has been investigated by the Department of Marine Electronic Systems of the Gdansk University of Technology. This article provides an analysis of how an intercept sonar operated by the enemy can detect silent sonar signals. To that end a theoretical intercept...
-
Selection of excitation signals for high-impedance spectroscopy
PublicationA method of fast impedance spectroscopy of technical objects with high impedance (|Zx| > 1 GOhm) is evaluated in this paper. An object is excited with a signal generated by a digital-to-analog converter (DAC) located on the U2531A DAQ module. Response signals proportional to current flowing through and voltage across the measured object are sampled by analog-to-digital converters (ADC) in the DAQ module. The object impedance spectrum...
-
Detection of impulsive disturbances in archive audio signals
PublicationIn this paper the problem of detection of impulsive disturbances in archive audio signals is considered. It is shown that semi-causal/noncausal solutions based on joint evaluation of signal prediction errors and leave-one-out signal interpolation errors, allow one to noticeably improve detection results compared to the prediction-only based solutions. The proposed approaches are evaluated on a set of clean audio signals contaminated...
-
SYNTHESIZING MEDICAL TERMS – QUALITY AND NATURALNESS OF THE DEEP TEXT-TO-SPEECH ALGORITHM
PublicationThe main purpose of this study is to develop a deep text-to-speech (TTS) algorithm designated for an embedded system device. First, a critical literature review of state-of-the-art speech synthesis deep models is provided. The algorithm implementation covers both hardware and algorithmic solutions. The algorithm is designed for use with the Raspberry Pi 4 board. 80 synthesized sentences were prepared based on medical and everyday...
-
A survey of automatic speech recognition deep models performance for Polish medical terms
PublicationAmong the numerous applications of speech-to-text technology is the support of documentation created by medical personnel. There are many available speech recognition systems for doctors. Their effectiveness in languages such as Polish should be verified. In connection with our project in this field, we decided to check how well the popular speech recognition systems work, employing models trained for the general Polish language....
-
A non-uniform real-time speech time-scale stretching method
PublicationAn algorithm for non-uniform real-time speech stretching is presented. It provides a combination of typical SOLA algorithm (Synchronous Overlap and Add ) with the vowels, consonants and silence detectors. Based on the information about the content and the estimated value of the rate of speech (ROS), the algorithm adapts the scaling factor value. The ability of real-time speech stretching and the resultant quality of voice were...
-
Cartographic Representation of Route Reconstruction Results in Video Surveillance System
PublicationThe video streams available in a surveillance system distributed on the wide area may be accompanied by metadata are obtained as a result of video processing. Many algorithms applied to surveillance systems, e.g. event detection or object tracking, are strictly connected with localization of the object and reconstruction of its route. Drawing related information on a plan of a building or on a map of the city can facilitate the...
-
Silent Signals The Covert Network Shaping the Future
PublicationSilent Signals The Covert Network Shaping the Future In a world dominated by information flow and rapid technological advancements, the existence of hidden networks and unseen influences has never been more relevant. "Silent Signals: The Covert Network Shaping the Future" delves deep into the mysterious and often opaque world of covert communication networks. This influential work sheds light on the silent...
-
A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces
PublicationIn this research, a study of cross-linguistic speech emotion recognition is performed. For this purpose, emotional data of different languages (English, Lithuanian, German, Spanish, Serbian, and Polish) are collected, resulting in a cross-linguistic speech emotion dataset with the size of more than 10.000 emotional utterances. Despite the bi-modal character of the databases gathered, our focus is on the acoustic representation...
-
Emotions in polish speech recordings
Open Research DataThe data set presents emotions recorded in sound files that are expressions of Polish speech. Statements were made by people aged 21-23, young voices of 5 men. Each person said the following words / nie – no, oddaj - give back, podaj – pass, stop - stop, tak - yes, trzymaj -hold / five times representing a specific emotion - one of three - anger (a),...
-
Thermal sequences database of the skin flaps in breast reconstruction and burns
PublicationThis paper presents a database of Active Dynamic Thermography (ADT) thermal sequences gathered throughout 6 year study on ADT application in skin flap blood perfusion monitoring and burn wounds diagnosis. For skin flap monitoring the database comprises of data collected during three different breast reconstruction procedures. The patients were monitored pre, intra and post surgically within 90 days period. The sequences were used...
-
Thermal sequences database of the skin flaps in breast reconstruction and burns
PublicationThis paper presents a database of Active Dynamic Thermography (ADT) thermal sequences gathered throughout 6 year study on ADT application in skin flap blood perfusion monitoring and burn wounds diagnosis. For skin flap monitoring the database comprises of data collected during three different breast reconstruction procedures. The patients were monitored pre, intra and post surgically within 90 days period. The sequences were used...
-
The use of the static thermography in monitoring flap perfusion in breast reconstruction with TRAM flap
PublicationThis paper shows results of the static thermography for intraoperative and postoperative imaging of TRAM flap perfusion. The results were compared with the clinical examination of flap perfusion. The study was conducted on a group of 38 female patients who underwent breast reconstruction.
-
Identification of models and signals robust to occasional outliers
PublicationIn this paper estimation algorithms derived in the sense of the least sum of absolute errors are considered for the purpose of identification of models and signals. In particular, off-line and approximate on-line estimation schemes discussed in the work are aimed at both assessing the coefficients of discrete-time stationary models and tracking the evolution of time-variant characteristics of monitored signals. What is interesting,...
-
Identification of models and signals robust to occasional outliers
PublicationIn this paper estimation algorithms derived in the sense of the least sum of absolute errors are considered for the purpose of identification of models and signals. In particular, off-line and approximate on-line estimation schemes discussed in the work are aimed at both assessing the coefficients of discrete-time stationary models and tracking the evolution of time-variant characteristics of monitored signals. What is interesting,...
-
Active dynamic thermography method for TRAM flap blood perfusion mapping in breast reconstruction
PublicationThis paper presents the new method of the transverse rectus abdominis musculocutaneous flap blood perfusion mapping based on the active dynamic thermography. The method is aimed at aiding a surgeon during breast reconstruction procedure. A pair of dTnorm and t90_10 parameters were used as parametric image descriptors of the flap blood perfusion. The method was tested on 38 patients that were subjected to breast reconstruction procedure....
-
Wavelet filtering of signals without using model functions
PublicationThe effective wavelet filtering of real signals is impossible without determining their shape. The shape of a real signal is related to its wavelet spectrum. For shape analysis, a continuous color wavelet spectrogram of signal level is often used. The disadvantage of continuous wavelet spectrogram is the complexity of analyzing a blurry color image. A real signal with additive noise strongly distorts the spectrogram based on continuous...
-
Advanced Processing of Telecommunications Signals
e-Learning Courses -
Seafloor relief reconstruction from side scan sonar data
PublicationSide scan sonar is one of the most widely used imaging systems in the underwater environment. It is relatively cheap and easy to deploy, in comparison with more powerful sensors. Although side scan sonar does not provide seafloor bathymetry directly, its records are directly related to seafloor images. In the paper, the method for 3D seafloor relief reconstruction from side scan sonar data is presented. The method is based on the...
-
Reconstruction of 3D structure of positive corona streamer by local methods
PublicationThe computer algorithms were used for reconstruction of streamer 3D structure. We propose the 3D tree structure model of corona discharge streamer composed with nodes and edges between chosen couples of nodes, which enables easy computation of some important parameters ofstreamers. The 3D model can be derived directly from two projection images by global methods like evolutionary searching or particle simulations. In this paper...
-
Detection of the Direct Sequence Spread Spectrum Signals with BPSK Modulation
PublicationThis paper presents a method of the DS CDMA signals with BPSK modulation detection through the examination of the enhanced signal spectrum density. On the base of experiments carried out on the real radio communication signals the impact of a narrowband emission occurring in the examined frequency band on the detection process effectiveness was shown. The results of the experiment aimed at the detection of the satellite navigation...
-
Pitch estimation of narrowband-filtered speech signal using instantaneous complex frequency
PublicationIn this paper we propose a novel method of pitch estimation, based on instantaneous complex frequency (ICF). New iterative algorithm for analysis of ICF of speech signal in presented. Obtained results are compared with commonly used methods to prove its accuracy and connection between ICF and pitch, particularly for narrowband-filtered speech signal.
-
Pitch estimation of narrowband-filtered speech signal using instantaneous complex frequency
PublicationIn this paper we propose a novel method of pitch estimation, based on instantaneous complex frequency (ICF). New iterative algorithm for analysis of ICF of speech signal in presented. Obtained results are compared with commonly used methods to prove its accuracy and connection between ICF and pitch, particularly for narrowband-filtered speech signal.
-
Damage detection in plates based on Lamb wavefront shape reconstruction
PublicationMany of the current studies in the area of damage detection using elastic wave propagation are based on deploying sensor networks with a large number of piezoelectric transducers to detect small-size cracks. A major limitation of these studies is that cracks are usually larger and have different shapes in real cases. Moreover, using a large number of sensing nodes for damage detection is both costly and computationally intensive....
-
Reconstruction of thin films polyazomethine based on microscopic images
PublicationPurpose: The aim of this paper was to investigate changes in surface morphology of thin films ofpolyazomethine PPI. Thin films were prepared using low-temperature chemical vapor deposition (CVD)method.Design/methodology/approach: The changes in surface topography was observed by the atomicforce microscope AFM and scanning electron microscope SEM. The results of roughness have beenprepared in the software WSxM NanoTec Spanish...
-
Results of tests on speech intelligibility in reverberant conditions
Open Research DataThe dataset contains the results of tests that aimed to provide a relationship between the rate of speech (RoS) and reverberation conditions characterized by the Speech Transmission Index (STI).
-
Automated detection of pronunciation errors in non-native English speech employing deep learning
PublicationDespite significant advances in recent years, the existing Computer-Assisted Pronunciation Training (CAPT) methods detect pronunciation errors with a relatively low accuracy (precision of 60% at 40%-80% recall). This Ph.D. work proposes novel deep learning methods for detecting pronunciation errors in non-native (L2) English speech, outperforming the state-of-the-art method in AUC metric (Area under the Curve) by 41%, i.e., from...
-
An attempt to create a digital reconstruction of the Copper Ship = Próba cyfrowej rekonstrukcji kadłuba wraku Miedziowca
PublicationThis study presents an attempt to create a digital reconstruction of the W-5 shipwreck (the Copper Ship) based on data acquired by 3D scanning of structural components held at the National Maritime Museum in Gdańsk and on a physical reconstruction model of the ship’s hull. A digital reconstruction would facilitate analysis of various possible options for the structural design of the hull, and would enable the preparation of a model for...
-
Reception of GNSS Signals Under Jamming Conditions
PublicationThe article focuses on performance of Global Navigation Satellite System receivers in environment where intentional interference is present. First part is a general description of GNSS systems. Secondly, types of positioning service disturbances are specified. In the third part authors present a scheme of measurement stand which is used to evaluate the influence of interference on reception of navigation signals. Next, research...
-
Weakly-Supervised Word-Level Pronunciation Error Detection in Non-Native English Speech
PublicationWe propose a weakly-supervised model for word-level mispronunciation detection in non-native (L2) English speech. To train this model, phonetically transcribed L2 speech is not required and we only need to mark mispronounced words. The lack of phonetic transcriptions for L2 speech means that the model has to learn only from a weak signal of word-level mispronunciations. Because of that and due to the limited amount of mispronounced...
-
Time-frequency analysis of acoustic signals using concentrated spectrogram
PublicationThe paper presents improved method of time-frequency (TF) analysis of discrete-time signals. The method involves signal's local group delay (LGD) and channelized instantaneous frequency (CIF) to purposely redistribute all Short-time Fourier transform (STFT) lines. Additionally, the energy concentration index (ECI) and some histogram-like statistics are used to evaluate readability of estimated TF distributions of the energy. Recorded...
-
Mowa nienawiści (hate speech) a odpowiedzialność dostawców usług internetowych w orzecznictwie sądów europejskich
PublicationThe article analyses the phenomenon of hate speech in the Internet contrasted with the problem of responsability of Internet Service Providers for cases of such abuses of freedom of expression. The text provides an analysis of jurisprudence of two European Courts. On the one hand it presents the position of the European Court of Human Rights on the problem of hate speech: its definition and the liability for it as an exception...
-
Graph Representation Integrating Signals for Emotion Recognition and Analysis
PublicationData reusability is an important feature of current research, just in every field of science. Modern research in Affective Computing, often rely on datasets containing experiments-originated data such as biosignals, video clips, or images. Moreover, conducting experiments with a vast number of participants to build datasets for Affective Computing research is time-consuming and expensive. Therefore, it is extremely important to...
-
Visual Lip Contour Detection for the Purpose of Speech Recognition
PublicationA method for visual detection of lip contours in frontal recordings of speakers is described and evaluated. The purpose of the method is to facilitate speech recognition with visual features extracted from a mouth region. Different Active Appearance Models are employed for finding lips in video frames and for lip shape and texture statistical description. Search initialization procedure is proposed and error measure values are...
-
Usage of concentrated spectrogram for analysis of acoustical signals
PublicationA novel precise method of signal analysis in the time-frequency domain is presented. A signal energy distribution is estimated by discard and displacement of energy parts of the classical spectrogram. A channelized instantaneous frequency and a local group delay are used in order to energy replacement. Additionally, newly introduced representations such as: a channelized instantaneous bandwidth and a local group duration are used...
-
Analysis and interpretation of radiometric signals in a liquid-gas bubble flow
PublicationThe article presents the analysis of signals from a radiometric system consisting of two scintillation probes and two gamma radiation sealed sources. Calculations and interpretation were carried out for the bubble flow of the water-air mixture in the horizontal pipeline. The analysis of the obtained signals was done in time and frequency domain. In the frequency domain, a range of usable frequencies was identified, which were associated...
-
Artur Gańcza dr inż.
PeopleI received the M.Sc. degree from the Gdańsk University of Technology (GUT), Gdańsk, Poland, in 2019. I am currently a Ph.D. student at GUT, with the Department of Automatic Control, Faculty of Electronics, Telecommunications and Informatics. My professional interests include speech recognition, system identification, adaptive signal processing and linear algebra.
-
3D Object Shape Reconstruction from Underwater Multibeam Data and Over Ground Lidar Scanning
PublicationThe technologies of sonar and laser scanning are an efficient and widely used source of spatial information with regards to underwater and over ground environment respectively. The measurement data are usually available in the form of groups of separate points located irregularly in three-dimensional space, known as point clouds. This data model has known disadvantages, therefore in many applications a different form of representation,...
-
Hybrid of Neural Networks and Hidden Markov Models as a modern approach to speech recognition systems
PublicationThe aim of this paper is to present a hybrid algorithm that combines the advantages ofartificial neural networks and hidden Markov models in speech recognition for control purpos-es. The scope of the paper includes review of currently used solutions, description and analysis of implementation of selected artificial neural network (NN) structures and hidden Markov mod-els (HMM). The main part of the paper consists of a description...
-
Human-computer interactions in speech therapy using a blowing interface
PublicationIn this paper we present a new human-computer interface for the quantitative measurement of blowing activities. The interface can measure the air flow and air pressure during the blowing activity. The measured values are stored and used to control the state of the graphical objects in the graphical user interface. In speech therapy children will find easier to play attractive therapeutic games than to perform repetitive and tedious,...
-
Strategies in Trauma and Limb Reconstruction
Journals -
Journal of Medical Signals and Sensors
Journals -
MATHEMATICS OF CONTROL SIGNALS AND SYSTEMS
Journals -
Journal of Limb Lengthening & Reconstruction
Journals -
Annual signals observed in regional GPS networks
PublicationAbstract: This paper describes analyses concerning annual signals in GPS-derived coordinates. The data was processed in the Military University of Technology Local Analysis Centre with Bernese 5.0 software. We used observations from 129 permanent GPS stations which belong to the Polish Active Geodetic Network (ASG-EUPOS), for the period of GPS weeks 1465-1729, corresponding to about 5 years. The annual signals have been estimated...
-
Respiratory signals derived from capacitive electrocardiogram on the smart chair
PublicationCapacitive electrocardiogram (CECG) tends to deliver basic cardiac signals without need to use traditional glued electrodes. In the paper analysis of possibility if the ECG derived respiratory waveforms out of the CECG.
-
Processing of LiDAR and Multibeam Sonar Point Cloud Data for 3D Surface and Object Shape Reconstruction
PublicationUnorganised point cloud dataset, as a transitional data model in several applications, usually contains a considerable amount of undesirable irregularities, such as strong variability of local point density, missing data, overlapping points and noise caused by scattering characteristics of the environment. For these reasons, further processing of such data, e.g. for construction of higher order geometric models of the topography...