Search results for: RECONSTRUCTION OF SPEECH SIGNALS
-
Detection of impulsive disturbances in archive audio signals
Publication: In this paper the problem of detection of impulsive disturbances in archive audio signals is considered. It is shown that semi-causal/noncausal solutions based on joint evaluation of signal prediction errors and leave-one-out signal interpolation errors allow one to noticeably improve detection results compared to prediction-only solutions. The proposed approaches are evaluated on a set of clean audio signals contaminated...
-
SYNTHESIZING MEDICAL TERMS – QUALITY AND NATURALNESS OF THE DEEP TEXT-TO-SPEECH ALGORITHM
Publication: The main purpose of this study is to develop a deep text-to-speech (TTS) algorithm intended for an embedded system device. First, a critical literature review of state-of-the-art deep speech synthesis models is provided. The algorithm implementation covers both hardware and algorithmic solutions. The algorithm is designed for use with the Raspberry Pi 4 board. 80 synthesized sentences were prepared based on medical and everyday...
-
A survey of automatic speech recognition deep models performance for Polish medical terms
Publication: Among the numerous applications of speech-to-text technology is the support of documentation created by medical personnel. There are many available speech recognition systems for doctors. Their effectiveness in languages such as Polish should be verified. In connection with our project in this field, we decided to check how well the popular speech recognition systems work, employing models trained for the general Polish language....
-
A non-uniform real-time speech time-scale stretching method
Publication: An algorithm for non-uniform real-time speech stretching is presented. It combines the typical SOLA (Synchronous Overlap and Add) algorithm with vowel, consonant and silence detectors. Based on the information about the content and the estimated value of the rate of speech (ROS), the algorithm adapts the scaling factor value. The capability for real-time speech stretching and the resulting voice quality were...
-
Cartographic Representation of Route Reconstruction Results in Video Surveillance System
Publication: The video streams available in a surveillance system distributed over a wide area may be accompanied by metadata obtained as a result of video processing. Many algorithms applied to surveillance systems, e.g. event detection or object tracking, are strictly connected with localization of the object and reconstruction of its route. Drawing related information on a plan of a building or on a map of the city can facilitate the...
-
Silent Signals: The Covert Network Shaping the Future
Publication: In a world dominated by information flow and rapid technological advancements, the existence of hidden networks and unseen influences has never been more relevant. "Silent Signals: The Covert Network Shaping the Future" delves deep into the mysterious and often opaque world of covert communication networks. This influential work sheds light on the silent...
-
A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces
Publication: In this research, a study of cross-linguistic speech emotion recognition is performed. For this purpose, emotional data of different languages (English, Lithuanian, German, Spanish, Serbian, and Polish) are collected, resulting in a cross-linguistic speech emotion dataset of more than 10,000 emotional utterances. Despite the bi-modal character of the databases gathered, our focus is on the acoustic representation...
-
Thermal sequences database of the skin flaps in breast reconstruction and burns
Publication: This paper presents a database of Active Dynamic Thermography (ADT) thermal sequences gathered during a 6-year study on the application of ADT to skin flap blood perfusion monitoring and burn wound diagnosis. For skin flap monitoring, the database comprises data collected during three different breast reconstruction procedures. The patients were monitored pre-, intra- and postoperatively over a 90-day period. The sequences were used...
-
The use of the static thermography in monitoring flap perfusion in breast reconstruction with TRAM flap
Publication: This paper presents the results of static thermography for intraoperative and postoperative imaging of TRAM flap perfusion. The results were compared with the clinical examination of flap perfusion. The study was conducted on a group of 38 female patients who underwent breast reconstruction.
-
Identification of models and signals robust to occasional outliers
Publication: In this paper, estimation algorithms derived in the sense of the least sum of absolute errors are considered for the purpose of identification of models and signals. In particular, the off-line and approximate on-line estimation schemes discussed in the work are aimed at both assessing the coefficients of discrete-time stationary models and tracking the evolution of time-variant characteristics of monitored signals. Interestingly,...
-
Active dynamic thermography method for TRAM flap blood perfusion mapping in breast reconstruction
Publication: This paper presents a new method for mapping blood perfusion in the transverse rectus abdominis musculocutaneous (TRAM) flap, based on active dynamic thermography. The method is aimed at aiding a surgeon during a breast reconstruction procedure. The pair of parameters dTnorm and t90_10 was used as parametric image descriptors of the flap blood perfusion. The method was tested on 38 patients who underwent a breast reconstruction procedure....
-
Wavelet filtering of signals without using model functions
Publication: Effective wavelet filtering of real signals is impossible without determining their shape. The shape of a real signal is related to its wavelet spectrum. For shape analysis, a continuous color wavelet spectrogram of signal level is often used. A disadvantage of the continuous wavelet spectrogram is the complexity of analyzing a blurry color image. A real signal with additive noise strongly distorts the spectrogram based on continuous...
-
Reconstruction of 3D structure of positive corona streamer by local methods
Publication: Computer algorithms were used for reconstruction of the 3D structure of a streamer. We propose a 3D tree structure model of a corona discharge streamer composed of nodes and edges between chosen pairs of nodes, which enables easy computation of some important parameters of streamers. The 3D model can be derived directly from two projection images by global methods like evolutionary searching or particle simulations. In this paper...
-
Seafloor relief reconstruction from side scan sonar data
Publication: Side scan sonar is one of the most widely used imaging systems in the underwater environment. It is relatively cheap and easy to deploy, in comparison with more powerful sensors. Although side scan sonar does not provide seafloor bathymetry directly, its records are directly related to seafloor images. In the paper, a method for 3D seafloor relief reconstruction from side scan sonar data is presented. The method is based on the...
-
Detection of the Direct Sequence Spread Spectrum Signals with BPSK Modulation
Publication: This paper presents a method for detecting DS CDMA signals with BPSK modulation by examining the enhanced signal spectrum density. On the basis of experiments carried out on real radio communication signals, the impact of a narrowband emission occurring in the examined frequency band on the effectiveness of the detection process is shown. The results of the experiment aimed at the detection of the satellite navigation...
-
Pitch estimation of narrowband-filtered speech signal using instantaneous complex frequency
Publication: In this paper we propose a novel method of pitch estimation based on instantaneous complex frequency (ICF). A new iterative algorithm for analysis of the ICF of a speech signal is presented. The results obtained are compared with commonly used methods to demonstrate its accuracy and the connection between ICF and pitch, particularly for narrowband-filtered speech signals.
-
Damage detection in plates based on Lamb wavefront shape reconstruction
Publication: Many of the current studies in the area of damage detection using elastic wave propagation are based on deploying sensor networks with a large number of piezoelectric transducers to detect small-size cracks. A major limitation of these studies is that cracks are usually larger and have different shapes in real cases. Moreover, using a large number of sensing nodes for damage detection is both costly and computationally intensive....
-
Reconstruction of thin films polyazomethine based on microscopic images
Publication: Purpose: The aim of this paper was to investigate changes in the surface morphology of thin films of polyazomethine PPI. Thin films were prepared using the low-temperature chemical vapor deposition (CVD) method. Design/methodology/approach: The changes in surface topography were observed with the atomic force microscope (AFM) and the scanning electron microscope (SEM). The roughness results were prepared in the software WSxM NanoTec Spanish...
-
Automated detection of pronunciation errors in non-native English speech employing deep learning
Publication: Despite significant advances in recent years, the existing Computer-Assisted Pronunciation Training (CAPT) methods detect pronunciation errors with a relatively low accuracy (precision of 60% at 40%-80% recall). This Ph.D. work proposes novel deep learning methods for detecting pronunciation errors in non-native (L2) English speech, outperforming the state-of-the-art method in AUC metric (Area under the Curve) by 41%, i.e., from...
-
An attempt to create a digital reconstruction of the Copper Ship = Próba cyfrowej rekonstrukcji kadłuba wraku Miedziowca
Publication: This study presents an attempt to create a digital reconstruction of the W-5 shipwreck (the Copper Ship) based on data acquired by 3D scanning of structural components held at the National Maritime Museum in Gdańsk and on a physical reconstruction model of the ship’s hull. A digital reconstruction would facilitate analysis of various possible options for the structural design of the hull, and would enable the preparation of a model for...
-
Reception of GNSS Signals Under Jamming Conditions
Publication: The article focuses on the performance of Global Navigation Satellite System receivers in an environment where intentional interference is present. The first part is a general description of GNSS systems. Secondly, types of positioning service disturbances are specified. In the third part, the authors present a scheme of the measurement stand used to evaluate the influence of interference on the reception of navigation signals. Next, research...
-
Weakly-Supervised Word-Level Pronunciation Error Detection in Non-Native English Speech
Publication: We propose a weakly-supervised model for word-level mispronunciation detection in non-native (L2) English speech. To train this model, phonetically transcribed L2 speech is not required and we only need to mark mispronounced words. The lack of phonetic transcriptions for L2 speech means that the model has to learn only from a weak signal of word-level mispronunciations. Because of that and due to the limited amount of mispronounced...
-
Time-frequency analysis of acoustic signals using concentrated spectrogram
Publication: The paper presents an improved method of time-frequency (TF) analysis of discrete-time signals. The method uses the signal's local group delay (LGD) and channelized instantaneous frequency (CIF) to purposely redistribute all Short-time Fourier transform (STFT) lines. Additionally, the energy concentration index (ECI) and some histogram-like statistics are used to evaluate the readability of the estimated TF distributions of the energy. Recorded...
-
Hate speech and the liability of Internet service providers in the case law of European courts = Mowa nienawiści (hate speech) a odpowiedzialność dostawców usług internetowych w orzecznictwie sądów europejskich
Publication: The article analyses the phenomenon of hate speech on the Internet against the problem of the responsibility of Internet Service Providers for such abuses of freedom of expression. The text provides an analysis of the jurisprudence of two European Courts. On the one hand, it presents the position of the European Court of Human Rights on the problem of hate speech: its definition and the liability for it as an exception...
-
Graph Representation Integrating Signals for Emotion Recognition and Analysis
Publication: Data reusability is an important feature of current research in virtually every field of science. Modern research in Affective Computing often relies on datasets containing experiment-derived data such as biosignals, video clips, or images. Moreover, conducting experiments with a vast number of participants to build datasets for Affective Computing research is time-consuming and expensive. Therefore, it is extremely important to...
-
Visual Lip Contour Detection for the Purpose of Speech Recognition
Publication: A method for visual detection of lip contours in frontal recordings of speakers is described and evaluated. The purpose of the method is to facilitate speech recognition with visual features extracted from the mouth region. Different Active Appearance Models are employed for finding lips in video frames and for statistical description of lip shape and texture. A search initialization procedure is proposed and error measure values are...
-
Usage of concentrated spectrogram for analysis of acoustical signals
Publication: A novel, precise method of signal analysis in the time-frequency domain is presented. The signal energy distribution is estimated by discarding and displacing energy parts of the classical spectrogram. A channelized instantaneous frequency and a local group delay are used to relocate the energy. Additionally, newly introduced representations such as a channelized instantaneous bandwidth and a local group duration are used...
-
Analysis and interpretation of radiometric signals in a liquid-gas bubble flow
Publication: The article presents the analysis of signals from a radiometric system consisting of two scintillation probes and two sealed gamma radiation sources. Calculations and interpretation were carried out for the bubble flow of a water-air mixture in a horizontal pipeline. The analysis of the obtained signals was done in the time and frequency domains. In the frequency domain, a range of usable frequencies was identified, which were associated...
-
3D Object Shape Reconstruction from Underwater Multibeam Data and Over Ground Lidar Scanning
Publication: The technologies of sonar and laser scanning are an efficient and widely used source of spatial information for underwater and over-ground environments, respectively. The measurement data are usually available in the form of groups of separate points located irregularly in three-dimensional space, known as point clouds. This data model has known disadvantages, therefore in many applications a different form of representation,...
-
Hybrid of Neural Networks and Hidden Markov Models as a modern approach to speech recognition systems
Publication: The aim of this paper is to present a hybrid algorithm that combines the advantages of artificial neural networks and hidden Markov models in speech recognition for control purposes. The scope of the paper includes a review of currently used solutions, and a description and analysis of the implementation of selected artificial neural network (NN) structures and hidden Markov models (HMM). The main part of the paper consists of a description...
-
Human-computer interactions in speech therapy using a blowing interface
Publication: In this paper we present a new human-computer interface for the quantitative measurement of blowing activities. The interface can measure the air flow and air pressure during the blowing activity. The measured values are stored and used to control the state of the graphical objects in the graphical user interface. In speech therapy, children will find it easier to play attractive therapeutic games than to perform repetitive and tedious,...
-
Annual signals observed in regional GPS networks
Publication: This paper describes analyses concerning annual signals in GPS-derived coordinates. The data were processed at the Military University of Technology Local Analysis Centre with Bernese 5.0 software. We used observations from 129 permanent GPS stations which belong to the Polish Active Geodetic Network (ASG-EUPOS), for the period of GPS weeks 1465-1729, corresponding to about 5 years. The annual signals have been estimated...
-
Respiratory signals derived from capacitive electrocardiogram on the smart chair
Publication: Capacitive electrocardiogram (CECG) measurement is intended to deliver basic cardiac signals without the need for traditional glued electrodes. The paper analyzes the possibility of obtaining ECG-derived respiratory waveforms from the CECG.
-
Processing of LiDAR and Multibeam Sonar Point Cloud Data for 3D Surface and Object Shape Reconstruction
Publication: An unorganised point cloud dataset, as a transitional data model in several applications, usually contains a considerable amount of undesirable irregularities, such as strong variability of local point density, missing data, overlapping points and noise caused by scattering characteristics of the environment. For these reasons, further processing of such data, e.g. for construction of higher order geometric models of the topography...
-
TDOA Navigation Using CDMA2000 Signals – Experimental Results
Publication: This paper presents the results of experiments on the possibility of estimating the position of a CDMA2000 receiver on the basis of TDOA measurements. The hardware and software structure of the navigation receiver used during the investigation is briefly described, with a focus on drawbacks and limitations. The main part of this paper contains basic information about the CDMA2000 network in northern Poland, whose signals were recorded during tests...
-
3D seafloor reconstruction using data from side scan and synthetic aperture sonar
Publication: Side scan and synthetic aperture sonars are widely used imaging systems in the underwater environment. They are relatively cheap and easy to deploy, in comparison with more powerful sensors, like multibeam echosounders. Although side scan and synthetic aperture sonars do not provide seafloor bathymetry directly, their records are finally related to seafloor images. Moreover, the analysis of such images performed by the human eye...
-
Recognition of Emotions in Speech Using Convolutional Neural Networks on Different Datasets
Publication: Artificial Neural Network (ANN) models, specifically Convolutional Neural Networks (CNN), were applied to extract emotions based on spectrograms and mel-spectrograms. This study uses spectrograms and mel-spectrograms to investigate which feature extraction method better represents emotions and how big the differences in efficiency are in this context. The conducted studies demonstrated that mel-spectrograms are a better-suited...
-
Log signals simulation.
Publication: Log simulators (simulators of the devices that measure speed in marine navigation), which are used for testing and for training operators of radars and anti-collision systems, should also have a pulse output, which in a real log comes from the distance-travelled counter, in the form of a set number of pulses per nautical mile. Such a device is a digital-to-frequency converter in the form of a digitally programmed frequency divider....
-
Post‐Second World War Reconstruction of Polish Cities: The Interplay Between Politics and Paradigms
Publication: By the end of the Second World War, many of the Polish cities—and especially their historic centres—were in ruins. This was caused by both bombings and sieges conducted by the Nazis and Soviets. A particular group of cities is associated with former German lands—now called the “Recovered Territories”—which were incorporated into the borders of Poland as compensation for its Eastern Borderlands lost to the Soviet Union. These...
-
Stress Detection of Children With ASD Using Physiological Signals
Publication: This paper proposes a physiological signal-based stress detection approach for children with autism spectrum disorder (ASD) to be used in social and assistive robot intervention. Electrodermal activity (EDA) and blood volume pulse (BVP) signals are collected with an E4 smart wristband from children with ASD in different countries. The peak count and signal amplitude features are derived from the EDA signal and used in order to detect...
-
Comparison of perforator location in dynamic and static thermographic imaging with Doppler ultrasound in breast reconstruction surgery
Publication: This paper compares the effectiveness of the dTnorm and t90_10 parametrizations in dynamic thermography for imaging the location of perforators in TRAM flaps in the intraoperative period. The results were compared with the location detected in a Doppler ultrasound examination. Cold and heat stimulation was used in dynamic thermography. Additionally, these results were compared with static...
-
Database of speech and facial expressions recorded with optimized face motion capture settings
Publication: The broad objective of the present research is the analysis of spoken English employing a multiplicity of modalities. An important stage of this process, discussed in the paper, is creating a database of speech accompanied by facial expressions. Recordings of speakers were made using an advanced system for capturing facial muscle motion. A brief historical outline, current applications, limitations and the ways of capturing face...
-
Detection of dialogue in movie soundtrack for speech intelligibility enhancement
Publication: A method for detecting dialogue in a 5.1 movie soundtrack based on interchannel spectral disparity is presented. The front channel signals (left, right, center) are analyzed in the frequency domain. The selected partials in the center channel signal, which yield high disparity with the left and right channels, are detected as dialogue. Subsequently, the dialogue frequency components are boosted to achieve increased dialogue intelligibility....
-
The Impact of Foreign Accents on the Performance of Whisper Family Models Using Medical Speech in Polish
Publication: The article presents preliminary experiments investigating the impact of accent on the performance of the Whisper automatic speech recognition (ASR) system, specifically for the Polish language and medical data. The literature review revealed a scarcity of studies on the influence of accents on speech recognition systems in Polish, especially concerning medical terminology. The experiments involved voice cloning of selected individuals...
-
Estimation of the short-term predictor parameters of speech under noisy conditions
Publication
-
Using concentrated spectrogram for analysis of audio acoustic signals
Publication: The paper presents the results of time-frequency analysis of audio acoustic signals using the Concentrated Spectrogram method, also known as the "cross-spectral method" or "reassignment method". The presented algorithm uses the signal's local group delay and channelized instantaneous frequency to appropriately redistribute all Short-time Fourier transform lines in the time-frequency plane. The main intention of the paper is to compare various...
-
APPLICATION OF VIBRATION SIGNALS IN RAILWAY TRACK DIAGNOSTICS USING A MOBILE RAILWAY PLATFORM
Publication: The article presents a comprehensive method for using vibration signals to diagnose railway tracks. The primary objective is to gather detailed information on track conditions through a passive experiment. This involves using mobile diagnostic tools and techniques to assess railway infrastructure. The article elaborates on the range of diagnostic activities conducted in accordance with detailed railway regulations and highlights...