Search results for: RECONSTRUCTION OF SPEECH SIGNALS - Bridge of Knowledge

Search

Search results for: RECONSTRUCTION OF SPEECH SIGNALS

Filters

total: 1406
filtered: 1033

clear all filters


Chosen catalog filters

  • Category

  • Year

  • Options

clear Chosen catalog filters disabled

Search results for: RECONSTRUCTION OF SPEECH SIGNALS

  • Detection of impulsive disturbances in archive audio signals

    Publication

    In this paper the problem of detection of impulsive disturbances in archive audio signals is considered. It is shown that semi-causal/noncausal solutions based on joint evaluation of signal prediction errors and leave-one-out signal interpolation errors, allow one to noticeably improve detection results compared to the prediction-only based solutions. The proposed approaches are evaluated on a set of clean audio signals contaminated...

    Full text available to download

  • SYNTHESIZING MEDICAL TERMS – QUALITY AND NATURALNESS OF THE DEEP TEXT-TO-SPEECH ALGORITHM

    The main purpose of this study is to develop a deep text-to-speech (TTS) algorithm designated for an embedded system device. First, a critical literature review of state-of-the-art speech synthesis deep models is provided. The algorithm implementation covers both hardware and algorithmic solutions. The algorithm is designed for use with the Raspberry Pi 4 board. 80 synthesized sentences were prepared based on medical and everyday...

    Full text available to download

  • A survey of automatic speech recognition deep models performance for Polish medical terms

    Among the numerous applications of speech-to-text technology is the support of documentation created by medical personnel. There are many available speech recognition systems for doctors. Their effectiveness in languages such as Polish should be verified. In connection with our project in this field, we decided to check how well the popular speech recognition systems work, employing models trained for the general Polish language....

    Full text to download in external service

  • A non-uniform real-time speech time-scale stretching method

    Publication

    An algorithm for non-uniform real-time speech stretching is presented. It provides a combination of typical SOLA algorithm (Synchronous Overlap and Add ) with the vowels, consonants and silence detectors. Based on the information about the content and the estimated value of the rate of speech (ROS), the algorithm adapts the scaling factor value. The ability of real-time speech stretching and the resultant quality of voice were...

  • Cartographic Representation of Route Reconstruction Results in Video Surveillance System

    Publication

    The video streams available in a surveillance system distributed on the wide area may be accompanied by metadata are obtained as a result of video processing. Many algorithms applied to surveillance systems, e.g. event detection or object tracking, are strictly connected with localization of the object and reconstruction of its route. Drawing related information on a plan of a building or on a map of the city can facilitate the...

    Full text to download in external service

  • Silent Signals The Covert Network Shaping the Future

    Publication

    - Year 2007

    Silent Signals The Covert Network Shaping the Future In a world dominated by information flow and rapid technological advancements, the existence of hidden networks and unseen influences has never been more relevant. "Silent Signals: The Covert Network Shaping the Future" delves deep into the mysterious and often opaque world of covert communication networks. This influential work sheds light on the silent...

    Full text available to download

  • A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces

    Publication
    • G. Tamulevicius
    • G. Korvel
    • A. B. Yayak
    • P. Treigys
    • J. Bernataviciene
    • B. Kostek

    - Electronics - Year 2020

    In this research, a study of cross-linguistic speech emotion recognition is performed. For this purpose, emotional data of different languages (English, Lithuanian, German, Spanish, Serbian, and Polish) are collected, resulting in a cross-linguistic speech emotion dataset with the size of more than 10.000 emotional utterances. Despite the bi-modal character of the databases gathered, our focus is on the acoustic representation...

    Full text available to download

  • Thermal sequences database of the skin flaps in breast reconstruction and burns

    Publication

    This paper presents a database of Active Dynamic Thermography (ADT) thermal sequences gathered throughout 6 year study on ADT application in skin flap blood perfusion monitoring and burn wounds diagnosis. For skin flap monitoring the database comprises of data collected during three different breast reconstruction procedures. The patients were monitored pre, intra and post surgically within 90 days period. The sequences were used...

  • Thermal sequences database of the skin flaps in breast reconstruction and burns

    This paper presents a database of Active Dynamic Thermography (ADT) thermal sequences gathered throughout 6 year study on ADT application in skin flap blood perfusion monitoring and burn wounds diagnosis. For skin flap monitoring the database comprises of data collected during three different breast reconstruction procedures. The patients were monitored pre, intra and post surgically within 90 days period. The sequences were used...

    Full text available to download

  • The use of the static thermography in monitoring flap perfusion in breast reconstruction with TRAM flap

    Publication

    - Year 2016

    This paper shows results of the static thermography for intraoperative and postoperative imaging of TRAM flap perfusion. The results were compared with the clinical examination of flap perfusion. The study was conducted on a group of 38 female patients who underwent breast reconstruction.

    Full text to download in external service

  • Identification of models and signals robust to occasional outliers

    Publication

    In this paper estimation algorithms derived in the sense of the least sum of absolute errors are considered for the purpose of identification of models and signals. In particular, off-line and approximate on-line estimation schemes discussed in the work are aimed at both assessing the coefficients of discrete-time stationary models and tracking the evolution of time-variant characteristics of monitored signals. What is interesting,...

  • Identification of models and signals robust to occasional outliers

    Publication

    In this paper estimation algorithms derived in the sense of the least sum of absolute errors are considered for the purpose of identification of models and signals. In particular, off-line and approximate on-line estimation schemes discussed in the work are aimed at both assessing the coefficients of discrete-time stationary models and tracking the evolution of time-variant characteristics of monitored signals. What is interesting,...

    Full text to download in external service

  • Active dynamic thermography method for TRAM flap blood perfusion mapping in breast reconstruction

    Publication

    - QIRT Journal - Year 2017

    This paper presents the new method of the transverse rectus abdominis musculocutaneous flap blood perfusion mapping based on the active dynamic thermography. The method is aimed at aiding a surgeon during breast reconstruction procedure. A pair of dTnorm and t90_10 parameters were used as parametric image descriptors of the flap blood perfusion. The method was tested on 38 patients that were subjected to breast reconstruction procedure....

    Full text available to download

  • Wavelet filtering of signals without using model functions

    Publication

    The effective wavelet filtering of real signals is impossible without determining their shape. The shape of a real signal is related to its wavelet spectrum. For shape analysis, a continuous color wavelet spectrogram of signal level is often used. The disadvantage of continuous wavelet spectrogram is the complexity of analyzing a blurry color image. A real signal with additive noise strongly distorts the spectrogram based on continuous...

    Full text available to download

  • Reconstruction of 3D structure of positive corona streamer by local methods

    Publication
    • M. Kocik
    • M. Tański
    • J. Mizeraczyk
    • R. Ichiki
    • S. Kanazawa
    • J. Dembski

    - Year 2009

    The computer algorithms were used for reconstruction of streamer 3D structure. We propose the 3D tree structure model of corona discharge streamer composed with nodes and edges between chosen couples of nodes, which enables easy computation of some important parameters ofstreamers. The 3D model can be derived directly from two projection images by global methods like evolutionary searching or particle simulations. In this paper...

    Full text to download in external service

  • Seafloor relief reconstruction from side scan sonar data

    Publication

    Side scan sonar is one of the most widely used imaging systems in the underwater environment. It is relatively cheap and easy to deploy, in comparison with more powerful sensors. Although side scan sonar does not provide seafloor bathymetry directly, its records are directly related to seafloor images. In the paper, the method for 3D seafloor relief reconstruction from side scan sonar data is presented. The method is based on the...

    Full text available to download

  • Detection of the Direct Sequence Spread Spectrum Signals with BPSK Modulation

    Publication

    - POLISH JOURNAL OF ENVIRONMENTAL STUDIES - Year 2011

    This paper presents a method of the DS CDMA signals with BPSK modulation detection through the examination of the enhanced signal spectrum density. On the base of experiments carried out on the real radio communication signals the impact of a narrowband emission occurring in the examined frequency band on the detection process effectiveness was shown. The results of the experiment aimed at the detection of the satellite navigation...

    Full text available to download

  • Pitch estimation of narrowband-filtered speech signal using instantaneous complex frequency

    In this paper we propose a novel method of pitch estimation, based on instantaneous complex frequency (ICF). New iterative algorithm for analysis of ICF of speech signal in presented. Obtained results are compared with commonly used methods to prove its accuracy and connection between ICF and pitch, particularly for narrowband-filtered speech signal.

  • Pitch estimation of narrowband-filtered speech signal using instantaneous complex frequency

    Publication

    - Year 2007

    In this paper we propose a novel method of pitch estimation, based on instantaneous complex frequency (ICF). New iterative algorithm for analysis of ICF of speech signal in presented. Obtained results are compared with commonly used methods to prove its accuracy and connection between ICF and pitch, particularly for narrowband-filtered speech signal.

  • Damage detection in plates based on Lamb wavefront shape reconstruction

    Publication

    - MEASUREMENT - Year 2021

    Many of the current studies in the area of damage detection using elastic wave propagation are based on deploying sensor networks with a large number of piezoelectric transducers to detect small-size cracks. A major limitation of these studies is that cracks are usually larger and have different shapes in real cases. Moreover, using a large number of sensing nodes for damage detection is both costly and computationally intensive....

    Full text available to download

  • Reconstruction of thin films polyazomethine based on microscopic images

    Publication

    - Year 2011

    Purpose: The aim of this paper was to investigate changes in surface morphology of thin films ofpolyazomethine PPI. Thin films were prepared using low-temperature chemical vapor deposition (CVD)method.Design/methodology/approach: The changes in surface topography was observed by the atomicforce microscope AFM and scanning electron microscope SEM. The results of roughness have beenprepared in the software WSxM NanoTec Spanish...

  • Automated detection of pronunciation errors in non-native English speech employing deep learning

    Publication

    - Year 2023

    Despite significant advances in recent years, the existing Computer-Assisted Pronunciation Training (CAPT) methods detect pronunciation errors with a relatively low accuracy (precision of 60% at 40%-80% recall). This Ph.D. work proposes novel deep learning methods for detecting pronunciation errors in non-native (L2) English speech, outperforming the state-of-the-art method in AUC metric (Area under the Curve) by 41%, i.e., from...

    Full text available to download

  • An attempt to create a digital reconstruction of the Copper Ship = Próba cyfrowej rekonstrukcji kadłuba wraku Miedziowca

    Publication

    - Year 2014

    This study presents an attempt to create a digital reconstruction of the W-5 shipwreck (the Copper Ship) based on data acquired by 3D scanning of structural components held at the National Maritime Museum in Gdańsk and on a physical reconstruction model of the ship’s hull. A digital reconstruction would facilitate analysis of various possible options for the structural design of the hull, and would enable the preparation of a model for...

    Full text to download in external service

  • Reception of GNSS Signals Under Jamming Conditions

    The article focuses on performance of Global Navigation Satellite System receivers in environment where intentional interference is present. First part is a general description of GNSS systems. Secondly, types of positioning service disturbances are specified. In the third part authors present a scheme of measurement stand which is used to evaluate the influence of interference on reception of navigation signals. Next, research...

  • Weakly-Supervised Word-Level Pronunciation Error Detection in Non-Native English Speech

    Publication
    • D. Korzekwa
    • J. Lorenzo-trueba
    • T. Drugman
    • S. Calamaro
    • B. Kostek

    - Year 2021

    We propose a weakly-supervised model for word-level mispronunciation detection in non-native (L2) English speech. To train this model, phonetically transcribed L2 speech is not required and we only need to mark mispronounced words. The lack of phonetic transcriptions for L2 speech means that the model has to learn only from a weak signal of word-level mispronunciations. Because of that and due to the limited amount of mispronounced...

    Full text available to download

  • Time-frequency analysis of acoustic signals using concentrated spectrogram

    Publication

    The paper presents improved method of time-frequency (TF) analysis of discrete-time signals. The method involves signal's local group delay (LGD) and channelized instantaneous frequency (CIF) to purposely redistribute all Short-time Fourier transform (STFT) lines. Additionally, the energy concentration index (ECI) and some histogram-like statistics are used to evaluate readability of estimated TF distributions of the energy. Recorded...

  • Mowa nienawiści (hate speech) a odpowiedzialność dostawców usług internetowych w orzecznictwie sądów europejskich

    Publication

    - Year 2015

    The article analyses the phenomenon of hate speech in the Internet contrasted with the problem of responsability of Internet Service Providers for cases of such abuses of freedom of expression. The text provides an analysis of jurisprudence of two European Courts. On the one hand it presents the position of the European Court of Human Rights on the problem of hate speech: its definition and the liability for it as an exception...

  • Graph Representation Integrating Signals for Emotion Recognition and Analysis

    Data reusability is an important feature of current research, just in every field of science. Modern research in Affective Computing, often rely on datasets containing experiments-originated data such as biosignals, video clips, or images. Moreover, conducting experiments with a vast number of participants to build datasets for Affective Computing research is time-consuming and expensive. Therefore, it is extremely important to...

    Full text available to download

  • Visual Lip Contour Detection for the Purpose of Speech Recognition

    Publication

    A method for visual detection of lip contours in frontal recordings of speakers is described and evaluated. The purpose of the method is to facilitate speech recognition with visual features extracted from a mouth region. Different Active Appearance Models are employed for finding lips in video frames and for lip shape and texture statistical description. Search initialization procedure is proposed and error measure values are...

  • Usage of concentrated spectrogram for analysis of acoustical signals

    Publication

    A novel precise method of signal analysis in the time-frequency domain is presented. A signal energy distribution is estimated by discard and displacement of energy parts of the classical spectrogram. A channelized instantaneous frequency and a local group delay are used in order to energy replacement. Additionally, newly introduced representations such as: a channelized instantaneous bandwidth and a local group duration are used...

    Full text to download in external service

  • Analysis and interpretation of radiometric signals in a liquid-gas bubble flow

    Publication

    - EPJ Web of Conferences - Year 2019

    The article presents the analysis of signals from a radiometric system consisting of two scintillation probes and two gamma radiation sealed sources. Calculations and interpretation were carried out for the bubble flow of the water-air mixture in the horizontal pipeline. The analysis of the obtained signals was done in time and frequency domain. In the frequency domain, a range of usable frequencies was identified, which were associated...

    Full text available to download

  • 3D Object Shape Reconstruction from Underwater Multibeam Data and Over Ground Lidar Scanning

    The technologies of sonar and laser scanning are an efficient and widely used source of spatial information with regards to underwater and over ground environment respectively. The measurement data are usually available in the form of groups of separate points located irregularly in three-dimensional space, known as point clouds. This data model has known disadvantages, therefore in many applications a different form of representation,...

    Full text available to download

  • Hybrid of Neural Networks and Hidden Markov Models as a modern approach to speech recognition systems

    The aim of this paper is to present a hybrid algorithm that combines the advantages ofartificial neural networks and hidden Markov models in speech recognition for control purpos-es. The scope of the paper includes review of currently used solutions, description and analysis of implementation of selected artificial neural network (NN) structures and hidden Markov mod-els (HMM). The main part of the paper consists of a description...

    Full text available to download

  • Human-computer interactions in speech therapy using a blowing interface

    Publication

    In this paper we present a new human-computer interface for the quantitative measurement of blowing activities. The interface can measure the air flow and air pressure during the blowing activity. The measured values are stored and used to control the state of the graphical objects in the graphical user interface. In speech therapy children will find easier to play attractive therapeutic games than to perform repetitive and tedious,...

    Full text to download in external service

  • Annual signals observed in regional GPS networks

    Publication
    • J. Bogusz
    • M. Figurski

    - Acta Geodynamica et Geomaterialia - Year 2014

    Abstract: This paper describes analyses concerning annual signals in GPS-derived coordinates. The data was processed in the Military University of Technology Local Analysis Centre with Bernese 5.0 software. We used observations from 129 permanent GPS stations which belong to the Polish Active Geodetic Network (ASG-EUPOS), for the period of GPS weeks 1465-1729, corresponding to about 5 years. The annual signals have been estimated...

    Full text available to download

  • Respiratory signals derived from capacitive electrocardiogram on the smart chair

    Publication

    - Year 2020

    Capacitive electrocardiogram (CECG) tends to deliver basic cardiac signals without need to use traditional glued electrodes. In the paper analysis of possibility if the ECG derived respiratory waveforms out of the CECG.

    Full text available to download

  • Processing of LiDAR and Multibeam Sonar Point Cloud Data for 3D Surface and Object Shape Reconstruction

    Publication

    Unorganised point cloud dataset, as a transitional data model in several applications, usually contains a considerable amount of undesirable irregularities, such as strong variability of local point density, missing data, overlapping points and noise caused by scattering characteristics of the environment. For these reasons, further processing of such data, e.g. for construction of higher order geometric models of the topography...

    Full text to download in external service

  • TDOA Navigation Using CDMA2000 Signals – Experimental Results

    Publication

    - Year 2014

    This paper presents results of an experiments on the possibility to estimate position of a CDMA2000 receiver on the basis of TDOA measurements. The hardware and software structure of a navigation receiver used during investigation is briefly described with focus on drawbacks and limitations. The main part of this paper contains basic information about CDMA2000 network in northern Poland, which signals were recorded during tests...

  • 3D seafloor reconstruction using data from side scan and synthetic aperture sonar

    Publication

    Side scan and synthetic aperture sonars are widely used imaging systems in the underwater environment. They are relatively cheap and easy to deploy, in comparison with more powerful sensors, like multibeam echosounders. Although side scan and synthetic aperture sonars does not provide seafloor bathymetry directly, their records are finally related to seafloor images. Moreover, the analysis of such images performed by human eye...

    Full text available to download

  • Recognition of Emotions in Speech Using Convolutional Neural Networks on Different Datasets

    Artificial Neural Network (ANN) models, specifically Convolutional Neural Networks (CNN), were applied to extract emotions based on spectrograms and mel-spectrograms. This study uses spectrograms and mel-spectrograms to investigate which feature extraction method better represents emotions and how big the differences in efficiency are in this context. The conducted studies demonstrated that mel-spectrograms are a better-suited...

    Full text available to download

  • Log signals simulation.

    Publication

    - Year 2004

    Symulatory logów (urządzeń mierzących prędkość w nawigacji morskiej), które używane są do testowania oraz szkolenia operatorów radarów i systemów antykolizyjnych, powinny posiadać również wyjście impulsowe, które w logu rzeczywistym pochodzi z licznika przebytej drogi, w postaci zadanej liczby impulsów na milę morską. Urządzenie takie to przetwornik cyfrowo-częstotliwościowy w formie programowanego cyfrowo dzielnika częstotliwości....

  • Post‐Second World War Reconstruction of Polish Cities: The Interplay Between Politics and Paradigms

    Publication

    By the end of the Second World War, many of the Polish cities—and especially their historic centres—were in ruins. This was caused by both bombings and sieges conducted by the Nazis and Soviets. The particular group of cities is associated with former German lands—now called the “Recovered Territories”—which were incorporated into the borders of Poland as compensation for its Eastern Borderlands lost to the Soviet Union. These...

    Full text available to download

  • Stress Detection of Children With ASD Using Physiological Signals

    Publication
    • S. N. B. Aktas
    • P. Uluer
    • B. Coskun
    • E. Toprak
    • D. E. Barkana
    • H. Kose
    • T. Zorcec
    • B. Robins
    • A. Landowska

    - Year 2022

    This paper proposes a physiological signal-based stress detection approach for children with autism spectrum disorder (ASD) to be used in social and assistive robot inter- vention. Electrodermal activity (EDA) and blood volume pulse (BVP) signals are collected with an E4 smart wristband from children with ASD in different countries. The peak count and signal amplitude features are derived from EDA signal and used in order to detect...

    Full text to download in external service

  • Comparison of perforator location in dynamic and static thermographic imaging with Doppler ultrasound in breast reconstruction surgery

    Publication

    - Year 2016

    This paper co mpares the effectiveness of the dTnorm and t90_10 parametrizations in dynamic thermography for imaging location of perforators in TRAM flaps in the intraoperative period. The results were compared with the location detected in a Doppler ultrasound examination. Cold and heat stimulation was used in dynamic thermography. Additionally, these results were compared with static...

    Full text to download in external service

  • Database of speech and facial expressions recorded with optimized face motion capture settings

    The broad objective of the present research is the analysis of spoken English employing a multiplicity of modalities. An important stage of this process, discussed in the paper, is creating a database of speech accompanied with facial expressions. Recordings of speakers were made using an advanced system for capturing facial muscle motion. A brief historical outline, current applications, limitations and the ways of capturing face...

    Full text available to download

  • Detection of dialogue in movie soundtrack for speech intelligibility enhancement

    Publication

    - Year 2014

    A method for detecting dialogue in 5.1 movie soundtrack based on interchannel spectral disparity is presented. The front channel signals (left, right, center) are analyzed in the frequency domain. The selected partials in the center channel signal, which yield high disparity with left and right channels, are detected as dialogue. Subsequently, the dialogue frequency components are boosted to achieve increased dialogue intelligibility....

    Full text to download in external service

  • The Impact of Foreign Accents on the Performance of Whisper Family Models Using Medical Speech in Polish

    Publication

    - Year 2024

    The article presents preliminary experiments investigating the impact of accent on the performance of the Whisper automatic speech recognition (ASR) system, specifically for the Polish language and medical data. The literature review revealed a scarcity of studies on the influence of accents on speech recognition systems in Polish, especially concerning medical terminology. The experiments involved voice cloning of selected individuals...

    Full text available to download

  • Estimation of the short-term predictor parameters of speech under noisy conditions

    Publication

    Full text to download in external service

  • Using concentrated spectrogram for analysis of audio acoustic signals

    Publication

    The paper presents results of time-frequency analysis of audio acoustic signals using the method of Concentrated Spectrograph also known as ''Cross-spectral method'' or ''Reassignment method''. Presented algorithm involves signal's local group delay and channelized instantaneous frequency to relevantly redistribute all Short-time Fourier transform lines in time-frequency plain. The main intention of the paper is to compare various...

    Full text available to download

  • APPLICATION OF VIBRATION SIGNALS IN RAILWAY TRACK DIAGNOSTICS USING A MOBILE RAILWAY PLATFORM

    Publication

    - Archives of Transport - Year 2024

    The article presents a comprehensive method for using vibration signals to diagnose railway tracks. The primary objective is to gather detailed information on track conditions through a passive experiment. This involves using mobile diagnostic tools and techniques to assess railway infrastructure. The article elaborates on the range of diagnostic activities conducted in accordance with detailed railway regulations and highlights...

    Full text to download in external service