Wyniki wyszukiwania dla: rate of speech estimation

Real-time speech-rate modification experiments

Publikacja

- Rok 2010

An algorithm designed for real-time speech time scale modification (stretching) is proposed, providing a combination of typical synchronous overlap and add based time scale modification algorithm and signal redundancy detection algorithms that allow to remove parts of the speech signal and replace them with the stretched speech signal fragments. Effectiveness of signal processing algorithms are examined experimentally together...

Pełny tekst do pobrania w serwisie zewnętrznym

Estimation of the excitation variances of speech and noise AR-models for enhanced speech coding

Publikacja

M. Kuropatwinski
W. Kleijn
M. Kuropatwiński

- Rok 2001

Pełny tekst do pobrania w serwisie zewnętrznym

On the EM algorithm for the estimation of speech AR parameters in noise

Publikacja

M. Kuropatwinski
B. Kleijn
M. Kuropatwiński

- Rok 2014

Pełny tekst do pobrania w serwisie zewnętrznym

Estimation of respiration rate using an accelerometer and thermal camera in eGlasses

Publikacja

- Rok 2016

Respiration rate is a very important vital sign. Different methods of respiration rate measurement or estimation have been developed. However, especially interesting are those that enable remote and unobtrusive monitoring. In this study, we investigated the use of smart glasses for the estimation of respiration rate especially useful for indoors applications. Two methods were analyzed. The first one is based on measurements of...

Pełny tekst do pobrania w serwisie zewnętrznym

The accuracy of pulse rate estimation from the sequence of face images

Publikacja

J. Rumiński

- Rok 2016

The goal of this paper is to analyze the accuracy of pulse rate estimation from the sequence of face images. Simulated and real signals were used to evaluate two pulse rate estimators; one for frequency domain and the second one for time domain using the autocorrelation function. The results show that the mean difference between the reference measurements and estimated pulse rate values are about 2bpm. In the analysis of short...

Pełny tekst do pobrania w serwisie zewnętrznym

Chirp-rate estimation of FM signals in the time-frequency domain

Publikacja

K. Czarnecki

- HYDROACOUSTICS - Rok 2014

Novel dynamic representations of a complex signal in the time-frequency domain including: a channelized instantaneous complex frequency (CICF), a complex local group delay (CLGD) and a channelized instantaneous chirp-rate (CICR) are introduced. The proposed approach is based on the use of the gradient of the short-time Fourier transform complex phase. An interpretation of the newly-introduced distributions especially of the CICR...

Pełny tekst do pobrania w portalu

Estimation of the short-term predictor parameters of speech under noisy conditions

Publikacja

M. Kuropatwinski
W. Kleijn
M. Kuropatwiński

- IEEE Transactions on Audio Speech and Language Processing - Rok 2006

Pełny tekst do pobrania w serwisie zewnętrznym

Speech formant frequency and pitch estimation using instantaneous complex frequency

Publikacja

M. [. Kaniewska

- Rok 2008

W pracy opisany został algorytm estymacji częstotliwości podstawowej oraz częstotliwości środkowych i pasm formantów mowy z wykorzystaniem zespolonej pulsacji chwilowej. W artykule przedstawiono również wyniki działania algorytmu dla polskich samogłosek.

Chirp Rate and Instantaneous Frequency Estimation: Application to Recursive Vertical Synchrosqueezing

Publikacja

D. Fourer
F. Auger
K. Czarnecki
S. Meignen
P. Flandrin

- IEEE SIGNAL PROCESSING LETTERS - Rok 2017

This letter introduces new chirp rate and instantaneous frequency estimators designed for frequency-modulated signals. These estimators are first investigated from a deterministic point of view, then compared together in terms of statistical efficiency. They are also used to design new recursive versions of the vertically synchrosqueezed short-time Fourier transform, using a previously published method (D. Fourer, F. Auger, and...

Pełny tekst do pobrania w portalu

Pitch estimation of narrowband-filtered speech signal using instantaneous complex frequency

Publikacja

- Elektronika : konstrukcje, technologie, zastosowania - Rok 2008

In this paper we propose a novel method of pitch estimation, based on instantaneous complex frequency (ICF). New iterative algorithm for analysis of ICF of speech signal in presented. Obtained results are compared with commonly used methods to prove its accuracy and connection between ICF and pitch, particularly for narrowband-filtered speech signal.

Pitch estimation of narrowband-filtered speech signal using instantaneous complex frequency

Publikacja

- Rok 2007

In this paper we propose a novel method of pitch estimation, based on instantaneous complex frequency (ICF). New iterative algorithm for analysis of ICF of speech signal in presented. Obtained results are compared with commonly used methods to prove its accuracy and connection between ICF and pitch, particularly for narrowband-filtered speech signal.

Sonar Pulse Detection Using Chirp Rate Estimation and CFAR Algorithms

Publikacja

K. Abratkiewicz

- HYDROACOUSTICS - Rok 2017

This paper presents a new approach to sonar pulse detection. The method uses chirp rate estimators and algorithms for the adaptive threshold, commonly used in radiolocation. The proposed approach allows detection of pulses of unknown parameters, which may be used in passive hydrolocation or jamming detection in underwater communication. Such an analysis is possible thanks to a new kind of imaging, which presents signal energy in...

Pełny tekst do pobrania w portalu

Respiratory Rate Estimation Based on Detected Mask Area in Thermal Images

Publikacja

- Rok 2023

The popularity of non-contact methods of measuring vital signs, particularly respiratory rate, has increased during the SARS-COV-2 pandemic. Breathing parameters can be estimated by analysis of temperature changes observed in thermal images of nostrils or mouth regions. However, wearing virus-protection face masks prevents direct detection of such face regions. In this work, we propose to use an automatic mask detection approach...

Pełny tekst do pobrania w serwisie zewnętrznym

Evaluating Accuracy of Respiratory Rate Estimation from Super Resolved Thermal Imagery

Publikacja

- Rok 2019

Non-contact estimation of Respiratory Rate (RR) has revolutionized the process of establishing the measurement by surpassing some issues related to attaching sensors to a body, e.g. epidermal stripping, skin disruption and pain. In this study, we perform further experiments with image processing-based RR estimation by using various image enhancement algorithms. Specifically, we employ Super Resolution (SR) Deep Learning (DL) network...

Pełny tekst do pobrania w portalu

Stochastic Integration and Long Term Predictor Estimation under Noisy Conditions for Speech Enhancement

Publikacja

M. Kuropatwinski
W. Kleijn
M. Kuropatwiński

- Rok 2005

Pełny tekst do pobrania w serwisie zewnętrznym

Respiration rate estimation using non-linear observers in application to wastewater treatment plant

Publikacja

- JOURNAL OF PROCESS CONTROL - Rok 2023

A problem of respiration rate estimation using two new non-linear observers for a wastewater treatment plant is addressed in this paper. In particular, a non-linear adaptive Luenberger-like observer and a super twisting sliding mode observer have been derived to produce stable and bounded estimates of the respiration rate. During the synthesis of the particular observer, an appropriate mathematical utility model was used. The observability...

Pełny tekst do pobrania w serwisie zewnętrznym

Respiration rate estimation for model predictive control of dissolved oxygen in wastewater treatment plant

Publikacja

- Rok 2008

Respiration rate is very important parameter for biological processes in wastewater treatment plant (WWTP). The sequential algorithm for estimate the respiration rate is proposed and investigated. The Kalman filter (KF) is used. Simulation tests for the benchmark WWTP are presented.Respiracja jest bardzo ważnym parametrem dla prawidłowego przebiegu procesów biologicznych w oczyszczalni ścieków. W artykule przedstawiono i zbadano...

Pełny tekst do pobrania w serwisie zewnętrznym

Estimation of time-frequency complex phase-based speech attributes using narrow band filter banks

Publikacja

K. Abratkiewicz
K. Czarnecki
D. Fourer
F. Auger

- Rok 2017

In this paper, we present nonlinear estimators of nonstationary and multicomponent signal attributes (parameters, properties) which are instantaneous frequency, spectral (or group) delay, and chirp-rate (also known as instantaneous frequency slope). We estimate all of these distributions in the time-frequency domain using both finite and infinite impulse response (FIR and IIR) narrow band filers for speech analysis. Then, we present...

Pełny tekst do pobrania w portalu

Minimum mean square error estimation of speech short-term predictor parameters under noisy conditions

Publikacja

M. Kuropatwinski
W. Kleijn
M. Kuropatwiński

- Rok 2003

Pełny tekst do pobrania w serwisie zewnętrznym

A novel method of local chirp-rate estimation of LFM chirp signals in the time-frequency domain

Publikacja

- Rok 2013

In the paper, novel dynamic representations of a complex signal in the time-frequency domain are introduced. The proposed approach is based on using the gradient of the short-time Fourier transform complex phase. A channelized instantaneous complex frequency (CICF) and a complex local group delay (CLGD) are included in the presented signal representations. An application of the newly-introduced distributions is demonstrated by...

Pełny tekst do pobrania w serwisie zewnętrznym

Improving Accuracy of Contactless Respiratory Rate Estimation by Enhancing Thermal Sequences with Deep Neural Networks

Publikacja

- Applied Sciences-Basel - Rok 2019

Estimation of vital signs using image processing techniques have already been proved to have a potential for supporting remote medical diagnostics and replacing traditional measurements that usually require special hardware and electrodes placed on a body. In this paper, we further extend studies on contactless Respiratory Rate (RR) estimation from extremely low resolution thermal imagery by enhancing acquired sequences using Deep...

Pełny tekst do pobrania w portalu

Improving Accuracy of Respiratory Rate Estimation by Restoring High Resolution Features With Transformers and Recursive Convolutional Models

Publikacja

A. Kwaśniewska
M. Szankin
J. Rumiński
A. Sarah
D. Gamba

- Rok 2021

Non-contact evaluation of vital signs has been becoming increasingly important, especially in light of the COVID- 19 pandemic, which is causing the whole world to examine people’s interactions in public places at a scale never seen before. However, evaluating one’s vital signs can be a relatively complex procedure, which requires both time and physical contact between examiner and examinee. These re- quirements limit the number...

Pełny tekst do pobrania w portalu

Hydraulic Conductivity Estimation Test Impact on Long-Term Acceptance Rate and Soil Absorption System Design

Publikacja

J. NIEĆ
M. Spychała

- Water - Rok 2014

Pełny tekst do pobrania w serwisie zewnętrznym

The motion influence on respiration rate estimation from low-resolution thermal sequences during attention focusing tasks

Publikacja

- Rok 2017

Global aging has led to a growing expectancy for creating home-based platforms for indoor monitoring of elderly people. A motivation is to provide a non-intrusive technique, which does not require special activities of a patient but allows for remote monitoring of elderly people while assisting them with their daily activities. The goal of our study was to evaluate motion performed by a person focused on a specific task and check...

Pełny tekst do pobrania w portalu

Estimation of autotrophic maximum specific growth rate constant-experience from the long-term operation of a laboratory-scale Sequencing Batch Reactor System

Publikacja

Y. Su
J. Mąkinia
K. Pagilla

- WATER ENVIRONMENT RESEARCH - Rok 2008

W pracy wykorzystano wyniki badań z długookresowej eksploatacji laboratoryjnego reaktora typu SBR do estymacji stałej szybkości przyrostu bakterii nitryfikacyjnych. Badania obejmowały zarówno pomiary empiryczne (doświadczenia przy niskim stosunku substratu do biomasy) oraz symulacje komputerowe. Do symulacji dynamicznych wykorzystano jedną wartość współczynnika = 1,2 1/d, pomimo różnych wartości mierzonych jednostkowych szybkości...

A non-uniform real-time speech time-scale stretching method

Publikacja

- Rok 2011

An algorithm for non-uniform real-time speech stretching is presented. It provides a combination of typical SOLA algorithm (Synchronous Overlap and Add ) with the vowels, consonants and silence detectors. Based on the information about the content and the estimated value of the rate of speech (ROS), the algorithm adapts the scaling factor value. The ability of real-time speech stretching and the resultant quality of voice were...

Speech codec enhancements utilizing time compression and perceptual coding

Publikacja

- Rok 2007

A method for encoding wideband speech signal employing standardized narrowband speech codecs is presented as well as experimental results concerning detection of tonal spectral components. The speech signal sampled with a higher sampling rate than it is suitable for narrowband coding algorithm is compressed in order to decrease the amount of samples. Next, the time-compressed representation of a signal is encoded using a narrowband...

Methods of Improving Speech Intelligibility for Listeners with Hearing Resolution Deficit

Publikacja

- Diagnostic Pathology - Rok 2012

Methods developed for real-time time scale modification (TSM) of speech signal are presented. They are based onthe non-uniform, speech rate depended SOLA algorithm (Synchronous Overlap and Add). Influence of theproposed method on the intelligibility of speech was investigated for two separate groups of listeners, i.e. hearingimpaired children and elderly listeners. It was shown that for the speech with average rate equal to or...

Pełny tekst do pobrania w portalu

High quality speech codec employing sines+noise+transients model

Publikacja

- Archives of Acoustics - Rok 2006

A method of high quality wideband speech signal representation employing sines+transients+noise model is presented. The need for a wideband speech coding approach as well as various methods for analysis and synthesis of sines, residual and transient states of speech signal is discussed. The perceptual criterion is applied in the proposed approach during encoding of sines amplitudes in order to reduce bandwidth requirements and...

Pełny tekst do pobrania w portalu

Optimizing Medical Personnel Speech Recognition Models Using Speech Synthesis and Reinforcement Learning

Publikacja

A. Czyżewski

- Journal of the Acoustical Society of America - Rok 2023

Text-to-Speech synthesis (TTS) can be used to generate training data for building Automatic Speech Recognition models (ASR). Access to medical speech data is because it is sensitive data that is difficult to obtain for privacy reasons; TTS can help expand the data set. Speech can be synthesized by mimicking different accents, dialects, and speaking styles that may occur in a medical language. Reinforcement Learning (RL), in the...

Pełny tekst do pobrania w portalu

Comparison of Acoustic and Visual Voice Activity Detection for Noisy Speech Recognition

Publikacja

- Rok 2016

The problem of accurate differentiating between the speaker utterance and the noise parts in a speech signal is considered. The influence of utilizing a voice activity detection in speech signals on the accuracy of the automatic speech recognition (ASR) system is presented. The examined methods of voice activity detection are based on acoustic and visual modalities. The problem of detecting the voice activity in clean and noisy...

Acoustic Sensing Analytics Applied to Speech in Reverberation Conditions

Publikacja

- SENSORS - Rok 2021

The paper aims to discuss a case study of sensing analytics and technology in acoustics when applied to reverberation conditions. Reverberation is one of the issues that makes speech in indoor spaces challenging to understand. This problem is particularly critical in large spaces with few absorbing or diffusing surfaces. One of the natural remedies to improve speech intelligibility in such conditions may be achieved through speaking...

Pełny tekst do pobrania w portalu

Examining Influence of Distance to Microphone on Accuracy of Speech Recognition

Publikacja

- Rok 2015

The problem of controlling a machine by the distant-talking speaker without a necessity of handheld or body-worn equipment usage is considered. A laboratory setup is introduced for examination of performance of the developed automatic speech recognition system fed by direct and by distant speech acquired by microphones placed at three different distances from the speaker (0.5 m to 1.5 m). For feature extraction from the voice signal...

Pełny tekst do pobrania w serwisie zewnętrznym

Comparison of various speech time-scale modificartion methods

Publikacja

- Archives of Acoustics - Rok 2011

The objective of this work is to investigate the influence of the different time-scale modification (TSM) methods on the quality of the speech stretched up using the designed non-uniform real-time speech time-scale modification algorithm (NU-RTSM). The algorithm provides a combination of the typical TSM algorithm with the vowels, consonants, stutter, transients and silence detectors. Based on the information about the content and...

Improved method for real-time speech stretching

Publikacja

- Rok 2012

n algorithm for real-time speech stretching is presented. It was designed to modify input signal dependently on its content and on its relation with the historical input data. The proposed algorithm is a combination of speech signal analysis algorithms, i.e. voice, vowels/consonants, stuttering detection and SOLA (Synchronous-Overlap-and-Add) based speech stretching algorithm. This approach enables stretching input speech signal...

Pełny tekst do pobrania w serwisie zewnętrznym

A Novel Method for Intelligibility Assessment of Nonlinearly Processed Speech in Spaces Characterized by Long Reverberation Times

Publikacja

- SENSORS - Rok 2022

Objective assessment of speech intelligibility is a complex task that requires taking into account a number of factors such as different perception of each speech sub-bands by the human hearing sense or different physical properties of each frequency band of a speech signal. Currently, the state-of-the-art method used for assessing the quality of speech transmission is the speech transmission index (STI). It is a standardized way...

Pełny tekst do pobrania w portalu

A Method of Real-Time Non-uniform Speech Stretching

Publikacja

- Rok 2012

Developed method of real-time non-uniform speech stretching is presented.The proposed solution is based on the well-known SOLA algorithm(Synchronous Overlap and Add). Non-uniform time-scale modification isachieved by the adjustment of time scaling factor values in accordance with thesignal content. Dependently on the speech unit (vowels/consonants), instantaneousrate of speech (ROS), and speech signal presence, values of the scalingfactor...

Pełny tekst do pobrania w serwisie zewnętrznym

Evaluation of Lombard Speech Models in the Context of Speech in Noise Enhancement

Publikacja

G. Korvel
K. Kąkol
O. Kurasova
B. Kostek

- IEEE Access - Rok 2020

The Lombard effect is one of the most well-known effects of noise on speech production. Speech with the Lombard effect is more easily recognizable in noisy environments than normal natural speech. Our previous investigations showed that speech synthesis models might retain Lombard-effect characteristics. In this study, we investigate several speech models, such as harmonic, source-filter, and sinusoidal, applied to Lombard speech...

Pełny tekst do pobrania w portalu

Quality Evaluation of Speech Transmission via Two-way BPL-PLC Voice Communication System in an Underground Mine

Publikacja

P. Falkowski-Gilski
G. Debita

- Archives of Acoustics - Rok 2023

In order to design a stable and reliable voice communication system, it is essential to know how many resources are necessary for conveying quality content. These parameters may include objective quality of service (QoS) metrics, such as: available bandwidth, bit error rate (BER), delay, latency as well as subjective quality of experience (QoE) related to user expectations. QoE is expressed as clarity of speech and the ability...

Pełny tekst do pobrania w portalu

Computer-assisted pronunciation training—Speech synthesis is almost all you need

Publikacja

D. Korzekwa
J. Lorenzo-trueba
T. Drugman
B. Kostek

- SPEECH COMMUNICATION - Rok 2022

The research community has long studied computer-assisted pronunciation training (CAPT) methods in non-native speech. Researchers focused on studying various model architectures, such as Bayesian networks and deep learning methods, as well as on the analysis of different representations of the speech signal. Despite significant progress in recent years, existing CAPT methods are not able to detect pronunciation errors with high...

Pełny tekst do pobrania w portalu

Improving the quality of speech in the conditions of noise and interference

Publikacja

B. Kostek
K. Kąkol

- Journal of the Acoustical Society of America - Rok 2018

The aim of the work is to present a method of intelligent modification of the speech signal with speech features expressed in noise, based on the Lombard effect. The recordings utilized sets of words and sentences as well as disturbing signals, i.e., pink noise and the so-called babble speech. Noise signal, calibrated to various levels at the speaker's ears, was played over two loudspeakers located 2 m away from the speaker. In...

Pełny tekst do pobrania w serwisie zewnętrznym

Variable Ratio Sample Rate Conversion Based on Fractional Delay Filter

Publikacja

M. Blok
P. Drózda

- Archives of Acoustics - Rok 2014

In this paper a sample rate conversion algorithm which allows for continuously changing resampling ratio has been presented. The proposed implementation is based on a variable fractional delay filter which is implemented by means of a Farrow structure. Coefficients of this structure are computed on the basis of fractional delay filters which are designed using the offset window method. The proposed approach allows us to freely...

Pełny tekst do pobrania w portalu

Transient detection for speech coding applications

Publikacja

- International Journal of Computer Science and Network Security - Rok 2006

Signal quality in speech codecs may be improved by selecting transients from speech signal and encoding them using a suitable method. This paper presents an algorithm for transient detection in speech signal. This algorithm operates in several frequency bands. Transient detection functions are calculated from energy measured in short frames of the signal. The final selection of transient frames is based on results of detection...

Pełny tekst do pobrania w serwisie zewnętrznym

Applying the Lombard Effect to Speech-in-Noise Communication

Publikacja

G. Korvel
K. Kąkol
P. Treigys
B. Kostek

- Electronics - Rok 2023

This study explored how the Lombard effect, a natural or artificial increase in speech loudness in noisy environments, can improve speech-in-noise communication. This study consisted of several experiments that measured the impact of different types of noise on synthesizing the Lombard effect. The main steps were as follows: first, a dataset of speech samples with and without the Lombard effect was collected in a controlled setting;...

Pełny tekst do pobrania w portalu

Constructing a Dataset of Speech Recordingswith Lombard Effect

Publikacja

D. Weber
S. Zaporowski
D. Korzekwa

- Rok 2020

Thepurpose of therecordings was to create a speech corpus based on the ISLEdataset, extended with video and Lombard speech. Selected from a set of 165sentences, 10, evaluatedas having thehighest possibility to occur in the context ofthe Lombard effect,were repeated in the presence of the so-called babble speech to obtain Lombard speech features. Altogether,15speakers were recorded, and speech parameterswere...

Improving Objective Speech Quality Indicators in Noise Conditions

Publikacja

K. Kąkol
G. Korvel
B. Kostek

- Rok 2020

This work aims at modifying speech signal samples and test them with objective speech quality indicators after mixing the original signals with noise or with an interfering signal. Modifications that are applied to the signal are related to the Lombard speech characteristics, i.e., pitch shifting, utterance duration changes, vocal tract scaling, manipulation of formants. A set of words and sentences in Polish, recorded in silence,...

Pełny tekst do pobrania w serwisie zewnętrznym

EXAMINING INFLUENCE OF VIDEO FRAMERATE AND AUDIO/VIDEO SYNCHRONIZATION ON AUDIO-VISUAL SPEECH RECOGNITION ACCURACY

Publikacja

- Rok 2014

The problem of video framerate and audio/video synchronization in audio-visual speech recognition is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The...

EXAMINING INFLUENCE OF VIDEO FRAMERATE AND AUDIO/VIDEO SYNCHRONIZATION ON AUDIO-VISUAL SPEECH RECOGNITION ACCURACY

Publikacja

- Rok 2014

The problem of video framerate and audio/video synchronization in audio-visual speech recogni-tion is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The...

Methodology and technology for the polymodal allophonic speech transcription

Publikacja

- Journal of the Acoustical Society of America - Rok 2016

A method for automatic audiovisual transcription of speech employing: acoustic and visual speech representations is developed. It adopts a combining of audio and visual modalities, which provide a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e. the changes in the articulatory setting of speech organs for...

Pełny tekst do pobrania w serwisie zewnętrznym

Methodology and technology for the polymodal allophonic speech transcription

Publikacja

- Journal of the Acoustical Society of America - Rok 2016

A method for automatic audiovisual transcription of speech employing: acoustic, electromagnetical articulography and visual speech representations is developed. It adopts a combining of audio and visual modalities, which provide a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e., the changes in the articulatory...

Pełny tekst do pobrania w serwisie zewnętrznym

Wyszukiwarka

Filtry

Katalog

Wyniki wyszukiwania dla: rate of speech estimation