Search results for: RECONSTRUCTION OF SPEECH SIGNALS
-
Investigations of the Methods of Time Delay Measurement of Stochastic Signals Using Cross-correlation with the Hilbert Transform
Publication: The article presents the results of simulation studies of four methods of estimating time delay for random signals using cross-correlation with the Hilbert Transform. Selected models of mutually delayed stochastic signals were used in the simulations, corresponding to the signals obtained from scintillation detectors in radioisotope measurements of liquid-gas two-phase flow. Standard deviations of the values of the individual functions...
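As a rough illustration of the general idea named in this abstract (not a reproduction of any of the four methods studied), the sketch below estimates a delay from the peak of the envelope of the cross-correlation function, where the envelope is the magnitude of the analytic signal obtained with the Hilbert transform. The function names, the naive DFT, the synthetic test signal, and the lag window are all assumptions made for this example.

```python
import cmath
import math
import random


def analytic_signal(x):
    """Return the analytic signal of x using a naive O(N^2) DFT.

    The imaginary part of the result is the (circular) Hilbert transform of x.
    """
    n = len(x)
    spec = [
        sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]
    half = n // 2
    for k in range(1, n):
        if k < half or (n % 2 == 1 and k == half):
            spec[k] *= 2  # positive frequencies are doubled
        elif not (n % 2 == 0 and k == half):
            spec[k] = 0   # negative frequencies are removed (Nyquist bin is kept)
    return [
        sum(spec[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n
        for t in range(n)
    ]


def cross_correlation(x, y, max_lag):
    """Raw sums of x[i] * y[i + lag] for lags -max_lag..max_lag."""
    lags = list(range(-max_lag, max_lag + 1))
    values = []
    for lag in lags:
        lo = max(0, -lag)
        hi = min(len(x), len(y) - lag)
        values.append(sum(x[i] * y[i + lag] for i in range(lo, hi)))
    return lags, values


def estimate_delay(x, y, max_lag):
    """Lag at which the envelope of the cross-correlation is largest."""
    lags, ccf = cross_correlation(x, y, max_lag)
    envelope = [abs(z) for z in analytic_signal(ccf)]
    best = max(range(len(lags)), key=envelope.__getitem__)
    return lags[best]


# Hypothetical test data: white noise and a noisy copy delayed by 5 samples.
random.seed(0)
true_delay = 5
x = [random.gauss(0.0, 1.0) for _ in range(512)]
y = [0.0] * true_delay + x[:-true_delay]
y = [v + 0.2 * random.gauss(0.0, 1.0) for v in y]
print(estimate_delay(x, y, max_lag=20))
```

The envelope is used here because it stays smooth around the maximum even when the raw correlation oscillates; other variants instead look for the zero crossing of the Hilbert transform of the correlation function.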
-
Analysis of Vibration and Acoustic Signals for Noncontact Measurement of Engine Rotation Speed
Publication: The non-contact measurement of engine speed can be realized by analyzing engine vibration frequency. However, the vibration signal is distorted by harmonics and noise in the measurement. This paper presents a novel method for the measurement of engine rotation speed by using the cross-correlation of vibration and acoustic signals. This method can enhance the same frequency components in engine vibration and acoustic signal. After...
-
Global Optimization for Recovery of Clipped Signals Corrupted With Poisson-Gaussian Noise
Publication: We study a variational formulation for reconstructing nonlinearly distorted signals corrupted with a Poisson-Gaussian noise. In this situation, the data fidelity term consists of a sum of a weighted least squares term and a logarithmic one. Both of them are precomposed by a nonlinearity, modelling a clipping effect, which is assumed to be rational. A regularization term, being a piecewise rational approximation of the ℓ0 function...
-
Cross-Lingual Knowledge Distillation via Flow-Based Voice Conversion for Robust Polyglot Text-to-Speech
Publication: In this work, we introduce a framework for cross-lingual speech synthesis, which involves an upstream Voice Conversion (VC) model and a downstream Text-To-Speech (TTS) model. The proposed framework consists of 4 stages. In the first two stages, we use a VC model to convert utterances in the target locale to the voice of the target speaker. In the third stage, the converted data is combined with the linguistic features and durations...
-
Rough Set-Based Classification of EEG Signals Related to Real and Imagery Motion
Publication: A rough set-based approach to classification of EEG signals registered while subjects were performing real and imagery motions is presented in the paper. The appropriate subset of EEG channels is selected, the recordings are segmented, and features are extracted, based on time-frequency decomposition of the signal. A rough set classifier is trained in several scenarios, comparing accuracy of classification for real and imagery motion....
-
Comparison of Classification Methods for EEG Signals of Real and Imaginary Motion
Publication: The classification of EEG signals provides an important element of brain-computer interface (BCI) applications, underlying an efficient interaction between a human and a computer application. The BCI applications can be especially useful for people with disabilities. Numerous experiments aim at recognition of motion intent of left or right hand being useful for locked-in-state or paralyzed subjects in controlling computer applications....
-
An automatic system for identification of random telegraph signal (RTS) noise in noise signals
Publication: The paper presents an automatic, universal system for identification of Random Telegraph Signal (RTS) noise as a non-Gaussian component of the inherent noise signal of semiconductor devices. The system for data acquisition and processing is described. Histograms of the instantaneous values of the noise signals are calculated as the basis for analysis of the noise signal to determine the number of local maxima of histograms...
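The histogram step mentioned in this abstract can be sketched as follows. This is a minimal illustration only, assuming a synthetic two-level RTS with additive Gaussian noise; the bin layout, the 10% height threshold, and all function names are hypothetical choices for the example and are not taken from the described system.

```python
import math
import random


def histogram(samples, lo, hi, n_bins):
    """Count samples falling into n_bins equal-width bins on [lo, hi)."""
    width = (hi - lo) / n_bins
    counts = [0] * n_bins
    for v in samples:
        idx = math.floor((v - lo) / width)
        if 0 <= idx < n_bins:
            counts[idx] += 1
    return counts


def count_histogram_peaks(counts, min_fraction=0.1):
    """Count local maxima at least min_fraction as tall as the tallest bin."""
    floor = min_fraction * max(counts)
    padded = [0] + counts + [0]
    peaks = 0
    for i in range(1, len(padded) - 1):
        is_local_max = padded[i] > padded[i - 1] and padded[i] >= padded[i + 1]
        if is_local_max and padded[i] >= floor:
            peaks += 1
    return peaks


# Hypothetical data: a two-level RTS (levels 0 and 1) buried in Gaussian noise,
# and a purely Gaussian record for comparison.
random.seed(1)
level, rts = 0.0, []
for _ in range(5000):
    if random.random() < 0.02:  # assumed switching probability per sample
        level = 1.0 - level
    rts.append(level + random.gauss(0.0, 0.1))
gauss_only = [random.gauss(0.0, 0.1) for _ in range(5000)]

bins = dict(lo=-0.55, hi=1.55, n_bins=21)
print(count_histogram_peaks(histogram(rts, **bins)))
print(count_histogram_peaks(histogram(gauss_only, **bins)))
```

In this toy setting, more than one histogram peak hints at a non-Gaussian, multi-level component, while a single peak is consistent with Gaussian noise alone; real records would likely need smoothing and a more careful threshold.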
-
Real and imaginary motion classification based on rough set analysis of EEG signals for multimedia applications
Publication: A rough set-based approach to the classification of EEG signals of real and imaginary motion is presented. The pre-processing and signal parametrization procedures are described, the rough set theory is briefly introduced, and several classification scenarios and parameters selection methods are proposed. Classification results are provided and discussed with their potential utilization for multimedia applications controlled by the...
-
Localization of impulsive disturbances in audio signals using template matching
Publication: In this paper, a new solution to the problem of elimination of impulsive disturbances from audio signals, based on the matched filtering technique, is proposed. The new approach stems from the observation that a large proportion of noise pulses corrupting audio recordings have highly repetitive shapes that match several typical “patterns”. In many cases a representative set of exemplary pulse waveforms can be extracted from the...
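A minimal sketch of locating pulses by template matching is given below, assuming a single known click template and a slowly varying background. It uses plain normalized cross-correlation with a fixed threshold, which is a simplification and not the authors' detector; the template shape, the threshold of 0.9, and the function names are hypothetical.

```python
import math


def normalized_correlation(signal, template):
    """Normalized cross-correlation of template with every window of signal."""
    m = len(template)
    t_mean = sum(template) / m
    t0 = [v - t_mean for v in template]
    t_norm = math.sqrt(sum(v * v for v in t0))
    scores = []
    for start in range(len(signal) - m + 1):
        window = signal[start:start + m]
        w_mean = sum(window) / m
        w0 = [v - w_mean for v in window]
        w_norm = math.sqrt(sum(v * v for v in w0))
        denom = t_norm * w_norm
        dot = sum(a * b for a, b in zip(w0, t0))
        scores.append(dot / denom if denom > 0 else 0.0)
    return scores


def locate_clicks(signal, template, threshold=0.9):
    """Start indices of windows whose correlation score reaches the threshold."""
    scores = normalized_correlation(signal, template)
    return [i for i, s in enumerate(scores) if s >= threshold]


# Hypothetical data: a low-level sine "recording" with two clicks of the same
# shape but different amplitudes inserted at samples 50 and 120.
template = [0.0, 1.0, -0.8, 0.5, -0.2, 0.0]
signal = [0.1 * math.sin(2 * math.pi * n / 40) for n in range(200)]
for pos, gain in [(50, 1.0), (120, 0.5)]:
    for k, v in enumerate(template):
        signal[pos + k] += gain * v

print(locate_clicks(signal, template))
```

Normalizing each window makes the score insensitive to pulse amplitude, so the weaker click is found with the same threshold as the stronger one.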
-
Systematic Literature Review for Emotion Recognition from EEG Signals
Publication: Researchers have recently become increasingly interested in recognizing emotions from electroencephalogram (EEG) signals and many studies utilizing different approaches have been conducted in this field. For the purposes of this work, we performed a systematic literature review including over 40 articles in order to identify the best set of methods for the emotion recognition problem. Our work collects information about the most...
-
Stress Detection of Children with Autism using Physiological Signals in Kaspar Robot-Based Intervention Studies
Publication: This study aims to develop a stress detection system using the blood volume pulse (BVP) signals of children with Autism Spectrum Disorder (ASD) during robot-based intervention. This study presents the heart rate variability (HRV) analysis method to detect the stress, where HRV features are extracted from raw BVP signals recorded from an E4 wristband during interaction studies with the social robot Kaspar. Low frequency power...
-
Physics augmented classification of fNIRS signals
Publication: Background. Predictive classification favours performance over semantics. In traditional predictive classification pipelines, feature engineering is often oblivious to the underlying phenomena. Hypothesis. In applied domains such as functional Near Infrared Spectroscopy (fNIRS), the exploitation of physical knowledge may improve the discriminative quality of our observation set. Aims. Give exemplary evidence that intervening the...
-
Novel approaches to wideband speech coding
Publication: Two methods of wideband speech coding are presented. The first uses an algorithm for time-scale compression and expansion of the speech signal, which allows wideband speech to be coded with standardized codecs. This method is intended for use in adaptive speech coding algorithms. The second proposed solution concerns a new method of estimating the spectral envelope of the speech signal...
-
Broadband interference in speech reinforcement systems
Publication: The article addresses the underestimated problem of how the number and distribution of loudspeakers in sound reinforcement systems affect the quality of voice transmission, that is, speech intelligibility in auditoria. The superposition of time-shifted broadband signals of the same shape and slightly different magnitudes, which reach the listener from numerous coherent sources, is accompanied by interference that leads to deep modification of the received signals...
-
Integration of speech enhancement and coding techniques
Publication
-
A system for multitask noisy speech enhancement.
Publication: The article gives a general description of the developed system for speech recording and reconstruction. It describes the components of the system, which is software containing advanced tools for improving speech intelligibility. The implemented tools make it possible to search for audio recordings and to process them using the implemented plug-ins. The article presents the algorithms used in the system...
-
Multitask Noisy Speech Enhancement System
Publication: The paper describes the Multitask Noisy Speech Enhancement System. It is a specialized software package designed for recording speech signals and for improving their quality and intelligibility using advanced digital signal processing procedures. The package consists of three programs: Rejestrator (Recorder), Przeglądarka (Browser), and Rekonstruktor (Reconstructor). The software can be used in cases where the intelligibility...
-
Difference in Perceived Speech Signal Quality Assessment Among Monolingual and Bilingual Teenage Students
Publication: The user perceived quality is a mixture of factors, including the background of an individual. The process of auditory perception is discussed in a wide variety of fields, ranging from engineering to medicine. Many studies examine the difference between musicians and non-musicians. Since musical training develops musical hearing and other various auditory capabilities, similar enhancements should be observable in case of bilingual...
-
Language material for English audiovisual speech recognition system development. Materiał językowy do wykorzystania w systemie audiowizualnego rozpoznawania mowy angielskiej
Publication: The bi-modal speech recognition system requires a 2-sample language input for training and for testing algorithms which precisely depicts natural English speech. For the purposes of the audio-visual recordings, a training data base of 264 sentences (1730 words without repetitions; 5685 sounds) has been created. The language sample reflects vowel and consonant frequencies in natural speech. The recording material reflects both the...
-
Plasma models, contribution matrix for detector setup and generated projections for plasma emissivity reconstruction in fusion devices
Open Research Data: The original plasma models for fusion devices, together with the complementary detector setup in the form of a contribution matrix and generated projections. Samples are packed inside Plasma Tomography Format (PTF) files, a format that is part of the Plasma Tomography in Fusion Devices Python package, and inside the general JSON format. The constructed dataset...
-
Automatic system for audio-video material reconstruction and archiving
Publication: The paper proposes a model of a system for automatic archiving and reconstruction of audio-video recordings. The aim of this solution is to make the reconstruction process less dependent on humans, in order to reduce the cost of reconstructing the processed recordings. Because of the large number of archival audio-video recordings, there is a need for a system that enables automatic indexing of their content. This will help...
-
Earthworks calculations due to reconstruction of railway geometrical layout
Publication: The paper describes the continuation of ongoing work on the earthworks part of the computer program MUGO. The program is related to the modernization of the railway track layout. The methodology for calculating the size of earthworks in the areas of embankments on a two-way railway line is discussed in detail.
-
Performance of CMS muon reconstruction in cosmic-ray events
Publication
-
Measuring quantum entanglement without prior state reconstruction.
Publication: It is shown that, for two qubits, the entanglement of formation can be measured without reconstructing the quantum state.
-
3D reconstruction seafloor from side-scan records
Publication: The article presents a way of using the Virtual Reality Modeling Language (VRML) for three-dimensional visualization of the seafloor. In particular, techniques for reconstructing a three-dimensional image from side-scan sonar data are presented.
-
Localization of impulsive disturbances in archive audio signals using predictive matched filtering
Publication: The problem of elimination of impulsive disturbances from archive audio signals is considered and its new solution, called predictive matched filtering, is proposed. The new approach is based on the observation that a large percentage of noise pulses corrupting archive audio recordings have highly repetitive shapes that match several typical “patterns”, called click templates. To localize noise pulses, click templates can be correlated...
-
COMPUTER SPEECH AND LANGUAGE
Journals
-
SEMINARS IN SPEECH AND LANGUAGE
Journals
-
Speech and Language Technology
Journals
-
Speech Language and Hearing
Journals
-
Quarterly Journal of Speech
Journals
-
SpringerBriefs in Speech Technology
Journals
-
Audiology and Speech Research
Journals
-
Voice and Speech Review
Journals
-
An Analysis of the Relationship between the Architecture and the Structure of a Vessel on the Example of the Reconstruction Design Process of the Historical Sailing Yacht "General Zaruski" Carried Out between 2009 and 2012
Publication: The article analyzes the relation between architecture and structure of a vessel on the example of the reconstruction design of a wooden sailing yacht "General Zaruski" built in Ekanäs, Sweden in 1939. Based on the documentation of "Kaparen" (sister yacht), "Mloda Gwardia" (ex "General Zaruski") and the reconstruction classification project made by the authors, the impact of functional, spatial and aesthetic design objectives (e.g....
-
A pilot study to assess manufacturing processes using selected point measures of vibroacoustic signals generated on a multitasking machine
Publication: The article presents a method for the evaluation of selected manufacturing processes using the analysis of vibration and sound signals. This method is based on the use of sensors installed outside the machining zone, which allows it to be used quickly and reliably in real production conditions. The article contains a developed measurement methodology based on the specific location of microphones and vibration transducers mounted on...
-
Application of Complementary Signals in Built-In Self Testers for Mixed-Signal Embedded Electronic Systems
Publication: This paper concerns the implementation of shape-designed complementary signals (CSs), matched to the frequency characteristic of the circuit under test, in built-in self testers (BISTs), dedicated to mixed-signal embedded electronic systems for testing their analog sections. The essence of the proposed method and solution of CS BIST is low-cost realization on the basis of hardware and software resources of microcontrollers used...
-
Signals of the 5G Standalone Radio Interface
Open Research Data: The research work conducted within the scope of NATO-STO (North Atlantic Treaty Organization – Science and Technology Organization) IST-187 group assumed investigation of the 5G gNodeB performance. The downlink (DL) signals of the FDD (Frequency Division Duplex) 5G-Standalone station were registered in isolated and controlled laboratory conditions....
-
Estimation of time-frequency complex phase-based speech attributes using narrow band filter banks
Publication: In this paper, we present nonlinear estimators of nonstationary and multicomponent signal attributes (parameters, properties) which are instantaneous frequency, spectral (or group) delay, and chirp-rate (also known as instantaneous frequency slope). We estimate all of these distributions in the time-frequency domain using both finite and infinite impulse response (FIR and IIR) narrow band filters for speech analysis. Then, we present...
-
Position Estimation in Mixed Indoor-Outdoor Environment Using Signals of Opportunity and Deep Learning Approach
Publication: To improve user localization in indoor and outdoor environments, a novel deep learning-based radiolocalization system designed to work in both is proposed. It is based on radio signatures built from radio signals of opportunity from LTE and WiFi networks. Measurements of channel state estimators from the LTE network and from the WiFi network are taken using the developed application....
-
Signals Features Extraction in Radioisotope Liquid-Gas Flow Measurements using Autocorrelation Function
Publication: Knowledge of the two-phase flow structure is essential for the proper conduct of industrial processes. Liquid-gas flow regimes can be described by analysing data in the time, frequency, or state-space domain. In this study, the autocorrelation function is applied to the analysis of signals obtained for liquid-gas flow by gamma-ray absorption. The experiments were carried out on the laboratory hydraulic...
-
A novel method of local chirp-rate estimation of LFM chirp signals in the time-frequency domain
Publication: In the paper, novel dynamic representations of a complex signal in the time-frequency domain are introduced. The proposed approach is based on using the gradient of the short-time Fourier transform complex phase. A channelized instantaneous complex frequency (CICF) and a complex local group delay (CLGD) are included in the presented signal representations. An application of the newly-introduced distributions is demonstrated by...
-
Ontological Modeling for Contextual Data Describing Signals Obtained from Electrodermal Activity for Emotion Recognition and Analysis
Publication: Most of the research in the field of emotion recognition is based on datasets that contain data obtained during affective computing experiments. However, each dataset is described by different metadata, stored in various structures and formats. This research can be counted among those whose aim is to provide a structural and semantic pattern for affective computing datasets, which is an important step to solve the problem of data...
-
A method of identification of RTS components in noise signals
Publication: The article presents an original method for extracting RTS (Random Telegraph Signal) noise, two-level or multilevel, from a noise signal. The assessment of the method's quality rests on the assumption that the instantaneous values of RTS noise have a non-Gaussian distribution, whereas the remaining part of the signal has a Gaussian distribution. The algorithm for identifying multilevel RTS noise in low-frequency noise signals is based on approximation...
-
Application of nonlinearity measures to chemical sensor signals
Publication: Resistance noise of gas sensors contains significant information that can be represented by more than its power spectral density alone. Analysing this noise with various nonlinearity measures can lead to a considerable increase in the selectivity and sensitivity of gas sensors. It was found that, for commercially available gas sensors, applying the bispectrum function provides additional information needed to detect different gases. Analysing...
-
Transmission of digital signals in a nonstationary hydroacoustic channel.
Publication: From a telecommunications point of view, the transmission properties of a hydroacoustic channel are limited by multiple reflections of the sound wave from the bottom and the water surface and by nonstationarity introduced mainly by the motion of the water surface. The article presents a model of the channel's transmission properties. Nonstationarity was introduced into the channel impulse responses by assuming random variability of the arrival times...
-
Influence of modulation detection threshold on speech intelligibility
Publication
-
New generation speech aid for stuttering people
Publication: Modern Digital Signal Processors (DSPs) are small, yet they can execute complex algorithms. An additional advantage is the ease of replacing their software and, consequently, of changing their field of application. Using the capabilities of these processors, it has become possible to build miniature hearing and speech prostheses. The paper focuses on issues related to the design and implementation of algorithms...