Filtry
wszystkich: 1398
wybranych: 1026
wyświetlamy 1000 najlepszych wyników Pomoc
Wyniki wyszukiwania dla: RECONSTRUCTION OF SPEECH SIGNALS
-
Bimodal classification of English allophones employing acoustic speech signal and facial motion capture
PublikacjaA method for automatic transcription of English speech into International Phonetic Alphabet (IPA) system is developed and studied. The principal objective of the study is to evaluate to what extent the visual data related to lip reading can enhance recognition accuracy of the transcription of English consonantal and vocalic allophones. To this end, motion capture markers were placed on the faces of seven speakers to obtain lip...
-
Computer vision techniques applied for reconstruction of seafloor 3D images from side scan and synthetic aperture sonars data
PublikacjaThe Side Scan Sonar and Synthetic Aperture Sonar are well known echo signal processing technologies that produce 2D images of the seafloor. Both systems combines a number of acoustic pings to form a high resolution image of seafloor. It was shown in numerous papers that 2D images acquired by such systems can be transformed into 3D models of seafloor surface by algorithmic approach using intensity information, contained in a grayscaled...
-
Method for Clustering of Brain Activity Data Derived from EEG Signals
PublikacjaA method for assessing separability of EEG signals associated with three classes of brain activity is proposed. The EEG signals are acquired from 23 subjects, gathered from a headset consisting of 14 electrodes. Data are processed by applying Discrete Wavelet Transform (DWT) for the signal analysis and an autoencoder neural network for the brain activity separation. Processing involves 74 wavelets from 3 DWT families: Coiflets,...
-
Chirp-rate estimation of FM signals in the time-frequency domain
PublikacjaNovel dynamic representations of a complex signal in the time-frequency domain including: a channelized instantaneous complex frequency (CICF), a complex local group delay (CLGD) and a channelized instantaneous chirp-rate (CICR) are introduced. The proposed approach is based on the use of the gradient of the short-time Fourier transform complex phase. An interpretation of the newly-introduced distributions especially of the CICR...
-
Parametric impulsive noise detector for corrupted audio signals based on hidden Markow model
PublikacjaThe paper addresses the problem of impulsive noise detection for audio signals. A structure of threshold parameter detectors using modelingof signals was introduced. the algorithm of the noise detection, based on discrete-time hidden Markow model (HMM)of whitened audio signal is elaborated
-
Suppression of distortions in signals received from Doppler sensor for vehicle speed measurement
PublikacjaDoppler sensors are commonly used for movement detection and speed measurement. However, electromagnetic interference and imperfections in sensor construction result in degradation of the signal to noise ratio. As a result, detection of signals reflected from moving objects becomes problematic. The paper proposes an algorithm for reduction of distortions and noise in the signal received from a simple, dual-channel type of a Doppler...
-
Noise profiling for speech enhancement employing machine learning models
PublikacjaThis paper aims to propose a noise profiling method that can be performed in near real-time based on machine learning (ML). To address challenges related to noise profiling effectively, we start with a critical review of the literature background. Then, we outline the experiment performed consisting of two parts. The first part concerns the noise recognition model built upon several baseline classifiers and noise signal features...
-
New perspective on the in vivo use of cold stress dynamic thermography in integumental reconstruction with the use of skin-muscle flaps
PublikacjaAmong the problems encountered by plastic surgeons is the reconstruction of defects following tumors. One of the reconstructive options is TRAM flap. Despite that anatomy is well-explored, marginal flap necrosis may develop. To minimize complications imaging examinations was designed to determine the degree of flap perfusion. One of them is the thermographic examination.
-
Elimination of Impulsive Disturbances From Archive Audio Signals Using Bidirectional Processing
PublikacjaIn this application-oriented paper we consider the problem of elimination of impulsive disturbances, such as clicks, pops and record scratches, from archive audio recordings. The proposed approach is based on bidirectional processing—noise pulses are localized by combining the results of forward-time and backward-time signal analysis. Based on the results of specially designed empirical tests (rather than on the results of theoretical analysis),...
-
Diagnosing wind turbine condition employing a neural network to the analysis of vibroacoustic signals
PublikacjaIt is important from the economic point of view to detect damage early in the wind turbines before failures occur. For this purpose, a monitoring device was built that analyzes both acoustic signals acquired from the built-in non-contact acoustic intensity probe, as well as from the accelerometers, mounted on the internal devices in the nacelle. The signals collected in this way are used for long-term training of the autoencoder...
-
Improving the Accuracy of Automatic Reconstruction of 3D Complex Buildings Models from Airborne Lidar Point Clouds
PublikacjaDue to high requirements of variety of 3D spatial data applications with respect to data amount and quality, automatized, effcient and reliable data acquisition and preprocessing methods are needed. The use of photogrammetry techniques—as well as the light detection and ranging (LiDAR) automatic scanners—are among attractive solutions. However, measurement data are in the form of unorganized point clouds, usually requiring transformation...
-
Comparison of Methods for Real and Imaginary Motion Classification from EEG Signals
PublikacjaA method for feature extraction and results of classification of EEG signals obtained from performed and imagined motion are presented. A set of 615 features was obtained to serve for the recognition of type and laterality of motion using 8 different classifications approaches. A comparison of achieved classifiers accuracy is presented in the paper, and then conclusions and discussion are provided. Among applied algorithms the...
-
Intelligent processing of stuttered speech.
PublikacjaW artykule zaprezentowano kilka metod analizy i automatycznego zliczania potknięć artykulacyjnych, związanych z jąkaniem się, opartych na wykorzystaniu algorytmów uczących się sztucznych sieci neuronowych i zbiorów przybliżonych.
-
Simulation of incremental encoder signals
PublikacjaPrzedstawiono generator sygnału impulsowego do symulacji sygnału z przetwornika obrotowo-impulsowego w stanach przejściowych. Omówiono algorytmy wyznaczenia przedziałów międzyimpulsowych dla trzech rodzajów zmian prędkości obrotowej: liniowej, wykładniczej oraz sinusoidalnej. Przeanalizowano błędy kwantowania wynikające z cyfrowej realizacji generatora.
-
Quality Evaluation of Speech Transmission via Two-way BPL-PLC Voice Communication System in an Underground Mine
PublikacjaIn order to design a stable and reliable voice communication system, it is essential to know how many resources are necessary for conveying quality content. These parameters may include objective quality of service (QoS) metrics, such as: available bandwidth, bit error rate (BER), delay, latency as well as subjective quality of experience (QoE) related to user expectations. QoE is expressed as clarity of speech and the ability...
-
Signals features extraction in radioisotope liquid-gas flow measurements using wavelet analysis
PublikacjaKnowledge of the structure of a flow is significant for the proper conduct of a number of industrial processes. In this case, a description of a two-phase flow regimes is possible by use of the time-series analysis in time, frequency and state-space domain. In this article the Discrete Wavelet Transform (DWT) is applied for analysis of signals obtained for water-air flow using gamma ray absorption. The presented method was illustrated...
-
Analysis of the Ways to Identify Rail Running Surface Defects by Means of Vibration Signals
PublikacjaTh e article discusses a preliminary concept of a method enabling the identifi cation of chosen rail running surface defects, such as squats, spalling, and running surface defects, by analysing the parameters of vibration signals. It features a description of the methodology of the conducted tests, the scope thereof, and the selection of the measurement points with specifi c defect types. Th e article covers selected results of...
-
Sparse vector autoregressive modeling of audio signals and its application to the elimination of impulsive disturbances
PublikacjaArchive audio files are often corrupted by impulsive disturbances, such as clicks, pops and record scratches. This paper presents a new method for elimination of impulsive disturbances from stereo audio signals. The proposed approach is based on a sparse vector autoregressive signal model, made up of two components: one taking care of short-term signal correlations, and the other one taking care of long-term correlations. The method...
-
Comparison of Language Models Trained on Written Texts and Speech Transcripts in the Context of Automatic Speech Recognition
Publikacja -
An EIT reconstruction algorithm based on noisy data.
PublikacjaPraca przedstawia algorytm rekonstrukcji oparty o zmodyfikowany algorytm Gaussa - Newtona. Algorytm uwzględnia istnienie elektrod pomiarowych w tomografii elektroimpedancyjnej. Elektrody charakteryzują się rozmiarem i impedancją. Dodatkowo algorytm zakłada istnienie szumu w sygnale mierzonym. Zostało pokazane, że dobór optymalnego wzorca pobudzenia znacząco poprawia odporność algorytmu rekonstrukcyjnego na szum w danych. Dwie...
-
Reconstruction of input signal of sensor with frequency output
Publikacja -
Time reconstruction and performance of the CMS electromagnetic calorimeter
Publikacja -
Detection and Direction-of-Arrival Estimation of Weak Spread Spectrum Signals Received with Antenna Array
PublikacjaThis paper presents a method for the joint detection and direction of arrival (DOA) estimation of low probability of detection (LPD) signals. The proposed approach is based on using the antenna array to receive spread-spectrum signals hidden below the noise floor. Array processing exploits the spatial correlation between phase-delayed copies of the signal and allows us to evaluate the parameter used to make the decision about the...
-
New First - Path Detector for LTE Positioning Reference Signals
PublikacjaIn today's world, where positioning applications reached a huge popularity and became virtually ubiquitous, there is a strong need for determining a device location as accurately as possible. A particularly important role in positioning play cellular networks, such as Long Term Evolution (LTE). In the LTE Observed Time Difference of Arrival (OTDOA) positioning method, precision of device location estimation depends on accuracy...
-
Propagation of initially sawtooth periodic and impulsive signals in a quasi-isentropic magnetic gas
PublikacjaThe characteristics of propagation of sawtooth periodic and impulsive signals at a transducer are analytically studied in this work. A plasma under consideration is motionless and uniform at equilibrium, and its perturbations are described by a system of ideal magnetohydrodynamic equations. Some generic heating/cooling function, which in turn depends on equilibrium thermodynamic parameters, may destroy adiabaticity of a flow and...
-
Direct modulation for conventional matrix converters using analytical signals and barycentric coordinates
PublikacjaThis paper proposes the generalized direct modulation for Conventional Matrix Converters (CMC) using the concept of analytical signals and barycentric coordinates. The paper proposes a novel approach to the Pulse Width Modulation (PWM) duty cycle computing, which allows faster prototyping of direct control algorithms. The explanation of the new idea using analytical considerations demonstrating the principles of direct voltage...
-
Methods for quality improvement of multibeam and LiDAR point cloud data in the context of 3D surface reconstruction
PublikacjaPoint cloud dataset is the transitional data model used in several marine and land remote-sensing applications. During further steps of processing, the transformation of point cloud spatial data to more complex models containing higher order geometric structures like edges and facets may be possible, if an appropriate quality level of input data is provided. Point cloud datasets usually contain a considerable amount of undesirable...
-
Investigations of the Methods of Time Delay Measurement of Stochastic Signals Using Cross-correlation with the Hilbert Transform
PublikacjaThe article presents the results of simulation studies of four methods of estimating time delay for random signals using cross-correlation with the Hilbert Transform. Selected models of mutually delayed stochastic signals were used in the simulations, corresponding to the signals obtained from scintillation detectors in radioisotope measurements of liquid-gas two-phase flow. Standard deviations of the values of the individual functions...
-
Analysis of Vibration and Acoustic Signals for Noncontact Measurement of Engine Rotation Speed
PublikacjaThe non-contact measurement of engine speed can be realized by analyzing engine vibration frequency. However, the vibration signal is distorted by harmonics and noise in the measurement. This paper presents a novel method for the measurement of engine rotation speed by using the cross-correlation of vibration and acoustic signals. This method can enhance the same frequency components in engine vibration and acoustic signal. After...
-
Global Optimization for Recovery of Clipped Signals Corrupted With Poisson-Gaussian Noise
PublikacjaWe study a variational formulation for reconstructing nonlinearly distorted signals corrupted with a Poisson-Gaussian noise. In this situation, the data fidelity term consists of a sum of a weighted least squares term and a logarithmic one. Both of them are precomposed by a nonlinearity, modelling a clipping effect, which is assumed to be rational. A regularization term, being a piecewise rational approximation of the ℓ0 function...
-
Cross-Lingual Knowledge Distillation via Flow-Based Voice Conversion for Robust Polyglot Text-to-Speech
PublikacjaIn this work, we introduce a framework for cross-lingual speech synthesis, which involves an upstream Voice Conversion (VC) model and a downstream Text-To-Speech (TTS) model. The proposed framework consists of 4 stages. In the first two stages, we use a VC model to convert utterances in the target locale to the voice of the target speaker. In the third stage, the converted data is combined with the linguistic features and durations...
-
Rough Set-Based Classification of EEG Signals Related to Real and Imagery Motion
PublikacjaA rough set-based approach to classification of EEG signals registered while subjects were performing real and imagery motions is presented in the paper. The appropriate subset of EEG channels is selected, the recordings are segmented, and features are extracted, based on time-frequency decomposition of the signal. Rough set classifier is trained in several scenarios, comparing accuracy of classification for real and imagery motion....
-
An automatic system for identification of random telegraph signal (RTS) noise in noise signals
PublikacjaIn the paper the automatic and universal system for identification of Random Telegraph Signal (RTS) noise as a non-Gaussian component of the inherent noise signal of semiconductor devices is presented. The system for data acquisition and processing is described. Histograms of the instantaneous values of the noise signals are calculated as the basis for analysis of the noise signal to determine the number of local maxima of histograms...
-
Comparison of Classification Methods for EEG Signals of Real and Imaginary Motion
PublikacjaThe classification of EEG signals provides an important element of brain-computer interface (BCI) applications, underlying an efficient interaction between a human and a computer application. The BCI applications can be especially useful for people with disabilities. Numerous experiments aim at recognition of motion intent of left or right hand being useful for locked-in-state or paralyzed subjects in controlling computer applications....
-
Real and imaginary motion classification based on rough set analysis of EEG signals for multimedia applications
PublikacjaRough set-based approach to the classification of EEG signals of real and imaginary motion is presented. The pre-processing and signal parametrization procedures are described, the rough set theory is briefly introduced, and several classification scenarios and parameters selection methods are proposed. Classification results are provided and discussed with their potential utilization for multimedia applications controlled by the...
-
Localization of impulsive disturbances in audio signals using template matching
PublikacjaIn this paper, a new solution to the problem of elimination of impulsive disturbances from audio signals, based on the matched filtering technique, is proposed. The new approach stems from the observation that a large proportion of noise pulses corrupting audio recordings have highly repetitive shapes that match several typical “patterns”. In many cases a representative set of exemplary pulse waveforms can be extracted from the...
-
Systematic Literature Review for Emotion Recognition from EEG Signals
PublikacjaResearchers have recently become increasingly interested in recognizing emotions from electroencephalogram (EEG) signals and many studies utilizing different approaches have been conducted in this field. For the purposes of this work, we performed a systematic literature review including over 40 articles in order to identify the best set of methods for the emotion recognition problem. Our work collects information about the most...
-
Systematic Literature Review for Emotion Recognition from EEG Signals
PublikacjaResearchers have recently become increasingly interested in recognizing emotions from electroencephalogram (EEG) signals and many studies utilizing different approaches have been conducted in this field. For the purposes of this work, we performed a systematic literature review including over 40 articles in order to identify the best set of methods for the emotion recognition problem. Our work collects information about the most...
-
Stress Detection of Children with Autism using Physiological Signals in Kaspar Robot-Based Intervention Studies
PublikacjaThis study aims to develop a stress detection system using the blood volume pulse (BVP) signals of children with Autism Spectrum Disorder (ASD) during robot-based interven- tion. This study presents the heart rate variability (HRV) analysis method to detect the stress, where HRV features are extracted from raw BVP signals recorded from an E4 wristband during interaction studies with the social robot Kaspar. Low frequency power...
-
Broadband interference in speech reinforcement systems
PublikacjaArtykuł podejmuje niedoceniany problem wpływu liczby i rozkładu głośników w systemach nagłośnienia, na jakość przekazu głosowego, czyli na zrozumiałość mowy w audytoriach. Superpozycji przesuniętych w czasie szerokopasmowych sygnałów o tym samym kształcie i lekko różnych wielkościach, które docierają do słuchacza z licznych spójnych źródeł, towarzyszy zjawisko interferencji prowadzące do głębokiej modyfikacji odbieranych sygnałów...
-
Integration of speech enhancement and coding techniques
Publikacja -
Novel approaches to wideband speech coding
PublikacjaDwie metoda kodowania szerokopasmowego mowy zostały zaprezentowane. W pierwszej metodzie wykorzystano algorytm kompresji i ekspansji czasowej sygnału mowy, pozwalający na kodowanie szerokopasmowe sygnału mowy z wykorzystaniem ustandaryzowanych kodeków. Metoda ta jest przewidziana do zastosowania w adaptacyjnych algorytmach kodowania mowy. Drugie z proponowanych rozwiazan dotyczy nowej metody estymacji obwiedni widma sygnalu mowy...
-
A system for multitask noisy speech enhancement.
PublikacjaW artykule przedstawiono ogolną charakterystyke opracowanego systemu rejestracji i rekonstrukcji mowy. Artykuł zawiera opis składników systemu, ktory jest oprogramowaniem zawierającym zaawansowane narzędzia służące poprawie zrozumiałości mowy. Zaimplementowane narzędzia systemu umożliwiają wyszukiwanie nagrań dźwiękowych i ich obróbkę przy pomocy zaimplementowanych pluginów. W artykule przedstawione wykorzystane w systemie algorytmy...
-
Multitask Noisy Speech Enhancement System
PublikacjaW referacie opisano Wielozadaniowy System Poprawy Jakości Sygnału Mowy. Jest to wyspecjalizowany pakiet oprogramowania przeznaczony do rejestrowania sygnału mowy i do poprawy jego jakości oraz zrozumiałości mowy, przy użyciu zaawansowanych procedur cyfrowego przetwarzania sygnału. Pakiet oprogramowania składa się z programów: Rejestrator, Przeglądarka oraz Rekonstruktor. Oprogramowanie to może być użyte w przypadkach, gdy zrozumiałość...
-
Physics augmented classification of fNIRS signals
PublikacjaBackground. Predictive classification favours performance over semantics. In traditional predictive classification pipelines, feature engineering is often oblivious to the underlying phenomena. Hypothesis. In applied domains such as functional Near Infrared Spectroscopy (fNIRS), the exploitation of physical knowledge may improve the discriminative quality of our observation set. Aims. Give exemplary evidence that intervening the...
-
Difference in Perceived Speech Signal Quality Assessment Among Monolingual and Bilingual Teenage Students
PublikacjaThe user perceived quality is a mixture of factors, including the background of an individual. The process of auditory perception is discussed in a wide variety of fields, ranging from engineering to medicine. Many studies examine the difference between musicians and non-musicians. Since musical training develops musical hearing and other various auditory capabilities, similar enhancements should be observable in case of bilingual...
-
Language material for English audiovisual speech recognition system developmen . Materiał językowy do wykorzystania w systemie audiowizualnego rozpoznawania mowy angielskiej
PublikacjaThe bi-modal speech recognition system requires a 2-sample language input for training and for testing algorithms which precisely depicts natural English speech. For the purposes of the audio-visual recordings, a training data base of 264 sentences (1730 words without repetitions; 5685 sounds) has been created. The language sample reflects vowel and consonant frequencies in natural speech. The recording material reflects both the...
-
Measuring quantum entanglement without prior state reconstruction.
PublikacjaWykazano, że w przypadku dwóch kubitów splątanie formowania może być mierzone bez rekonstrukcji stanu kwantowego.
-
Earthworks calculations due to reconstruction of railway geometrical layout
PublikacjaThe paper characterizes continuation of ongoing work of computer program MUGO which is connected with earthworks. The program is related to the modernization of the railway track layout. The methodology for the calculating the size of earthworks in the areas of embankments on the two way railway line is discussed in detail.
-
3D reconstruction seafloor from side-scan records
PublikacjaArtykuł przedstawia sposób wykorzystania języka opisu wirtualnej rzeczywistości (VRML) do trójwymiarowej wizualizacji dna morskiego. W szczególności zaprezentowano techniki rekonstrukcji trójwymiarowego obrazu z danych pochodzących z sonaru bocznego.