Search results for: SPEECH SIGNAL PARAMETERIZATION - Bridge of Knowledge


  • Active Control of Highly Autocorrelated Machinery Noise in Multivariate Nonminimum Phase Systems

    In this paper, a novel multivariate active noise control scheme, designed to attenuate disturbances with high autocorrelation characteristics and preserve background signals, is proposed. The algorithm belongs to the class of feedback controllers and, unlike the popular feedforward FX-LMS approach, does not require availability of a reference signal. The proposed approach draws its inspiration from the iterative learning control...

    Full text available to download

  • Creating new voices using normalizing flows

    Publication
    • P. Biliński
    • T. Merritt
    • A. Ezzerg
    • K. Pokora
    • S. Cygert
    • K. Yanagisawa
    • R. Barra-Chicote
    • D. Korzekwa

    - Year 2022

    Creating realistic and natural-sounding synthetic speech remains a big challenge for voice identities unseen during training. As there is growing interest in synthesizing voices of new speakers, here we investigate the ability of normalizing flows in text-to-speech (TTS) and voice conversion (VC) modes to extrapolate from speakers observed during training to create unseen speaker identities. Firstly, we create an approach for TTS...

    Full text available to download

  • PHONEME DISTORTION IN PUBLIC ADDRESS SYSTEMS

    Publication

    - Year 2015

    The quality of voice messages in speech reinforcement and public address systems is often poor. The sound engineering projects of such systems take care of sound intensity and possible reverberation phenomena in public space without, however, considering the influence of acoustic interference related to the number and distribution of loudspeakers. This paper presents the results of measurements and numerical simulations of the...

  • Towards classification of patients based on surface EMG data of temporomandibular joint muscles using self-organising maps

    Publication

    - Biomedical Signal Processing and Control - Year 2022

    The study considers the need for an effective method of classification of patients with a temporomandibular joint disorder (TMD). The self-organising map method (SOM) was applied to group patients and used together with the cross-correlation approach to interpret the processed (rectified and smoothed by using root mean square (RMS) algorithm) surface electromyography signal (sEMG) obtained from testing the muscles (two temporal...

    Full text available to download
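The "rectified and smoothed by using root mean square (RMS) algorithm" step mentioned in this abstract can be illustrated with a sliding-window RMS envelope. This is only a minimal sketch: the function name `moving_rms`, the window length, and the sample values are hypothetical and are not taken from the study.

```python
import math

def moving_rms(samples, window):
    """Return the sliding-window RMS envelope of a signal.

    Squaring each sample acts as the rectification; the square root of the
    windowed mean of squares is the smoothed RMS value.
    `window` is a hypothetical length in samples, not a value from the study.
    """
    if window < 1 or window > len(samples):
        raise ValueError("window must be between 1 and len(samples)")
    squares = [s * s for s in samples]
    running = sum(squares[:window])          # sum of squares in the first window
    envelope = [math.sqrt(running / window)]
    for i in range(window, len(squares)):
        # Slide the window by one sample: add the new square, drop the oldest.
        running += squares[i] - squares[i - window]
        envelope.append(math.sqrt(max(running, 0.0) / window))
    return envelope

# Made-up sEMG-like samples, for illustration only.
semg = [0.0, 3.0, -4.0, 0.0, 3.0, -4.0]
print(moving_rms(semg, 2))
```

The output has `len(samples) - window + 1` values, one per full window, so no edge padding is assumed.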

  • Audio Feature Analysis for Precise Vocalic Segments Classification in English

    Publication

    An approach to identifying the most meaningful Mel-Frequency Cepstral Coefficients representing selected allophones and vocalic segments for their classification is presented in the paper. For this purpose, experiments were carried out using algorithms such as Principal Component Analysis, Feature Importance, and Recursive Parameter Elimination. The data used were recordings made within the ALOFON corpus containing audio signal...

    Full text to download in external service

  • Application of the neural networks for developing new parametrization of the Tersoff potential for carbon

    Publication

    - TASK Quarterly - Year 2020

    Penta-graphene (PG) is a 2D carbon allotrope composed of a layer of pentagons having sp2- and sp3-bonded carbon atoms. A study carried out in 2018 showed that the parameterization of the Tersoff potential proposed in 2005 by Erhart and Albe (T05 potential) performs better than other potentials available for carbon, being able to reproduce structural and mechanical properties of the PG. In this work, we tried to improve the...

    Full text available to download

  • Physics-Based Coarse-Grained Modeling in Bio- and Nanochemistry

    Publication
    • A. Liwo
    • A. K. Sieradzan
    • A. S. Karczyńska
    • E. Lubecka
    • S. A. Samsonov
    • C. Czaplewski
    • P. Krupa
    • M. Mozolewska

    - Year 2021

    Coarse-grained approaches, in which groups of atoms are represented by single interaction sites, are very important in biological and materials sciences because they make it possible to cover size- and time-scales several orders of magnitude larger than those accessible to all-atom simulations, while largely keeping the details of the systems studied. The coarse-grained approaches differ by the scheme of reduction and by the origin...

    Full text to download in external service

  • Auditory-visual attention stimulator

    A new approach to lateralization irregularities formation is proposed. The emphasis is put on the relationship between visual and auditory attention stimulation. In this approach hearing is stimulated using time-scale-modified speech and sight is stimulated by rendering the text of the currently heard speech. Moreover, the displayed text is modified using several techniques, e.g., zooming and highlighting. In the experimental part of...

    Full text to download in external service

  • INVESTIGATION OF THE LOMBARD EFFECT BASED ON A MACHINE LEARNING APPROACH

    Publication

    The Lombard effect is an involuntary increase in the speaker’s pitch, intensity, and duration in the presence of noise. It makes it possible to communicate in noisy environments more effectively. This study aims to investigate an efficient method for detecting the Lombard effect in uttered speech. The influence of interfering noise, room type, and the gender of the person on the detection process is examined. First, acoustic parameters...

    Full text available to download

  • Impact of maintenance of floodplains of the Vistula River on high water levels on the section from Włocławek to Toruń

    Publication

    This article describes the methodology of hydraulic calculations to estimate the water levels in open channels for steady gradually varied flow. The presented method has been used to analyse the water level on the Vistula River from Włocławek cross-section to Toruń cross-section. The HEC-RAS modelling system has been used for parameterization of the river channel and floodplains, as well as for flow simulation. The results obtained...

    Full text available to download

  • Theoretical calculation of the physico-chemical properties of 1-butyl-4-methylpyridinium based ionic liquids

    Publication

    Ionic liquids (ILs) have attracted much attention for their unique physicochemical properties, which can be designed as needed by altering the ion combinations. Besides experimental work, numerous computational studies have been concerned with prediction of physical properties of ILs. The results of molecular dynamics simulations of ILs depend strongly on the proper force field parameterization. Classical force fields...

    Full text available to download

  • Study Analysis of Transmission Efficiency in DAB+ Broadcasting System

    Publication

    - Year 2018

    DAB+ is a very innovative and universal multimedia broadcasting system. Thanks to its updated multimedia technologies and metadata options, digital radio keeps pace with changing consumer expectations and the impact of media convergence. Broadcasting analog and digital radio services does vary, concerning devices on both transmitting and receiving side, as well as content processing mechanisms. However, the biggest difference is...

    Full text available to download

  • Camera angle invariant shape recognition in surveillance systems

    Publication

    A method for human action recognition in surveillance systems is described. Problems within this task are discussed and a solution based on 3D object models is proposed. The idea is shown and some of its limitations are discussed. Shape description methods are introduced along with their main features. The parameterization algorithm used is presented. The classification problem, restricted to binary cases, is discussed. Support vector...

  • Voiceless Stop Consonant Modelling and Synthesis Framework Based on MISO Dynamic System

    Publication

    A voiceless stop consonant phoneme modelling and synthesis framework based on a phoneme modelling in low-frequency range and high-frequency range separately is proposed. The phoneme signal is decomposed into the sums of simpler basic components and described as the output of a linear multiple-input and single-output (MISO) system. The impulse response of each channel is a third order quasi-polynomial. Using this framework, the...

    Full text available to download

  • Prof. Haitham Abu-Rub - A Visit to Poland's Gdansk University of Technology

    Report on visit of Prof. Haitham Abu-Rub in Gdansk University of Technology. Speech on the Smart Grid Centre. Visit in the new smart grid laboratory of the GUT, the Laboratory for Innovative Power Technologies and Integration of Renewable Energy Sources (LINTE^2).

    Full text available to download

  • Mispronunciation Detection in Non-Native (L2) English with Uncertainty Modeling

    Publication

    - Year 2021

    A common approach to the automatic detection of mispronunciation in language learning is to recognize the phonemes produced by a student and compare them to the expected pronunciation of a native speaker. This approach makes two simplifying assumptions: a) phonemes can be recognized from speech with high accuracy, b) there is a single correct way for a sentence to be pronounced. These assumptions do not always hold, which can result...

    Full text to download in external service

  • Modeling and Designing Acoustical Conditions of the Interior – Case Study

    The primary aim of this research study was to model acoustic conditions of the Courtyard of the Gdańsk University of Technology Main Building, and then to design a sound reinforcement system for this interior. First, results of measurements of the parameters of the acoustic field are presented. Then, the comparison between measured and predicted values using the ODEON program is shown. Collected data indicate a long reverberation...

    Full text available to download

  • A comparative study of English viseme recognition methods and algorithm

    An elementary visual unit, the viseme, is considered in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative study of their effectiveness. In the course of the study an optimal feature vector...

    Full text available to download

  • A comparative study of English viseme recognition methods and algorithms

    An elementary visual unit, the viseme, is considered in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative study of their effectiveness. In the course of the study an optimal feature vector construction...

    Full text available to download

  • Continuous wave sonar with hyperbolic frequency modulation keyed by pseudo-random sequence

    Publication

    A CW FM type sounding signal is used in the classical solution of silent sonar. While the signal provides a relatively simple implementation of digital signal processing, and ensures good detection conditions, unfortunately, in the presence of the Doppler effect, distance measurement results tend to be wrong. This is due to the fact that the received signal’s instantaneous frequency value is dependent both on the distance to the...

    Full text available to download
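The range error that the Doppler effect causes in a CW FM sonar can be illustrated with the textbook linear-sweep case, where a Doppler shift is indistinguishable from the beat frequency produced by extra distance. This is a sketch under stated assumptions: it models a linear frequency sweep, not the hyperbolic modulation of the paper, and the function name and all numeric values are hypothetical.

```python
def doppler_range_error(speed, carrier_hz, sweep_hz, sweep_s, sound_speed=1500.0):
    """Range error (metres) of a linear-sweep CW FM sonar for a moving target.

    Assumptions (not from the paper): linear sweep of `sweep_hz` over
    `sweep_s` seconds, radial target speed `speed` in m/s, and a nominal
    underwater sound speed of 1500 m/s.
    """
    # Two-way Doppler shift of the echo.
    doppler_hz = 2.0 * speed * carrier_hz / sound_speed
    # Beat frequency produced per metre of range: sweep rate times the
    # two-way delay per metre.
    hz_per_metre = (sweep_hz / sweep_s) * (2.0 / sound_speed)
    # The receiver misreads the Doppler shift as additional range.
    return doppler_hz / hz_per_metre

# Hypothetical example: 1 m/s target, 10 kHz carrier, 2 kHz sweep in 1 s.
print(doppler_range_error(speed=1.0, carrier_hz=10_000.0, sweep_hz=2_000.0, sweep_s=1.0))
```

In this simplified model the error reduces to `speed * carrier_hz * sweep_s / sweep_hz`, so it does not depend on the sound speed and grows with longer or narrower sweeps.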

  • Audio Content and Crowdsourcing: A Subjective Quality Evaluation of Radio Programs Streamed Online

    Publication

    - Year 2023

    Radio broadcasting has been present in our lives for over 100 years. The transmission of speech and music signals accompanies us from an early age. Broadcasts provide the latest information from home and abroad. They also shape musical tastes and allow many artists to share their creativity. Modern distribution involves transmission over a number of terrestrial systems. The most popular are analog FM (Frequency Modulation) and...

    Full text to download in external service

  • Determining the geometry modelling parameters of crossings in ordinary turnouts for the construction and maintenance of railway lines (Określenie parametrów modelowania geometrii krzyżownic rozjazdów zwyczajnych dla potrzeb budowy i utrzymania linii kolejowych)

    The vast majority of turnouts on railway lines in Poland are ordinary turnouts with a typical set of parameters. For this reason, the analysis of atypical cases (such as turnouts with variable curvature of the diverging track) can be difficult. A virtual geometric and structural model of a turnout, generated using analytical methods, can be a useful tool in the design, construction and diagnostics...

    Full text to download in external service

  • Analogue GPS repeater

    Publication

    - Year 2010

    This article concerns the problem of difficulty in correct indoor Global Positioning System (GPS) signals reception due to attenuation. Radio signal repeaters are proposed as means to solve this problem. Firstly, the GPS signal characteristics are described, emphasizing their low power in the point of reception. In the second part, various applications of GPS signal repeater, where indoor GPS signals reception is required, are...

  • Examining Classifiers Applied to Static Hand Gesture Recognition in Novel Sound Mixing System

    The main objective of the chapter is to present the methodology and results of examining various classifiers (Nearest Neighbor-like algorithm with non-nested generalization (NNge), Naive Bayes, C4.5 (J48), Random Tree, Random Forests, Artificial Neural Networks (Multilayer Perceptron), Support Vector Machine (SVM)) used for static gesture recognition. A problem of effective gesture recognition is outlined in the context of the system...

    Full text to download in external service

  • Music Mood Visualization Using Self-Organizing Maps

    Publication

    Due to an increasing amount of music being made available in digital form on the Internet, an automatic organization of music is sought. The paper presents an approach to graphical representation of the mood of songs based on Self-Organizing Maps. Parameters describing the mood of music are proposed and calculated and then analyzed employing correlation with mood dimensions based on Multidimensional Scaling. A map is created in which...

    Full text available to download

  • Efficiency of IEEE 802.15.4a UWB Impulse Radio Spectrum Shaping

    Publication

    - Year 2010

    This paper presents results of impulse radio signal spectrum shaping efficiency investigations. Basic parameters of IEEE 802.15.4a UWB signal and outline of proposed spectrum shaping methods are briefly described. The main part of the paper presents influence of signal and algorithms parameters on the results of spectrum shaping.

  • Suppression of distortions in signals received from Doppler sensor for vehicle speed measurement

    Publication

    - Year 2018

    Doppler sensors are commonly used for movement detection and speed measurement. However, electromagnetic interference and imperfections in sensor construction result in degradation of the signal to noise ratio. As a result, detection of signals reflected from moving objects becomes problematic. The paper proposes an algorithm for reduction of distortions and noise in the signal received from a simple, dual-channel type of a Doppler...

    Full text available to download

  • Arm EMG Wavelet-Based Denoising System

    This paper presents research results of muscle EMG signal denoising. Two muscles were examined at the same time: an adductor muscle (biceps brachii) and an abductor muscle (triceps brachii). The EMG signal was filtered using the wavelet transform technique, with the following crucial parameters selected: wavelet basis function (Daubechies 4), 10th decomposition level, threshold selection algorithm (Heuristic) and a sln rescaling...

    Full text to download in external service

  • Reduction of parasitic pitch variations in archival musical recordings

    A new method for reducing parasitic pitch variations in archival audio recordings is presented. The method is intended for analyzing movie soundtracks recorded in optical films. It utilizes image processing for calculating and reducing effects of tape shrinkage being one of the main reasons for parasitic pitch variations in audio accompanying moving images. As long as the film tape characteristics are known the new method can be...

    Full text available to download

  • A self-optimization mechanism for generalized adaptive notch smoother

    Publication

    Tracking of nonstationary narrowband signals is often accomplished using algorithms called adaptive notch filters (ANFs). Generalized adaptive notch smoothers (GANSs) extend the concepts of adaptive notch filtering in two directions. Firstly, they are designed to estimate coefficients of nonstationary quasi-periodic systems, rather than signals. Secondly, they employ noncausal processing, which greatly improves their accuracy and...

    Full text to download in external service

  • Matrix-based robust joint fingerprinting and decryption method for multicast distribution of multimedia

    Publication

    This paper addresses the problem of unauthorized redistribution of multimedia content by malicious users (pirates). The solution proposed here is a new joint fingerprinting and decryption method which meets the requirements for both imperceptibility and robustness of fingerprints and scalability in terms of design and distribution of fingerprinted multimedia content. The proposed method uses a simple block cipher based on matrix...

    Full text to download in external service

  • Adaptive identification of sparse underwater acoustic channels with a mix of static and time-varying parameters

    Publication

    - SIGNAL PROCESSING - Year 2022

    We consider identification of sparse linear systems with a mix of static and time-varying parameters. Such systems are typical in underwater acoustics (UWA), for instance, in applications requiring identification of the acoustic channel, such as UWA communications, navigation and continuous-wave sonar. The recently proposed fast local basis function (fLBF) algorithm provides high performance when identifying time-varying systems....

    Full text available to download

  • Finite-window RLS algorithms

    Publication

    - SIGNAL PROCESSING - Year 2022

    Two recursive least-squares (RLS) adaptive filtering algorithms are most often used in practice, the exponential and sliding (rectangular) window RLS algorithms. This popularity is mainly due to existence of low-complexity versions of these algorithms. However, these two windows are not always the best choice for identification of fast time-varying systems, when the identification performance is most important. In this paper, we...

    Full text available to download
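The exponential-window RLS algorithm named in this abstract can be sketched as follows. This is a generic textbook form, not the finite-window algorithms proposed in the paper; the function name `rls_exponential`, the forgetting factor, the initial covariance scale `delta`, and the test data are all hypothetical.

```python
import math

def rls_exponential(regressors, outputs, forgetting=0.98, delta=1000.0):
    """Exponentially weighted recursive least squares (textbook form).

    Estimates theta in y(t) = phi(t)^T theta + noise. `forgetting` and
    `delta` are illustrative defaults, not values from the paper.
    """
    n = len(regressors[0])
    theta = [0.0] * n
    # Large initial covariance: little confidence in the zero initial guess.
    P = [[delta if i == j else 0.0 for j in range(n)] for i in range(n)]
    for phi, y in zip(regressors, outputs):
        p_phi = [sum(P[i][j] * phi[j] for j in range(n)) for i in range(n)]
        denom = forgetting + sum(phi[i] * p_phi[i] for i in range(n))
        gain = [v / denom for v in p_phi]
        # A priori prediction error, computed with the previous estimate.
        error = y - sum(phi[i] * theta[i] for i in range(n))
        theta = [theta[i] + gain[i] * error for i in range(n)]
        # Covariance update; P stays symmetric, so phi^T P equals p_phi^T.
        P = [[(P[i][j] - gain[i] * p_phi[j]) / forgetting for j in range(n)]
             for i in range(n)]
    return theta

# Noise-free synthetic data generated from theta = [2, -1].
phis = [[math.sin(0.3 * t), math.cos(0.7 * t)] for t in range(60)]
ys = [2.0 * p[0] - 1.0 * p[1] for p in phis]
print(rls_exponential(phis, ys))
```

A forgetting factor closer to 1 gives a longer effective memory and smoother estimates; smaller values track parameter changes faster at the cost of higher variance.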

  • Reversible data hiding in encrypted DICOM images using sorted binary sequences of pixels

    Publication

    In this paper, a novel reversible data hiding method for encrypted DICOM images is proposed. The method utilizes binary decomposition of the input data paired with a sorting process of the obtained binary sequences to ensure efficient data embedding in each predefined data block for specific most significant bit (MSB) planes while exploiting the properties of run-length encoding. The proposed scheme is lossless, and based on the...

    Full text to download in external service

  • Karhunen-Loeve-based approach to tracking of rapidly fading wireless communication channels

    Publication

    When parameters of wireless communication channels vary at a fast rate, simple estimation algorithms, such as weighted least squares (WLS) or least mean squares (LMS) algorithms, cannot estimate them with the accuracy needed to secure the reliable operation of the underlying communication systems. In cases like this, the local basis function (LBF) estimation technique can be used instead, significantly increasing the achievable...

    Full text to download in external service

  • Depth Determination Accuracy of the Modified Prony Method in a Swath Mapping Application

    Publication

    - Year 2018

    This article presents the performance of the modified Prony method in a swath mapping application. Depth determination accuracy is assessed by processing raw signal acquired by an EdgeTech 6205 swath bathymetry system over flat seafloor. An updated version of the method, proposed previously by the authors, is used to determine the number of signal echoes. The number of signal echoes is essential for performing the low-rank approximation...

    Full text to download in external service

  • New method of IEEE 802.15.4a UWB Impulse Radio Spectrum Shaping

    Publication

    - Year 2011

    This paper presents a new technique of IEEE 802.15.4a ultra-wideband signal spectrum control, based on changes in sequences of transmitted pulses with very short duration time. Basic parameters of UWB signal and outline of proposed spectrum shaping methods are briefly described. The main part of the paper presents influence of signal and algorithms parameters on the results of spectrum shaping.

  • Resistant to correlated noise and outliers discrete identification of continuous non-linear non-stationary dynamic objects

    Publication

    In this article, specific methods of parameter estimation were used to identify the coefficients of continuous models represented by linear and nonlinear differential equations. The necessary discrete-time approximation of the base model is achieved by appropriately tuned FIR linear integral filters. The resulting discrete descriptions, which retain the original continuous parameterization, can then be identified using the classical...

    Full text to download in external service

  • Resistant to correlated noise and outliers discrete identification of continuous non-linear non-stationary dynamic objects

    Publication

    In this study, dedicated methods of parameter estimation were used to identify the coefficients of continuous models represented by linear and nonlinear differential equations. The necessary discrete-time approximation of the base model is achieved by appropriately tuned FIR linear integral filters. The resulting discrete descriptions, which retain the original continuous parameterization, can then be identified using the classical...

    Full text to download in external service

  • Towards Cancer Patients Classification Using Liquid Biopsy

    Liquid biopsy is a useful, minimally invasive diagnostic and monitoring tool for cancer disease. Yet, developing accurate methods, given the potentially large number of input features, and usually small datasets size remains very challenging. Recently, a novel feature parameterization based on the RNA-sequenced platelet data which uses the biological knowledge from the Kyoto Encyclopedia of Genes and Genomes, combined with a classifier...

    Full text to download in external service

  • Wavelet filtering of signals without using model functions

    Publication

    The effective wavelet filtering of real signals is impossible without determining their shape. The shape of a real signal is related to its wavelet spectrum. For shape analysis, a continuous color wavelet spectrogram of signal level is often used. The disadvantage of continuous wavelet spectrogram is the complexity of analyzing a blurry color image. A real signal with additive noise strongly distorts the spectrogram based on continuous...

    Full text available to download

  • Noise effect on parameters of quiet sonar with code modulation

    Earlier publications by the authors have shown that the use of code keying mixed with the CW FM sound signal allows a significant reduction in the distance measurement error, compared to classic silent CW FM sonar. In addition to the code modulation parameters, the magnitude of this error is influenced by the received input acoustic noise. The article shows the dependence of the input signal-to-noise ratio and the sound...

    Full text available to download

  • Stress Detection of Children With ASD Using Physiological Signals

    Publication
    • S. N. B. Aktas
    • P. Uluer
    • B. Coskun
    • E. Toprak
    • D. E. Barkana
    • H. Kose
    • T. Zorcec
    • B. Robins
    • A. Landowska

    - Year 2022

    This paper proposes a physiological signal-based stress detection approach for children with autism spectrum disorder (ASD) to be used in social and assistive robot intervention. Electrodermal activity (EDA) and blood volume pulse (BVP) signals are collected with an E4 smart wristband from children with ASD in different countries. The peak count and signal amplitude features are derived from the EDA signal and used in order to detect...

    Full text to download in external service

  • Optimal configuration of an electrode array for measuring ventricles' contraction

    An influence of an electrode-array configuration on an impedance signal composition for a fixed spatial distribution of its sources is examined in the paper. The Finite Element Method and Geselowitz relationship were used for examining three different electrode-arrays. A sensitivity approach was used to evaluate each configuration assuming that localization of the signal source is known. A conductivity change, thus the source of...

    Full text available to download

  • Evaluation Criteria for Affect-Annotated Databases

    In this paper a set of comprehensive evaluation criteria for affect-annotated databases is proposed. These criteria can be used for evaluation of the quality of a database at the stage of its creation as well as for evaluation and comparison of existing databases. The usefulness of these criteria is demonstrated on several databases selected from the affective computing domain. The databases contain different kinds of data: video or still...

    Full text to download in external service

  • Intelligent multimedia solutions supporting special education needs.

    The role of computers in school education is briefly discussed. The history of multimodal interface development is briefly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including an interactive electronic whiteboard based on video image analysis, an application for controlling computers with facial expression and a speech stretching audio interface representing audio modality....

  • Intelligent video and audio applications for learning enhancement

    The role of computers in school education is briefly discussed. The history of multimodal interface development is briefly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including an interactive electronic whiteboard based on video image analysis, an application for controlling computers with facial expression and a speech stretching audio interface representing audio modality....

    Full text available to download

  • A method of RTS noise identification in noise signals of semiconductor devices in the time domain

    In the paper a new method of Random Telegraph Signal (RTS) noise identification is presented. The method is based on a standardized histogram of instantaneous noise values and processing by Gram-Charlier series. To find a device generating RTS noise by the presented method, one should count the number of significant coefficients of the Gram-Charlier series. This makes it possible to recognize the type of noise. There is always one (first)...

    Full text available to download

  • Identification of Optocoupler Devices with RTS Noise

    The results of noise measurements in the low-frequency range for CNY 17 type optocouplers are presented. The research was carried out on devices with different values of Current Transfer Ratio (CTR). Methods for identification of Random Telegraph Signal (RTS) in the noise signal of optocouplers are proposed. It was found that the Noise Scattering Pattern method (NSP method) makes it possible to identify RTS noise as a non-Gaussian component...

  • The development of an underwater telephone for digital communication purposes

    Publication

    - HYDROACOUSTICS - Year 2016

    The underwater telephone HTL-10 has been designed to provide voice and data communication between a helicopter and submarines using acoustic waves. It works in a half-duplex mode and uses analogue power-efficient modulation in the form of a single side-band, suppressed carrier, in a wide range of frequencies. It generates the transmitted signal, and processes the received signals. It is implemented with the use of digital signal...

    Full text available to download