Filters
total: 1879
filtered: 1253
displaying 1000 best results Help
Search results for: SPEECH SIGNAL PARAMETERIZATION
-
Active Control of Highly Autocorrelated Machinery Noise in Multivariate Nonminimum Phase Systems
PublicationIn this paper, a novel multivariate active noise control scheme, designed to attenuate disturbances with high autocorrelation characteristics and preserve background signals, is proposed. The algorithm belongs to the class of feedback controllers and, unlike the popular feedforward FX-LMS approach, does not require availability of a reference signal. The proposed approach draws its inspiration from the iterative learning control...
-
Creating new voices using normalizing flows
PublicationCreating realistic and natural-sounding synthetic speech remains a big challenge for voice identities unseen during training. As there is growing interest in synthesizing voices of new speakers, here we investigate the ability of normalizing flows in text-to-speech (TTS) and voice conversion (VC) modes to extrapolate from speakers observed during training to create unseen speaker identities. Firstly, we create an approach for TTS...
-
PHONEME DISTORTION IN PUBLIC ADDRESS SYSTEMS
PublicationThe quality of voice messages in speech reinforcement and public address systems is often poor. The sound engineering projects of such systems take care of sound intensity and possible reverberation phenomena in public space without, however, considering the influence of acoustic interference related to the number and distribution of loudspeakers. This paper presents the results of measurements and numerical simulations of the...
-
Towards classification of patients based on surface EMG data of temporomandibular joint muscles using self-organising maps
PublicationThe study considers the need for an effective method of classification of patients with a temporomandibular joint disorder (TMD). The self-organising map method (SOM) was applied to group patients and used together with the cross-correlation approach to interpret the processed (rectified and smoothed by using root mean square (RMS) algorithm) surface electromyography signal (sEMG) obtained from testing the muscles (two temporal...
-
Audio Feature Analysis for Precise Vocalic Segments Classification in English
PublicationAn approach to identifying the most meaningful Mel-Frequency Cepstral Coefficients representing selected allophones and vocalic segments for their classification is presented in the paper. For this purpose, experiments were carried out using algorithms such as Principal Component Analysis, Feature Importance, and Recursive Parameter Elimination. The data used were recordings made within the ALOFON corpus containing audio signal...
-
Application of the neural networks for developing new parametrization of the Tersoff potential for carbon
PublicationPenta-graphene (PG) is a 2D carbon allotrope composed of a layer of pentagons having sp2- and sp3-bonded carbon atoms. A study carried out in 2018 has shown that the parameterization of the Tersoff potential proposed in 2005 by Ehrhart and Able (T05 potential) performs better than other potentials available for carbon, being able to reproduce structural and mechanical properties of the PG. In this work, we tried to improve the...
-
Physics-Based Coarse-Grained Modeling in Bio- and Nanochemistry
PublicationCoarse-grained approaches, in which groups of atoms are represented by single interaction sites, are very important in biological and materials sciences because they enable us to cover the size- and time-scales by several orders of magnitude larger than those available all-atom simulations, while largely keeping the details of the systems studied. The coarse-grained approaches differ by the scheme of reduction and by the origin...
-
Auditory-visual attention stimulator
PublicationNew approach to lateralization irregularities formation was proposed. The emphasis is put on the relationship between visual and auditory attention stimulation. In this approach hearing is stimulated using time scale modified speech and sight is stimulated by rendering the text of the currently heard speech. Moreover, displayed text is modified using several techniques i.e. zooming, highlighting etc. In the experimental part of...
-
INVESTIGATION OF THE LOMBARD EFFECT BASED ON A MACHINE LEARNING APPROACH
PublicationThe Lombard effect is an involuntary increase in the speaker’s pitch, intensity, and duration in the presence of noise. It makes it possible to communicate in noisy environments more effectively. This study aims to investigate an efficient method for detecting the Lombard effect in uttered speech. The influence of interfering noise, room type, and the gender of the person on the detection process is examined. First, acoustic parameters...
-
Impact of maintenance of floodplains of the Vistula River on high water levels on the section from Włocławek to Toruń
PublicationThis article describes the methodology of hydraulic calculations to estimate the water levels in open channels for steady gradually varied flow. The presented method has been used to analyse the water level on the Vistula River from Włocławek cross-section to Toruń cross-section. The HEC-RAS modelling system has been used for parameterization of the river channel and floodplains, as well as for flow simulation. The results obtained...
-
Theoretical calculation of the physico-chemical properties of 1-butyl-4-methylpyridinium based ionic liquids
PublicationACCEPTED MAIonic liquids (ILs) have attracted much attention for their unique physicochemical properties, which can be designed as needed by altering the ion combinations. Besides experimental work, numerous computational studies have been concerned with prediction of physical properties of ILs. The results of molecular dynamics simulations of ILs depend strongly on the proper force field parameterization. Classical force fields...
-
Study Analysis of Transmission Efficiency in DAB+ Broadcasting System
PublicationDAB+ is a very innovative and universal multimedia broadcasting system. Thanks to its updated multimedia technologies and metadata options, digital radio keeps pace with changing consumer expectations and the impact of media convergence. Broadcasting analog and digital radio services does vary, concerning devices on both transmitting and receiving side, as well as content processing mechanisms. However, the biggest difference is...
-
Camera angle invariant shape recognition in surveillance systems
PublicationA method for human action recognition in surveillance systems is described. Problems within this task are discussed and a solution based on 3D object models is proposed. The idea is shown and some of its limitations are talked over. Shape description methods are introduced along with their main features. Utilized parameterization algorithm is presented. Classification problem, restricted to bi-nary cases is discussed. Support vector...
-
Voiceless Stop Consonant Modelling and Synthesis Framework Based on MISO Dynamic System
PublicationA voiceless stop consonant phoneme modelling and synthesis framework based on a phoneme modelling in low-frequency range and high-frequency range separately is proposed. The phoneme signal is decomposed into the sums of simpler basic components and described as the output of a linear multiple-input and single-output (MISO) system. The impulse response of each channel is a third order quasi-polynomial. Using this framework, the...
-
Prof. Haitham Abu-Rub - A Visit to Poland's Gdansk University of Technology
PublicationReport on visit of Prof. Haitham Abu-Rub in Gdansk University of Technology. Speech on the Smart Grid Centre. Visit in the new smart grid laboratory of the GUT, the Laboratory for Innovative Power Technologies and Integration of Renewable Energy Sources (LINTE^2).
-
Mispronunciation Detection in Non-Native (L2) English with Uncertainty Modeling
PublicationA common approach to the automatic detection of mispronunciation in language learning is to recognize the phonemes produced by a student and compare it to the expected pronunciation of a native speaker. This approach makes two simplifying assumptions: a) phonemes can be recognized from speech with high accuracy, b) there is a single correct way for a sentence to be pronounced. These assumptions do not always hold, which can result...
-
Modeling and Designing Acoustical Conditions of the Interior – Case Study
PublicationThe primary aim of this research study was to model acoustic conditions of the Courtyard of the Gdańsk University of Technology Main Building, and then to design a sound reinforcement system for this interior. First, results of measurements of the parameters of the acoustic field are presented. Then, the comparison between measured and predicted values using the ODEON program is shown. Collected data indicate a long reverberation...
-
A comparative study of English viseme recognition methods and algorithm
PublicationAn elementary visual unit – the viseme is concerned in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative research on their effectiveness. In the course of the study an optimal feature vector...
-
A comparative study of English viseme recognition methods and algorithms
PublicationAn elementary visual unit – the viseme is concerned in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative research on their effectiveness. In the course of the study an optimal feature vector construction...
-
Continuous wave sonar with hyperbolic frequency modulation keyed by pseudo-random sequence
PublicationA CW FM type sounding signal is used in the classical solution of silent sonar. While the signal provides a relatively simple implementation of digital signal processing, and ensures good detection conditions, unfortunately, in the presence of the Doppler effect, distance measurement results tend to be wrong. This is due to the fact that the received signal’s instantaneous frequency value is dependent both on the distance to the...
-
Audio Content and Crowdsourcing: A Subjective Quality Evaluation of Radio Programs Streamed Online
PublicationRadio broadcasting has been present in our lives for over 100 years. The transmission of speech and music signals accompanies us from an early age. Broadcasts provide the latest information from home and abroad. They also shape musical tastes and allow many artists to share their creativity. Modern distribution involves transmission over a number of terrestrial systems. The most popular are analog FM (Frequency Modulation) and...
-
Określenie parametrów modelowania geometrii krzyżownic rozjazdów zwyczajnych dla potrzeb budowy i utrzymania linii kolejowych
PublicationZdecydowana większość rozjazdów występująca na liniach kolejowych w Polsce to rozjazdy zwyczajne o typowym zestawie parametrów. Z tego powodu analiza przypadków nietypowych (takich jak rozjazdy o zmiennej krzywiźnie toru zwrotnego) może być utrudniona. Wirtualny model geometryczno-konstrukcyjny rozjazdu, generowany w oparciu o metody analityczne, stanowić może narzędzie użyteczne w sferze projektowania, konstrukcji oraz diagnostyki...
-
Analogue GPS repeater
PublicationThis article concerns the problem of difficulty in correct indoor Global Positioning System (GPS) signals reception due to attenuation. Radio signal repeaters are proposed as means to solve this problem. Firstly, the GPS signal characteristics are described, emphasizing their low power in the point of reception. In the second part, various applications of GPS signal repeater, where indoor GPS signals reception is required, are...
-
Examining Classifiers Applied to Static Hand Gesture Recognition in Novel Sound Mixing System
PublicationThe main objective of the chapter is to present the methodology and results of examining various classifiers (Nearest Neighbor-like algorithm with non-nested generalization (NNge), Naive Bayes, C4.5 (J48), Random Tree, Random Forests, Artificial Neural Networks (Multilayer Perceptron), Support Vector Machine (SVM) used for static gesture recognition. A problem of effective gesture recognition is outlined in the context of the system...
-
Music Mood Visualization Using Self-Organizing Maps
PublicationDue to an increasing amount of music being made available in digital form in the Internet, an automatic organization of music is sought. The paper presents an approach to graphical representation of mood of songs based on Self-Organizing Maps. Parameters describing mood of music are proposed and calculated and then analyzed employing correlation with mood dimensions based on the Multidimensional Scaling. A map is created in which...
-
Efficiency of IEEE 802.15.4a UWB Impulse Radio Spectrum Shaping
PublicationThis paper presents results of impulse radio signal spectrum shaping efficiency investigations. Basic parameters of IEEE 802.15.4a UWB signal and outline of proposed spectrum shaping methods are briefly described. The main part of the paper presents influence of signal and algorithms parameters on the results of spectrum shaping.
-
Suppression of distortions in signals received from Doppler sensor for vehicle speed measurement
PublicationDoppler sensors are commonly used for movement detection and speed measurement. However, electromagnetic interference and imperfections in sensor construction result in degradation of the signal to noise ratio. As a result, detection of signals reflected from moving objects becomes problematic. The paper proposes an algorithm for reduction of distortions and noise in the signal received from a simple, dual-channel type of a Doppler...
-
Arm EMG Wavelet-Based Denoising System
PublicationThese paper presents research results of muscle EMG signal denoising. In the same time two muscles were examined - an adductor muscle (biceps brachii) and an abductor muscle (tricpeps brachii). The EMG signal was filtered using the wavelet transform technique, having selected the crucial parameters as: wavelet basis function (Daubechies 4), 10 th decomposition level, threshold selection algorithm (Heurestic) and a sln rescaling...
-
Reduction of parasitic pitch variations in archival musical recordings
PublicationA new method for reducing parasitic pitch variations in archival audio recordings is presented. The method is intended for analyzing movie soundtracks recorded in optical films. It utilizes image processing for calculating and reducing effects of tape shrinkage being one of the main reasons for parasitic pitch variations in audio accompanying moving images. As long as the film tape characteristics are known the new method can be...
-
A self-optimization mechanism for generalized adaptive notch smoother
PublicationTracking of nonstationary narrowband signals is often accomplished using algorithms called adaptive notch filters (ANFs). Generalized adaptive notch smoothers (GANSs) extend the concepts of adaptive notch filtering in two directions. Firstly, they are designed to estimate coefficients of nonstationary quasi-periodic systems, rather than signals. Secondly, they employ noncausal processing, which greatly improves their accuracy and...
-
Matrix-based robust joint fingerprinting and decryption method for multicast distribution of multimedia
PublicationThis paper addresses the problem of unauthorized redistribution of multimedia content by malicious users (pirates). The solution proposed here is a new joint fingerprinting and decryption method which meets the requirements for both imperceptibility and robustness of fingerprints and scalability in terms of design and distribution of fingerprinted multimedia content. The proposed method uses a simple block cipher based on matrix...
-
Adaptive identification of sparse underwater acoustic channels with a mix of static and time-varying parameters
PublicationWe consider identification of sparse linear systems with a mix of static and time-varying parameters. Such systems are typical in underwater acoustics (UWA), for instance, in applications requiring identi- fication of the acoustic channel, such as UWA communications, navigation and continuous-wave sonar. The recently proposed fast local basis function (fLBF) algorithm provides high performance when identi- fying time-varying systems....
-
Finite-window RLS algorithms
PublicationTwo recursive least-squares (RLS) adaptive filtering algorithms are most often used in practice, the exponential and sliding (rectangular) window RLS algorithms. This popularity is mainly due to existence of low-complexity versions of these algorithms. However, these two windows are not always the best choice for identification of fast time-varying systems, when the identification performance is most important. In this paper, we...
-
Reversible data hiding in encrypted DICOM images using sorted binary sequences of pixels
PublicationIn this paper, a novel reversible data hiding method for encrypted DICOM images is proposed. The method utilizes binary decomposition of the input data paired with a sorting process of the obtained binary sequences to ensure efficient data embedding in each predefined data block for specific most significant bit (MSB) planes while exploiting the properties of run-length encoding. The proposed scheme is lossless, and based on the...
-
Karhunen-Loeve-based approach to tracking of rapidly fading wireless communication channels
PublicationWhen parameters of wireless communication channels vary at a fast rate, simple estimation algorithms, such as weighted least squares (WLS) or least mean squares (LMS) algorithms, cannot estimate them with the accuracy needed to secure the reliable operation of the underlying communication systems. In cases like this, the local basis function (LBF) estimation technique can be used instead, significantly increasing the achievable...
-
Depth Determination Accuracy of the Modified Prony Method in a Swath Mapping Application
PublicationThis article presents the performance of the modified Prony method in a swath mapping application. Depth determination accuracy is assessed by processing raw signal acquired by an EdgeTech 6205 swath bathymetry system over flat seafloor. An updated version of the method, proposed previously by the authors, is used to determine the number of signal echoes. The number of signal echoes is essential for performing the low-rank approximation...
-
New method of IEEE 802.15.4a UWB Impulse Radio Spectrum Shaping
PublicationThis paper presents a new technique of IEEE 802.15.4a ultra-wideband signal spectrum control, based on changes in sequences of transmitted pulses with very short duration time. Basic parameters of UWB signal and outline of proposed spectrum shaping methods are briefly described. The main part of the paper presents influence of signal and algorithms parameters on the results of spectrum shaping.
-
Resistant to correlated noise and outliers discrete identification of continuous non-linear non-stationary dynamic objects
PublicationIn this article, specific methods of parameter estimation were used to identify the coefficients of continuous models represented by linear and nonlinear differential equations. The necessary discrete-time approximation of the base model is achieved by appropriately tuned FIR linear integral filters. The resulting discrete descriptions, which retain the original continuous parameterization, can then be identified using the classical...
-
Resistant to correlated noise and outliers discrete identification of continuous non-linear non-stationary dynamic objects
PublicationIn this study, dedicated methods of parameter estimation were used to identify the coefficients of continuous models represented by linear and nonlinear differential equations. The necessary discrete-time approximation of the base model is achieved by appropriately tuned FIR linear integral filters. The resulting discrete descriptions, which retain the original continuous parameterization, can then be identified using the classical...
-
Towards Cancer Patients Classification Using Liquid Biopsy
PublicationLiquid biopsy is a useful, minimally invasive diagnostic and monitoring tool for cancer disease. Yet, developing accurate methods, given the potentially large number of input features, and usually small datasets size remains very challenging. Recently, a novel feature parameterization based on the RNA-sequenced platelet data which uses the biological knowledge from the Kyoto Encyclopedia of Genes and Genomes, combined with a classifier...
-
Wavelet filtering of signals without using model functions
PublicationThe effective wavelet filtering of real signals is impossible without determining their shape. The shape of a real signal is related to its wavelet spectrum. For shape analysis, a continuous color wavelet spectrogram of signal level is often used. The disadvantage of continuous wavelet spectrogram is the complexity of analyzing a blurry color image. A real signal with additive noise strongly distorts the spectrogram based on continuous...
-
Noise effect on parameters of quiet sonar with code modulation
PublicationEarlier publications of the paper authors have shown that the use of code keying mixed with the CW FM sound signal allows the significant reduction in the distance measurement error, compared to classic silent CW FM sonar. In addition to the code modulation parameters, the magnitude of this error is influenced by the received input acoustic noise. The article shows the dependence of the input signal-to-noise ratio and the sound...
-
Stress Detection of Children With ASD Using Physiological Signals
PublicationThis paper proposes a physiological signal-based stress detection approach for children with autism spectrum disorder (ASD) to be used in social and assistive robot inter- vention. Electrodermal activity (EDA) and blood volume pulse (BVP) signals are collected with an E4 smart wristband from children with ASD in different countries. The peak count and signal amplitude features are derived from EDA signal and used in order to detect...
-
Optimal configuration of an electrode array for measuring ventricles' contraction
PublicationAn influence of an electrode-array configuration on an impedance signal composition for a fixed spatial distribution of its sources is examined in the paper. The Finite Element Method and Geselowitz relationship were used for examining three different electrode-arrays. A sensitivity approach was used to evaluate each configuration assuming that localization of the signal source is known. A conductivity change, thus the source of...
-
Evaluation Criteria for Affect-Annotated Databases
PublicationIn this paper a set of comprehensive evaluation criteria for affect-annotated databases is proposed. These criteria can be used for evaluation of the quality of a database on the stage of its creation as well as for evaluation and comparison of existing databases. The usefulness of these criteria is demonstrated on several databases selected from affect computing domain. The databases contain different kind of data: video or still...
-
Intelligent multimedia solutions supporting special education needs.
PublicationThe role of computers in school education is briefly discussed. Multimodal interfaces development history is shortly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with facial expression and speech stretching audio interface representing audio modality....
-
Intelligent video and audio applications for learning enhancement
PublicationThe role of computers in school education is briefly discussed. Multimodal interfaces development history is shortly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with facial expression and speech stretching audio interface representing audio modality....
-
A method of RTS noise identification in noise signals of semiconductor devices in the time domain
PublicationIn the paper a new method of Random Telegraph Signal (RTS) noise identification is presented. The method is based on a standardized histogram of instantaneous noise values and processing by Gram-Charlier series. To find a device generating RTS noise by the presented method one should count the number of significant coefficients of the Gram-Charlier series. This would allow to recognize the type of noise. There is always one (first)...
-
Identification of Optocoupler Devices with RTS Noise
PublicationThe results of noise measurements in low frequency range for CNY 17 type optocouplers are presented. The research were carried out on devices with different values of Current Transfer Ratio (CTR). The methods for identification of Random Telegraph Signal (RTS) in noise signal of optocouplers were proposed. It was found that the Noise Scattering Pattern method (NSP method) enables to identify RTS noise as non-Gaussian component...
-
The development of an underwater telephone for digital communication purposes
PublicationThe underwater telephone HTL-10 has been designed to provide voice and data communication between helicopter and submarines using acoustic waves. It works in a half-duplex mode and uses analogue power-efficient modulation in the form of a single side-band, suppressed carrier, in a wide range of frequencies. It generates the transmitted signal, and processes the received signals. It is implemented with the use of digital signal...