Search results for: alophoneme analisys, speech processing, dynamic time warping
-
Modified dynamic time warping method applied to handwritten signature authenticity verification
PublicationA signature verification system based on static features and time-domain functions of signals obtained using a tablet has been presented in the paper. The signature verification method, based mainly on dynamic time warping coupled with some signature image features, has been described. The FRR measures reflecting the method’s efficiency have been evaluated for verification attempts performed directly after obtaining model signatures...
-
Real-time speech-rate modification experiments
PublicationAn algorithm designed for real-time speech time scale modification (stretching) is proposed, providing a combination of typical synchronous overlap and add based time scale modification algorithm and signal redundancy detection algorithms that allow to remove parts of the speech signal and replace them with the stretched speech signal fragments. Effectiveness of signal processing algorithms are examined experimentally together...
-
Application of dynamic time warping and cepstrograms to text-dependent speaker verification
PublicationThis work provides a description of an automatic speaker verification (ASV) system. In particular, it documents the evolution of all individual stages of the proposed ASV system design from the phase of preprocessing to an operational decision making system. The aim of this research was to achieve the system of the best safety and ease of use in view of users. The objective estimation of this target has been accomplished by assessing...
-
Dot-com and AI bubbles: Can data from the past be helpful to match the price bubble euphoria phase using dynamic time warping?
PublicationThe article investigates the existence of a price bubble in the artificial intelligence market, employing the Generalised Supremum Augmented Dickey-Fuller test and dynamic time warping methodology. It proposes a method to detect the end of the price bubble euphoria phase, generating an average profit of close to 7% over 5 days and over 10.5% over 20 days, with almost 90% effectiveness. The study found that the AI market experienced...
-
Improved method for real-time speech stretching
Publicationn algorithm for real-time speech stretching is presented. It was designed to modify input signal dependently on its content and on its relation with the historical input data. The proposed algorithm is a combination of speech signal analysis algorithms, i.e. voice, vowels/consonants, stuttering detection and SOLA (Synchronous-Overlap-and-Add) based speech stretching algorithm. This approach enables stretching input speech signal...
-
Time-domain prosodic modifications for text-to-speech synthesizer
PublicationAn application of prosodic speech processing algorithms to Text-To-Speech synthesis is presented. Prosodic modifications that improve the naturalness of the synthesized signal are discussed. The applied method is based on the TD-PSOLA algorithm. The developed Text-To-Speech Synthesizer is used in applications employing multimodal computer interfaces.
-
A non-uniform real-time speech time-scale stretching method
PublicationAn algorithm for non-uniform real-time speech stretching is presented. It provides a combination of typical SOLA algorithm (Synchronous Overlap and Add ) with the vowels, consonants and silence detectors. Based on the information about the content and the estimated value of the rate of speech (ROS), the algorithm adapts the scaling factor value. The ability of real-time speech stretching and the resultant quality of voice were...
-
A Method of Real-Time Non-uniform Speech Stretching
PublicationDeveloped method of real-time non-uniform speech stretching is presented.The proposed solution is based on the well-known SOLA algorithm(Synchronous Overlap and Add). Non-uniform time-scale modification isachieved by the adjustment of time scaling factor values in accordance with thesignal content. Dependently on the speech unit (vowels/consonants), instantaneousrate of speech (ROS), and speech signal presence, values of the scalingfactor...
-
Comparison of various speech time-scale modificartion methods
PublicationThe objective of this work is to investigate the influence of the different time-scale modification (TSM) methods on the quality of the speech stretched up using the designed non-uniform real-time speech time-scale modification algorithm (NU-RTSM). The algorithm provides a combination of the typical TSM algorithm with the vowels, consonants, stutter, transients and silence detectors. Based on the information about the content and...
-
Speech codec enhancements utilizing time compression and perceptual coding
PublicationA method for encoding wideband speech signal employing standardized narrowband speech codecs is presented as well as experimental results concerning detection of tonal spectral components. The speech signal sampled with a higher sampling rate than it is suitable for narrowband coding algorithm is compressed in order to decrease the amount of samples. Next, the time-compressed representation of a signal is encoded using a narrowband...
-
Parallel implementation of background subtraction algorithms for real-time video processing on a supercomputer platform
PublicationResults of evaluation of the background subtraction algorithms implemented on a supercomputer platform in a parallel manner are presented in the paper. The aim of the work is to chose an algorithm, a number of threads and a task scheduling method, that together provide satisfactory accuracy and efficiency of a real-time processing of high resolution camera images, maintaining the cost of resources usage at a reasonable level. Two...
-
Intelligent processing of stuttered speech.
PublicationW artykule zaprezentowano kilka metod analizy i automatycznego zliczania potknięć artykulacyjnych, związanych z jąkaniem się, opartych na wykorzystaniu algorytmów uczących się sztucznych sieci neuronowych i zbiorów przybliżonych.
-
Real-time speech streching for supporting hearing impaired schoolchildren
PublicationA study of time scale modification algorithms applied to support hearing impaired schoolchildren is presented. Variety of algorithms are considered, namely: overlap-and add, two variations of synchronous overlapand- add, and the phase vocoder. Their effectiveness as well as real-time processing capabilities are examined.
-
Time-scale modification of speech signals for supporting hearing impaired schoolchildren
PublicationA study of time scale modification algorithmsapplied to hearing impaired schoolchildren supporting ispresented. Variety of algorithms are considered, namely:overlap and add, two variations of synchronized overlapand add, and the phase vocoder. Their effectiveness as wellas real-time processing capabilities are examined.
-
Linear Time-Varying Dynamic-Algebraic Equations of Index One on Time Scales
PublicationIn this paper, we introduce a class of linear time-varying dynamic-algebraic equations (LTVDAE) of tractability index one on ar- bitrary time scales. We propose a procedure for the decoupling of the considered class LTVDAE. Explicit formulae are written down both for transfer operator and the obtained decoupled system. A projector ap- proach is used to prove the main statement of the paper and sufficient conditions of decoupling...
-
Overhead wires detection by FPGA real-time image processing
PublicationThe paper presents design and hardware implementation of real-time image filtering for overhead wires detection divided on image processing and results presentation blocks. The image processing block was separated from the whole implementation, and its delay and hardware complexity was analysed. Also the maximum frequency of image processing of the proposed implementation was estimated.
-
Artur Gańcza dr inż.
PeopleI received the M.Sc. degree from the Gdańsk University of Technology (GUT), Gdańsk, Poland, in 2019. I am currently a Ph.D. student at GUT, with the Department of Automatic Control, Faculty of Electronics, Telecommunications and Informatics. My professional interests include speech recognition, system identification, adaptive signal processing and linear algebra.
-
Prediction of Processor Utilization for Real-Time Multimedia Stream Processing Tasks
PublicationUtilization of MPUs in a computing cluster node for multimedia stream processing is considered. Non-linear increase of processor utilization is described and a related class of algorithms for multimedia real-time processing tasks is defined. For such conditions, experiments measuring the processor utilization and output data loss were proposed and their results presented. A new formula for prediction of utilization was proposed...
-
System of speech signal processing and visualisation for linguistic purposes
Publication -
On time-dependent nonlinear dynamic response of micro-elastic solids
PublicationA new approach to the mechanical response of micro-mechanic problems is presented using the modified couple stress theory. This model captured micro-turns due to micro-particles' rotations which could be essential for microstructural materials and/or at small scales. In a micro media based on the small rotations, sub-particles can also turn except the whole domain rotation. However, this framework is competent for a static medium....
-
Estimation of time-frequency complex phase-based speech attributes using narrow band filter banks
PublicationIn this paper, we present nonlinear estimators of nonstationary and multicomponent signal attributes (parameters, properties) which are instantaneous frequency, spectral (or group) delay, and chirp-rate (also known as instantaneous frequency slope). We estimate all of these distributions in the time-frequency domain using both finite and infinite impulse response (FIR and IIR) narrow band filers for speech analysis. Then, we present...
-
Multi-core processing system for real-time image processing in embedded computer vision applications
PublicationW artykule opisano architekturę wielordzeniowego programowalnego systemu do przetwarzania obrazów w czasie rzeczywistym. Dane obrazu są przetwarzane równocześnie przez wszystkie procesory. System umożliwia niskopoziomowe przetwarzanie obrazów,np. odejmowanie tła, wykrywanie obiektów ruchomych, transformacje geometryczne, indeksowanie wykrytych obiektów, ocena ich kształtu oraz podstawowa analiza trajektorii ruchu. Ang:This paper...
-
Time Domain Modeling of Propeller Forces due to Ventilation in Static and Dynamic Conditions
PublicationThis paper presents experimental and theoretical studies on the dynamic effect on the propeller loading due to ventilation by using a simulation model that generates a time domain solution for propeller forces in varying operational conditions. For ventilation modeling, the simulation model applies a formula based on the idea that the change in lift coefficient due to ventilation computes the change in the thrust coefficient. It...
-
Neural modelling of dynamic systems with time delays based on an adjusted NEAT algorithm
PublicationA problem related to the development of an algorithm designed to find an architecture of artificial neural network used for black-box modelling of dynamic systems with time delays has been addressed in this paper. The proposed algorithm is based on a well-known NeuroEvolution of Augmenting Topologies (NEAT) algorithm. The NEAT algorithm has been adjusted by allowing additional connections within an artificial neural network and...
-
Marking the Allophones Boundaries Based on the DTW Algorithm
PublicationThe paper presents an approach to marking the boundaries of allophones in the speech signal based on the Dynamic Time Warping (DTW) algorithm. Setting and marking of allophones boundaries in continuous speech is a difficult issue due to the mutual influence of adjacent phonemes on each other. It is this neighborhood on the one hand that creates variants of phonemes that is allophones, and on the other hand it affects that the border...
-
Data processing methods for dynamic medical thermography.
PublicationArtykuł przedstawia zastosowanie nowej metody syntezy obrazów w termografii dla potrzeb opisu ilościowego właściwości termicznych tkanek. Opis taki umożliwia różnicowanie przypadków medycznych. Metodę zastosowania dla licznych pomiarów fantomowych i in vitro w eksperymentach na zwierzętach (świnia domowa). Przedstawiono i omówiono rezultaty prac.
-
Impact of Shifting Time-Window Post-Processing on the Quality of Face Detection Algorithms
PublicationWe consider binary classification algorithms, which operate on single frames from video sequences. Such a class of algorithms is named OFA (One Frame Analyzed). Two such algorithms for facial detection are compared in terms of their susceptibility to the FSA (Frame Sequence Analysis) method. It introduces a shifting time-window improvement, which includes the temporal context of frames in a post-processing step that improves the...
-
Dynamic fracture of brittle shells in a space-time adaptive isogeometric phase field framework
PublicationPhase field models for fracture prediction gained popularity as the formulation does not require the specification of ad-hoc criteria and no discontinuities are inserted in the body. This work focuses on dynamic crack evolution of brittle shell structures considering large deformations. The energy contributions from in-plane and out-of-plane deformations are separately split into tensile and compressive components and the resulting...
-
Influence of YARN Schedulers on Power Consumption and Processing Time for Various Big Data Benchmarks
PublicationClimate change caused by human activities can influence the lives of everybody onthe planet. The environmental concerns must be taken into consideration by all fields of studyincludingICT. Green Computing aims to reduce negative effects of IT on the environment while,at the same time, maintaining all of the possible benefits it provides. Several Big Data platformslike Apache Spark orYARNhave become widely used in analytics and...
-
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING
Journals -
Mariusz Kaczmarek dr hab. inż.
PeopleReceived M.Sc., Eng. in Electronics in 1995 from Gdansk University of Technology, Ph.D. in Medical Electronics in 2003 and habilitation in Biocybernetics and Biomedical Engineering in 2017. He was an investigator in about 13 projects receiving a number of awards, including four best papers, practical innovations (7 medals and awards) and also the Andronicos G. Kantsios Award and Siemens Award. Main research activities: the issues...
-
IEEE Transactions on Audio Speech and Language Processing
Journals -
Adaptive Optimal Discrete-Time Output-Feedback Using an Internal Model Principle and Adaptive Dynamic Programming
PublicationIn order to address the output feedback issue for linear discrete-time systems, this work suggests a brand-new adaptive dynamic programming (ADP) technique based on the internal model principle (IMP). The proposed method, termed as IMP-ADP, does not require complete state feedback, merely the measurement of input and output data. More specifically, based on the IMP, the output control problem can first be converted into a stabilization...
-
Assessment of the Impact of GNSS Processing Strategies on the Long-Term Parameters of 20 Years IWV Time Series
PublicationAdvanced processing of collected global navigation satellite systems (GNSS) observations allows for the estimation of zenith tropospheric delay (ZTD), which in turn can be converted to the integrated water vapour (IWV). The proper estimation of GNSS IWV can be affected by the adopted GNSS processing strategy. To verify which of its elements cause deterioration and which improve the estimated GNSS IWV, we conducted eight reprocessings...
-
Dynamic inequalities and equations of Volterra type on time scales
PublicationPraca dotyczy całkowo-różniczkowych równań dynamicznych typu Volterry z warunkami początkowymi. Stosując twierdzenie Banacha o punkcie stałym pokazano istnienie jedynego rozwiązania liniowego równania dynamicznego. Stosując metodę iteracji monotonicznych pokazano istnienie rozwiązań ekstremalnych dla problemów nieliniowych. Badano też nierówności dynamiczne. Praca zawiera również uwagi dotyczące zagadnień różniczkowych i różnicowych.
-
Stability of softly switched multiregional dynamic output controllers with a static antiwindup filter: A discrete-time case
PublicationThis paper addresses the problem of model-based global stability analysis of discrete-time Takagi–Sugeno multiregional dynamic output controllers with static antiwindup filters. The presented analyses are reduced to the problem of a feasibility study of the Linear Matrix Inequalities (LMIs), derived based on Lyapunov stability theory. Two sets of LMIs are considered candidate derived from the classical common quadratic Lyapunov...
-
Real-Time Multimedia Stream data Processing in a Supercomputer Environment
PublicationRozdział opisuje doświadczenia uzyskane przez autorów podczas pracy w projekcie MAYDAY EURO 2012. Przedstawiono główny cel projektu - stworzenie systemu umożliwiającego rozwijanie i równolegle wykonywanie usług multimedialnych w środowisku klastra obliczeniowego dużej mocy. opisano tematykę przetwarzania dużej liczby strumieni multimedialnych na komputerach dużej mocy. Następnie zaprezentowano możliwości platformy KASKADA: tworzenie...
-
The influence of different time duration of thermal processing on berries quality
PublicationOznaczano zawartość związków bioaktywnych (polifenole, flawonoidy, taniny, antocyjany i kwas askorbinowy) oraz poziom aktywności przeciwutleniającej próbek ekstraktów (wodnych, heksanowych i acetonowych) uzyskanych z różnych gatunków owoców jagodowych. Do pomiaru poziomu aktywności przeciwutleniającej wykorzystano takie testy jak ABTS, DPPH, FRAP i CUPRAC. Zbadano wpływ czasu trwania procesu obróbki termicznej na zawartość bioaktywnych...
-
The influence of different time durations of thermal processing on berries quality
PublicationBioactive compounds (polyphenols, flavonoids, flavanols, tannins, anthocyanins and ascorbic acid) and the level of antioxidant activity by ABTS, DPPH, FRAP and CUPRAC of water, acetone and hexane extracts of Chilean 'Murtilla' (Ugni molinae Turcz) and 'Myrteola' berries (Myrtaceae, Myrteola nummularia (Poiret) Berg.), Chilean and Polish blueberries (Vaccinium corymbosum), Chilean raspberries (Rubus idaeus), and Polish black chokeberry...
-
A nine-input 1.25 mW, 34 ns CMOS analog median filter for image processing in real time
PublicationIn this paper an analog voltage-mode median filter, which operates on a 3 × 3 kernel is presented. The filter is implemented in a 0.35 μm CMOS technology. The proposed solution is based on voltage comparators and a bubble sort configuration. As a result, a fast (34 ns) time response with low power consumption (1.25 mW for 3.3 V) is achieved. The key advantage of the configuration is relatively high accuracy of signal processing,...
-
Application of time-frequency methods for analysis of dynamic silo flow
PublicationW artykule przedstawiono możliwość stosowania metod czasowo-częstotliwościowych w analizie dynamicznego przepływu materiału sypkiego w silosie. W pracy omówiono wyniki FT (Fourier Transform), STFT (Short Time Fourier Transform) oraz WT (Wavelet Transform)
-
On–line Parameter and Delay Estimation of Continuous–Time Dynamic Systems
PublicationThe problem of on-line identification of non-stationary delay systems is considered. The dynamics of supervised industrial processes are usually modeled by ordinary differential equations. Discrete-time mechanizations of continuous-time process models are implemented with the use of dedicated finite-horizon integrating filters. Least-squares and instrumental variable procedures mechanized in recursive forms are applied for simultaneous...
-
CMOS implementation of an analogue median filter for image processing in real time
PublicationAn analogue median filter, realised in a 0.35 μm CMOS technology, is presented in this paper. The key advantages of the filter are: high speed of image processing (50 frames per second), low-power operation (below 1.25 mW under 3.3 V supply) and relatively high accuracy of signal processing. The presented filter is a part of an integrated circuit for image processing (a vision chip), containing: a photo-sensor matrix, a set of...
-
Robust-adaptive dynamic programming-based time-delay control of autonomous ships under stochastic disturbances using an actor-critic learning algorithm
PublicationThis paper proposes a hybrid robust-adaptive learning-based control scheme based on Approximate Dynamic Programming (ADP) for the tracking control of autonomous ship maneuvering. We adopt a Time-Delay Control (TDC) approach, which is known as a simple, practical, model free and roughly robust strategy, combined with an Actor-Critic Approximate Dynamic Programming (ACADP) algorithm as an adaptive part in the proposed hybrid control...
-
Boundary value problems for dynamic equations of Volterra type on time scales
PublicationPraca dotyczy równań i nierówności dla problemów dynamicznych typu Volterry. Podano warunki dostateczne na istnienie ekstremalnych rozwiązań w obszarze ograniczonym przez dolne i górne rozwiązania. Praca zawiera również pewne uwagi dla konkretnych zagadnień różniczkowych i dyskretnych.
-
Boundary value problems for dynamic equations with advanced arguments on time scales
PublicationPraca dotyczy równań i nierówności dynamicznych z wyprzedzonym argumentami. Przedmiotem badań były problemy istnienia rozwiązań równań dynamicznych. Sformułowano warunki dostatczne na istnienie jedynego rozwiązania w odpowiednim obszarze ograniczonym przez górne i dolne rozwiązanie.
-
Task Allocation and Scalability Evaluation for Real-Time Multimedia Processing in a Cluster Envirinment
PublicationAn allocation algorithm for stream processing tasks is proposed (Modified best Fit Descendent, MBFD). A comparison with another solution (BFD) is provided. Tests of the algorithms in an HPC environment are descrobed and the results are presented. A proper scalability metric is proposed and used for the evaluation of the allocation algorithm.
-
Digital processing of pulse signal from light-to-frequency converter under dynamic condition
Publication -
System przetwarzania i wizualizacji sygnału mowy dla potrzeb lingwistycznych = System of speech signal processing and visualisation of the results
PublicationW artykule przedstawiono sposób przetwarzania i wizualizacji sygnału mowy w formie prostego w obsłudze i relatywnie niedrogiego urządzenia do nagrywania sygnału akustycznego oraz przetwarzania cyfrowego wyselekcjonowanych fragmentów i wizualizacji uzyskanych rezultatów przekształceń. Zastosowano do tego celu komputer z kartą dźwiękową. Przetwarzanie cyfrowe oraz wizualizacja dokonywana była w oparciu o program MATLAB bezpośrednio...
-
IEEE-ACM Transactions on Audio Speech and Language Processing
Journals