Search results for: TWO-DIMENSIONAL REPRESENTATION OF SPEECH SIGNAL
-
Assessment of Left Atrial Function in Patients with Paroxysmal, Persistent, and Permanent Atrial Fibrillation Using Two-Dimensional Strain.
Publication -
Detection of DNA Replication Intermediates after Two-Dimensional Agarose Gel Electrophoresis Using a Fluorescein-Labeled Probe
Publication -
Enantioselective comprehensive two-dimensional gas chromatography. A route to elucidate the authenticity and origin of Rosa damascena Miller essential oils
Publication -
Solution conformational study of Scyliorhinin I analogues with conformational constraints by two-dimensional NMR and theoretical conformational analysis
Publication -
Method of reconstructing two-dimensional velocity fields on the basis of temperature field values measured with a thermal imaging camera
Publication: This paper describes a novel numerical reconstruction procedure (NRP) of the velocity field during natural convective heat transfer from a two-sided, isothermal, heated vertical plate based only on the known temperature field obtained, e.g., with a thermal imaging camera. It has been demonstrated that, given the temperature distributions, the NRP enables the reconstruction of velocity fields by solving the Navier-Stokes...
-
Diagnostic Test Accuracy of Artificial Intelligence in Detecting Periapical Periodontitis on Two-Dimensional Radiographs: A Retrospective Study and Literature Review
Publication -
Cascading transitions toward unconventional charge density wave states in the quasi-two-dimensional monophosphate tungsten bronze P4W16O56
Publication: Single crystals of the m = 8 member of the low-dimensional monophosphate tungsten bronzes (PO2)4(WO3)2m family were grown by the chemical vapour transport technique, and the high crystalline quality obtained allowed a reinvestigation of the physical and structural properties. Resistivity measurements revealed three anomalies at TC1 = 258 K, TC2 = 245 K and TC3 = 140 K, never observed until now. Parallel X-ray diffraction investigations...
-
Examples of numerical simulations of two-dimensional unsaturated flow with VS2DI code using different interblock conductivity averaging schemes
Publication: Flow in unsaturated porous media is commonly described by the Richards equation. This equation is strongly nonlinear due to interrelationships between water pressure head (negative in unsaturated conditions), water content and hydraulic conductivity. The accuracy of the numerical solution of the Richards equation often depends on the method used to estimate average hydraulic conductivity between neighboring nodes or cells of the numerical...
-
Qualitative characteristics and comparison of volatile fraction of vodkas made from different botanical materials by comprehensive two-dimensional gas chromatography and the electronic nose based on the technology of ultra-fast gas chromatography
Publication: BACKGROUND: Vodka is a spirit-based beverage made from ethyl alcohol of agricultural origin. At present, increasingly more vodka brands have labels that specify the botanical origin of the product. Until now, the techniques for distinguishing between vodkas of different botanical origin have been costly, time-consuming and insufficient for making a distinction between vodka produced from similar raw materials. Therefore, it is...
-
Quantification of DNA Modifications Using Two-Dimensional Ultraperformance Liquid Chromatography Tandem Mass Spectrometry (2D-UPLC-MS/MS)
Publication -
Characteristics of Cucumis metuliferus, Actinidia deliciosa and Musa paradisica fragrance profiles using a comprehensive two-dimensional gas chromatography with time-of-flight mass spectrometric detection (GC×GC-TOF MS)
Publication: Comprehensive two-dimensional gas chromatography with time-of-flight mass spectrometric detection (GC×GC-TOF-MS) is a modern analytical technique used in many fields. This technique enables an effective separation of volatile chemical compounds [1]. In recent years, the topic of healthy food and healthy living has become very popular. For this reason, more and more scientific publications related to the analysis of food products...
-
One-Step Synergistic Effect to Produce Two-Dimensional N-Doped Hierarchical Porous Carbon Nanosheets for High-Performance Flexible Supercapacitors
Publication -
Studies on origin of Polish honeys by two-dimensional gas chromatography [Ocena pochodzenia surowcowego polskich miodów przy użyciu dwuwymiarowej chromatografii gazowej]
Publication: Polish acacia, linden, rapeseed and buckwheat-derived honeys and a honeydew were analysed for the presence of hydrocarbons, alcohols, ketones and esters by two-dimensional gas chromatography to establish markers of honey origin. PrOH was found to be characteristic of acacia honey, Me(CH2)11OH of linden honey, Me(CH2)6CHOMe of honeydew and Me(CH2)8COOEt of rapeseed honey. The technique of two-dimensional gas chromatography coupled...
-
Improving signal quality in speech codec using hybrid perceptual-parametric algorithm. [Poprawa jakości sygnału w kodekach mowy przy użyciu hybrydowego, parametryczno-perceptualnego algorytmu kodowania]
Publication: A hybrid parametric-perceptual codec architecture is presented. The basic structure of the parametric CELP codec was extended with perceptual coding. The aim of hybridizing the codec is to obtain a significant improvement in the subjective quality of the decoded signal. Two hybrid structures are proposed. The first consists in perceptual coding of the voiced elements of the residual signal of the CELP codec. The second method divides the signal...
-
Examining distribution of speech signal parameters for the prognosis of error probability in speaker verification systems [Badanie rozkładów parametrów sygnału mowy w zastosowaniach do prognozowania prawdopodobieństwa popełnienia błędów w systemach identyfikacji mówców]
Publication: The subject of this work is a text-dependent speaker identification system. Many different utterances from several dozen speakers were analyzed. The parameterization method used is based on the results of cepstral analysis of the speech signal. New parameters associated with elementary events in the speaker verification process were defined. On this basis, an estimation was made of the probability density function...
-
Selected results of signal measurements in a ship power station with two generators working in parallel with the use of the Estimator/Analyzer instrument
Research Data: The presented dataset is part of research focusing on the assessment of metrological properties of the instrument, Estimator/Analyser (A/E v.2), developed and made at the Faculty of Electrical Engineering, Department of Marine Electrical Power Engineering, of Gdynia Maritime University. The instrument performs a set of measurement functions that...
-
Accurate, Direct, and High-Throughput Analyses of a Broad Spectrum of Endogenously Generated DNA Base Modifications with Isotope-Dilution Two-Dimensional Ultraperformance Liquid Chromatography with Tandem Mass Spectrometry: Possible Clinical Implication
Publication -
IEEE International Conference on Acoustics, Speech and Signal Processing
Conferences -
Improving the quality of speech in the conditions of noise and interference
Publication: The aim of the work is to present a method of intelligent modification of the speech signal with speech features expressed in noise, based on the Lombard effect. The recordings utilized sets of words and sentences as well as disturbing signals, i.e., pink noise and the so-called babble speech. Noise signal, calibrated to various levels at the speaker's ears, was played over two loudspeakers located 2 m away from the speaker. In...
-
Method and algorithms of signal modification for supporting speech understanding by people with impaired temporal resolution of hearing [Metoda i algorytmy modyfikacji sygnału do celu wspomagania rozumienia mowy przez osoby z pogorszoną rozdzielczością czasową słuchu]
Publication: The subject of the research carried out in this dissertation is real-time methods of time scale modification (TSM) of speech and the assessment of their influence on the understanding of utterances by people with impaired temporal resolution of hearing. Impaired hearing resolution is one of the symptoms associated with Central Auditory Processing Disorder (CAPD). In contrast to...
-
An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics
Publication: Speech with the Lombard effect has been extensively studied in the context of speech recognition or speech enhancement. However, few studies have investigated the Lombard effect in the context of speech synthesis. The aim of this paper is to create a mathematical model that allows for retaining the Lombard effect. These models could be used as a basis of a formant speech synthesizer. The proposed models are based on dividing...
-
Human voice modification using instantaneous complex frequency
Publication: The paper presents the possibilities of changing human voice by modifying instantaneous complex frequency (ICF) of the speech signal. The proposed method provides a flexible way of altering voice without the necessity of finding fundamental frequency and formants' positions or detecting voiced and unvoiced fragments of speech. The algorithm is simple and fast. Apart from ICF it uses signal factorization into two factors: one fully...
-
Improving Objective Speech Quality Indicators in Noise Conditions
Publication: This work aims at modifying speech signal samples and testing them with objective speech quality indicators after mixing the original signals with noise or with an interfering signal. Modifications that are applied to the signal are related to the Lombard speech characteristics, i.e., pitch shifting, utterance duration changes, vocal tract scaling, manipulation of formants. A set of words and sentences in Polish, recorded in silence,...
-
Transient detection for speech coding applications
Publication: Signal quality in speech codecs may be improved by selecting transients from the speech signal and encoding them using a suitable method. This paper presents an algorithm for transient detection in the speech signal. This algorithm operates in several frequency bands. Transient detection functions are calculated from energy measured in short frames of the signal. The final selection of transient frames is based on results of detection...
-
Real-time speech-rate modification experiments
Publication: An algorithm designed for real-time speech time scale modification (stretching) is proposed. It combines a typical synchronous overlap and add based time scale modification algorithm with signal redundancy detection algorithms that allow parts of the speech signal to be removed and replaced with stretched speech signal fragments. Effectiveness of the signal processing algorithms is examined experimentally together...
-
Analysis of Lombard speech using parameterization and the objective quality indicators in noise conditions
Publication: The aim of the work is to analyze the Lombard speech effect in recordings and then modify the speech signal in order to improve objective speech quality indicators after mixing the useful signal with noise or with an interfering signal. The modifications made to the signal are based on the characteristics of Lombard speech, and in particular on the effect of increasing the fundamental frequency...
-
Methods of Improving Speech Intelligibility for Listeners with Hearing Resolution Deficit
Publication: Methods developed for real-time time scale modification (TSM) of the speech signal are presented. They are based on the non-uniform, speech-rate-dependent SOLA algorithm (Synchronous Overlap and Add). The influence of the proposed method on the intelligibility of speech was investigated for two separate groups of listeners, i.e. hearing-impaired children and elderly listeners. It was shown that for the speech with average rate equal to or...
-
Improved method for real-time speech stretching
Publication: An algorithm for real-time speech stretching is presented. It was designed to modify the input signal depending on its content and on its relation to the historical input data. The proposed algorithm is a combination of speech signal analysis algorithms, i.e. voice, vowels/consonants, stuttering detection and SOLA (Synchronous-Overlap-and-Add) based speech stretching algorithm. This approach enables stretching input speech signal...
-
Jan Daciuk dr hab. inż.
People: Jan Daciuk received his master's degree (magister) from the Faculty of Electronics of Gdańsk University of Technology in 1986, and his doctorate from the Faculty of Electronics, Telecommunications and Informatics of Gdańsk University of Technology in 1999. He has worked at the Faculty since 1988. His research interests include applications of finite-state automata in natural language processing and speech processing. He has spent more than four years at European universities and research institutes, such as...
-
Comparison of Acoustic and Visual Voice Activity Detection for Noisy Speech Recognition
Publication: The problem of accurately differentiating between the speaker utterance and the noise parts in a speech signal is considered. The influence of utilizing voice activity detection in speech signals on the accuracy of the automatic speech recognition (ASR) system is presented. The examined methods of voice activity detection are based on acoustic and visual modalities. The problem of detecting the voice activity in clean and noisy...
-
IMPROVING OBJECTIVE SPEECH QUALITY INDICATORS IN NOISE CONDITIONS [POPRAWA OBIEKTYWNYCH WSKAŹNIKÓW JAKOŚCI MOWY W WARUNKACH HAŁASU]
Publication: The aim of the work is to modify the speech signal so as to improve objective speech quality indicators after mixing the useful signal with noise or with an interfering signal. The modifications made to the signal are based on the characteristics of Lombard speech, and in particular on the effect of raising the fundamental frequency F0. The recording session included sets of words and sentences in Polish, recorded in silence as well as in...
-
Methodology and technology for the polymodal allophonic speech transcription
Publication: A method for automatic audiovisual transcription of speech employing acoustic and visual speech representations is developed. It combines audio and visual modalities, which provides a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e. the changes in the articulatory setting of speech organs for...
-
Methodology and technology for the polymodal allophonic speech transcription
Publication: A method for automatic audiovisual transcription of speech employing acoustic, electromagnetic articulography and visual speech representations is developed. It combines audio and visual modalities, which provides a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e., the changes in the articulatory...
-
Investigating Noise Interference on Speech Towards Applying the Lombard Effect Automatically
Publication: The aim of this study is two-fold. First, we perform a series of experiments to examine the interference of different noises on speech processing. For that purpose, we concentrate on the Lombard effect, an involuntary tendency to raise speech level in the presence of background noise. Then, we apply this knowledge to detecting speech with the Lombard effect. This is for preparing a dataset for training a machine learning-based...
-
AUTOMATIC CLASSIFICATION OF PATHOLOGICAL SPEECH [AUTOMATYCZNA KLASYFIKACJA MOWY PATOLOGICZNEJ]
Publication: The application presented in this chapter is used for automatic detection of pathological speech based on a database of recordings. First, the assumptions underlying the conducted research are presented, along with the choice of the pathological speech database. The algorithms used and the speech signal features that make it possible to distinguish undisturbed speech from pathological speech are also presented. The trained neural networks were then...
-
A Method of Real-Time Non-uniform Speech Stretching
Publication: A method of real-time non-uniform speech stretching is presented. The proposed solution is based on the well-known SOLA algorithm (Synchronous Overlap and Add). Non-uniform time-scale modification is achieved by the adjustment of time scaling factor values in accordance with the signal content. Depending on the speech unit (vowels/consonants), instantaneous rate of speech (ROS), and speech signal presence, values of the scaling factor...
-
Time-domain prosodic modifications for text-to-speech synthesizer
Publication: An application of prosodic speech processing algorithms to Text-To-Speech synthesis is presented. Prosodic modifications that improve the naturalness of the synthesized signal are discussed. The applied method is based on the TD-PSOLA algorithm. The developed Text-To-Speech Synthesizer is used in applications employing multimodal computer interfaces.
-
Improvement of speech intelligibility in the presence of noise interference using the Lombard effect and an automatic noise interference profiling based on deep learning
Publication: The Lombard effect is a phenomenon that results in speech intelligibility improvement when applied to noise. There are many distinctive features of Lombard speech that are recalled in this dissertation. This work proposes the creation of a system capable of improving speech quality and intelligibility in real time, as measured by objective metrics and subjective tests. This system consists of three main components: speech type detection,...
-
Instantaneous complex frequency for pipeline pitch estimation
Publication: In the paper, a pipeline algorithm for estimating the pitch of a speech signal is proposed. The algorithm uses instantaneous complex frequencies estimated for four waveforms obtained by filtering the original speech signal through four bandpass complex Hilbert filters. The imaginary parts of ICFs from each channel give four candidates for pitch estimates. The decision regarding the final estimate is made based on the real parts of...
-
Properties and interpretation of Instantaneous Complex Frequency
Publication: The concept of Instantaneous Complex Frequency (ICF) was first defined by Lindon and developed mainly in the works of two authors, S. Hahn and M. Rojewski. Although it is not widely used in signal analysis, ICF has already been used as a complex signal representation in the verification of handwritten signatures, pitch estimation, symbol timing recovery in a PSK receiver and in detection of anomalies in data transmission. It should be noted,...
-
DAB+ Coverage Analysis: a New Look at Network Planning using GIS Tools
Publication: For many years, the matter of designing a transmitter network, optimized for best signal coverage, has been a subject of intense research. In the last decade, numerous researchers and institutions used GIS and spatial analysis tools for network planning, especially transmitter location. Currently, many existing systems operate in a strictly two-dimensional manner, not taking into account the three-dimensional nature of the analyzed...
-
A Novel Method for Intelligibility Assessment of Nonlinearly Processed Speech in Spaces Characterized by Long Reverberation Times
Publication: Objective assessment of speech intelligibility is a complex task that requires taking into account a number of factors, such as the different perception of each speech sub-band by the human hearing sense or the different physical properties of each frequency band of a speech signal. Currently, the state-of-the-art method used for assessing the quality of speech transmission is the speech transmission index (STI). It is a standardized way...
-
Mechanics of materials (PG_00057378), W/Ć, BM, II stop., sem1,3, lato, 2023/24
Online Courses: The aims of this course are: providing knowledge in the field of analysis and solving problems of mechanics and strength of one-dimensional systems (bars, beams, frames) and selected two-dimensional systems (shields, plates); preparing the student to solve problems involving complex cases of material strength; developing the ability to assess the stability of structural elements (forms of stability loss, critical forces); consolidation...
-
Training strategies for a neural estimator of the laryngeal tone frequency using a synthetic vowel generator [Strategie treningu neuronowego estymatora częstotliwości tonu krtaniowego z użyciem generatora syntetycznych samogłosek]
Publication: Many telecommunications applications involve the problem of processing or analyzing a speech signal, in which an estimator of the laryngeal tone frequency (pitch) is used, often at the level of the basic algorithms. The estimator considered in this work is based on a neural classifier that makes decisions on the basis of the instantaneous frequency and power determined in subbands of the analyzed speech signal. In this work we consider...
-
Natalia Sokół dr inż.
People: BACKGROUND: Master of Science in Light and Lighting (2008-2009/11), The UCL Bartlett School of Graduate Studies, Faculty of the Built Environment, London, UK, www.bartlett.ucl.ac.uk; MA Degree in Interior Architecture (1999-2004), The Academy of Fine Arts, Poznan, Poland, www.uap.edu.pl; MA Degree in Art Education (1997-2002), Academy of Fine Arts, Poznan, Poland, www.uap.edu.pl. MAIN RESEARCH AREAS · ...
-
Audio-visual aspect of the Lombard effect and comparison with recordings depicting emotional states.
Publication: In this paper, an analysis of audio-visual recordings of the Lombard effect is shown. First, the audio signal is analyzed, indicating the presence of this phenomenon in the recorded sessions. The principal aim, however, was to discuss problems related to extracting differences caused by the Lombard effect, present in the video, i.e. visible as tension and work of facial muscles aligned to an increase in the intensity of the articulated...
-
A Comparison of STI Measured by Direct and Indirect Methods for Interiors Coupled with Sound Reinforcement Systems
Publication: This paper presents a comparison of STI (Speech Transmission Index) coefficient measurement results carried out by direct and indirect methods. First, acoustic parameters important in the context of public address and sound reinforcement systems are recalled. A measurement methodology is presented that employs various test signals to determine impulse responses. The process of evaluating sound system performance, signals enabling...
-
Rediscovering Automatic Detection of Stuttering and Its Subclasses through Machine Learning—The Impact of Changing Deep Model Architecture and Amount of Data in the Training Set
Publication: This work deals with automatically detecting stuttering and its subclasses. An effective classification of stuttering along with its subclasses could find wide application in determining the severity of stuttering by speech therapists, preliminary patient diagnosis, and enabling communication with the previously mentioned voice assistants. The first part of this work provides an overview of examples of classical and deep learning...
-
Marking the Allophones Boundaries Based on the DTW Algorithm
Publication: The paper presents an approach to marking the boundaries of allophones in the speech signal based on the Dynamic Time Warping (DTW) algorithm. Setting and marking allophone boundaries in continuous speech is a difficult issue due to the mutual influence of adjacent phonemes on each other. It is this neighborhood that, on the one hand, creates variants of phonemes, that is, allophones, and on the other hand it affects that the border...
-
Enhanced voice user interface employing spatial filtration of signals from acoustic vector sensor
Publication: Spatial filtration of sound is introduced to enhance speech recognition accuracy in noisy conditions. An acoustic vector sensor (AVS) is employed. The signals from the AVS probe are processed in order to attenuate the surrounding noise. As a result, the signal-to-noise ratio is increased. An experiment is featured in which speech signals are disturbed by babble noise. The signals before and after spatial filtration are processed...