Search results for: REAL-TIME SPEECH STRETCHING

<title>Decomposition of MATLAB script for FPGA implementation of real time simulation algorithms for LLRF system in European XFEL</title>

Publication

K. Bujnowski
P. Pucyk
K. Pozniak
R. Romaniuk
R. S. Romaniuk

- Year 2007

Full text to download in external service

Towards solving heterogeneous fleet vehicle routing problem with time windows and additional constraints: real use case study

Publication

- Annals of Computer Science and Information Systems - Year 2016

In advanced logistic systems, there is a need for a comprehensive optimization of the transport of goods, which would reduce costs. During past decades, several theoretical and practical approaches to solve vehicle routing problems (VRP) were proposed. The problem of optimal fleet management is often transformed to discrete optimization problem that relies on determining the most economical transport routes for a number of vehicles...

Full text available to download

Investigation of RH effect on uncommon limonene ozonolysis products and SOA formation in indoor air with real time measurement techniques

Publication

- CHEMOSPHERE - Year 2024

Scientific interest in SOA influence on indoor air quality increases since last 20 years. It is well known, that particles of nano-sized diameter pose a threat for human health causing, among others: eye, upper airway irritation, inflammatory response in cells, worsening asthma, hypertension, diabetes, and central nervous dysfunction. Terpenes are reactive VOCs, commonly emitted in indoor air and considered to be SOA precursors...

Full text available to download

Direct use of point clouds in real-time interaction with the cultural heritage in pandemic and post-pandemic tourism on the case of Kłodzko Fortress

Publication

J. Franczuk
K. Boguszewska
S. Parrinello
A. Dell'Amico
F. Galasso
P. Gleń

- Digital Applications in Archaeology and Cultural Heritage - Year 2022

Full text to download in external service

Expression of Selected Connexin and Aquaporin Genes and Real-Time Proliferation of Porcine Endometrial Luminal Epithelial Cells in Primary Culture Model

Publication

K. Wojtanowicz-Markiewicz
M. Kulus
S. Knap
I. Kocherova
M. Jankowski
K. Stefańska
M. Jeseta
H. Piotrowska-Kempisty
D. Bukowska
M. Zabel... and 4 others

- Biomed Research International - Year 2020

Full text to download in external service

A label-free graphene-based impedimetric biosensor for real-time tracing of the cytokine storm in blood serum; suitable for screening COVID-19 patients

Publication

M. Khayamian
M. Parizi
M. Ghaderinia
H. Abadijoo
S. Vanaei
H. Simaee
S. Abdolhosseini
S. Shalileh
M. Faramarzpour
V. Naeini... and 5 others

- RSC Advances - Year 2021

Full text to download in external service

Influence of Estradiol-17beta on Progesterone and Estrogen Receptor mRNA Expression in Porcine Follicular Granulosa Cells during Short-Term, In Vitro Real-Time Cell Proliferation

Publication

S. Ciesiółka
J. Budna
K. Jopek
A. Bryja
W. Kranc
A. Chachuła
S. Borys
M. Dyszkiewicz
A. Ziółkowska
P. Antosik... and 6 others

- Biomed Research International - Year 2016

Full text to download in external service

Short-term Cultivation of Porcine Cumulus Cells Influences the Cyclin-dependent Kinase 4 (Cdk4) and Connexin 43 (Cx43) Protein Expression—A Real-time Cell Proliferation Approach

Publication

B. KEMPISTY
A. ZIÓŁKOWSKA
H. PIOTROWSKA
S. CIESIÓŁKA
P. ANTOSIK
D. BUKOWSKA
P. ZAWIERUCHA
M. WOŹNA
J. JAŚKOWSKI
K. BRÜSSOW... and 2 others

- JOURNAL OF REPRODUCTION AND DEVELOPMENT - Year 2013

Full text to download in external service

Novel analytical approach for real-time monitoring of volatile Maillard reaction products emitted from the sugar-amino acid model system using proton transfer reaction mass spectrometry

Publication

- Year 2021

In the presented research, volatile Maillard reaction products formation in the two sugar-amino acid model systems, namely glucoselysine and ribose-lysine model systems were investigated using proton transfer reaction mass spectrometry. Obtained data were supported by the reference method, i.e., UV/Vis spectrometry. A number of volatile organic compounds were selected based on the correlation of the effect of Maillard reaction...

Full text to download in external service

Ontology groups representing angiogenesis and blood vessels development are highly up-regulated during porcine oviductal epithelial cells long-term real-time proliferation – a primary cell culture approach

Publication

M. Nawrocki
P. Celichowski
M. Jankowski
W. Kranc
A. Bryja
S. Borys-Wójcik
M. Jeseta
P. Antosik
D. Bukowska
M. Bruska... and 3 others

- Medical Journal of Cell Biology - Year 2018

Full text to download in external service

Improvement of speech intelligibility in the presence of noise interference using the Lombard effect and an automatic noise interference profiling based on deep learning

Publication

K. Kąkol

- Year 2023

The Lombard effect is a phenomenon that results in speech intelligibility improvement when applied to noise. There are many distinctive features of Lombard speech that were recalled in this dissertation. This work proposes the creation of a system capable of improving speech quality and intelligibility in real-time measured by objective metrics and subjective tests. This system consists of three main components: speech type detection,...

Full text available to download

Methods of Improving Speech Intelligibility for Listeners with Hearing Resolution Deficit

Publication

- Diagnostic Pathology - Year 2012

Methods developed for real-time time scale modification (TSM) of speech signal are presented. They are based onthe non-uniform, speech rate depended SOLA algorithm (Synchronous Overlap and Add). Influence of theproposed method on the intelligibility of speech was investigated for two separate groups of listeners, i.e. hearingimpaired children and elderly listeners. It was shown that for the speech with average rate equal to or...

Full text available to download

Detecting Lombard Speech Using Deep Learning Approach

Publication

K. Kąkol
G. Korvel
G. Tamulevicius
B. Kostek

- SENSORS - Year 2023

Robust Lombard speech-in-noise detecting is challenging. This study proposes a strategy to detect Lombard speech using a machine learning approach for applications such as public address systems that work in near real time. The paper starts with the background concerning the Lombard effect. Then, assumptions of the work performed for Lombard speech detection are outlined. The framework proposed combines convolutional neural networks...

Full text available to download

Acoustic Sensing Analytics Applied to Speech in Reverberation Conditions

Publication

- SENSORS - Year 2021

The paper aims to discuss a case study of sensing analytics and technology in acoustics when applied to reverberation conditions. Reverberation is one of the issues that makes speech in indoor spaces challenging to understand. This problem is particularly critical in large spaces with few absorbing or diffusing surfaces. One of the natural remedies to improve speech intelligibility in such conditions may be achieved through speaking...

Full text available to download

Modeling and Simulation for Exploring Power/Time Trade-off of Parallel Deep Neural Network Training

Publication

P. Rościszewski

- Procedia Computer Science - Year 2017

In the paper we tackle bi-objective execution time and power consumption optimization problem concerning execution of parallel applications. We propose using a discrete-event simulation environment for exploring this power/time trade-off in the form of a Pareto front. The solution is verified by a case study based on a real deep neural network training application for automatic speech recognition. A simulation lasting over 2 hours...

Full text available to download

Comparison of Acoustic and Visual Voice Activity Detection for Noisy Speech Recognition

Publication

- Year 2016

The problem of accurate differentiating between the speaker utterance and the noise parts in a speech signal is considered. The influence of utilizing a voice activity detection in speech signals on the accuracy of the automatic speech recognition (ASR) system is presented. The examined methods of voice activity detection are based on acoustic and visual modalities. The problem of detecting the voice activity in clean and noisy...

Chirp Rate and Instantaneous Frequency Estimation: Application to Recursive Vertical Synchrosqueezing

Publication

D. Fourer
F. Auger
K. Czarnecki
S. Meignen
P. Flandrin

- IEEE SIGNAL PROCESSING LETTERS - Year 2017

This letter introduces new chirp rate and instantaneous frequency estimators designed for frequency-modulated signals. These estimators are first investigated from a deterministic point of view, then compared together in terms of statistical efficiency. They are also used to design new recursive versions of the vertically synchrosqueezed short-time Fourier transform, using a previously published method (D. Fourer, F. Auger, and...

Full text available to download

Instantaneous complex frequency for pipeline pitch estimation

Publication

M. [. Kaniewska

- Year 2010

In the paper a pipeline algorithm for estimating the pitch of speech signal is proposed. The algorithm uses instantaneous complex frequencies estimated for four waveforms obtained by filtering the original speech signal through four bandpass complex Hilbert filters. The imaginary parts of ICFs from each channel give four candidates for pitch estimates. The decision regarding the final estimate is made based on the real parts of...

Investigating Feature Spaces for Isolated Word Recognition

Publication

G. Korvel
G. Tamulevicus
P. Treigys
J. Bernataviciene
B. Kostek

- Year 2018

Much attention is given by researchers to the speech processing task in automatic speech recognition (ASR) over the past decades. The study addresses the issue related to the investigation of the appropriateness of a two-dimensional representation of speech feature spaces for speech recognition tasks based on deep learning techniques. The approach combines Convolutional Neural Networks (CNNs) and timefrequency signal representation...

Virtual keyboard controlled by eye gaze employing speech synthesis

Publication

- Year 2010

The article presents the speech synthesis integrated into the eye gaze tracking system. This approach can significantly improve the quality of life of physically disabled people who are unable to communicate. The virtual keyboard (QWERTY) is an interface which allows for entering the text for the speech synthesizer. First, this article describes a methodology of determining the fixation point on a computer screen. Then it presents...

Virtual Keyboard controlled by eye gaze employing speech synthesis

Publication

- Elektronika : konstrukcje, technologie, zastosowania - Year 2011

The article presents the speech synthesis integrated into the eye gaze tracking system. This approach can significantly improve the quality of life of physically disabled people who are unable to communicate. The virtual keyboard (QWERTY) is an interface which allows for entering the text for the speech synthesizer. First, this article describes a methodology of determining the fixation point on a computer screen. Then it presents...

Full text to download in external service

Subjective Quality Evaluation of Speech Signals Transmitted via BPL-PLC Wired System

Publication

P. Falkowski-Gilski
G. Debita
M. Habrych
B. Miedziński
P. Jedlikowski
B. Polnik
J. Wandzio
X. Wang

- Year 2020

The broadband over power line – power line communication (BPL-PLC) cable is resistant to electricity stoppage and partial damage of phase conductors. It maintains continuity of transmission in case of an emergency. These features make it an ideal solution for delivering data, e.g. in an underground mine environment, especially clear and easily understandable voice messages. This paper describes a subjective quality evaluation of...

Full text to download in external service

Secured wired BPL voice transmission system

Publication

G. Debita
P. Falkowski-Gilski
M. Habrych
B. Miedziński
J. Wandzio
P. Jedlikowski

- Scientific Journal of the Military University of Land Forces - Year 2020

Designing a secured voice transmission system is not a trivial task. Wired media, thanks to their reliability and resistance to mechanical damage, seem an ideal solution. The BPL (Broadband over Power Line) cable is resistant to electricity stoppage and partial damage of phase conductors, ensuring continuity of transmission in case of an emergency. It seems an appropriate tool for delivering critical data, mostly clear and understandable...

Full text available to download

Speech Intelligibility Measurements in Auditorium

Publication

K. Leo

- ACTA PHYSICA POLONICA A - Year 2010

Speech intelligibility was measured in Auditorium Novum on Technical University of Gdansk (seating capacity 408, volume 3300 m3). Articulation tests were conducted; STI and Early Decay Time EDT coefficients were measured. Negative noise contribution to speech intelligibility was taken into account. Subjective measurements and objective tests reveal high speech intelligibility at most seats in auditorium. Correlation was found between...

Full text available to download

Marking the Allophones Boundaries Based on the DTW Algorithm

Publication

J. Rafałko

- Year 2018

The paper presents an approach to marking the boundaries of allophones in the speech signal based on the Dynamic Time Warping (DTW) algorithm. Setting and marking of allophones boundaries in continuous speech is a difficult issue due to the mutual influence of adjacent phonemes on each other. It is this neighborhood on the one hand that creates variants of phonemes that is allophones, and on the other hand it affects that the border...

An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics

Publication

G. Korvel
O. Kurasova
B. Kostek

- Year 2019

The speech with the Lombard effect has been extensively studied in the context of speech recognition or speech enhancement. However, few studies have investigated the Lombard effect in the context of speech synthesis. The aim of this paper is to create a mathematical model that allows for retaining the Lombard effect. These models could be used as a basis of a formant speech synthesizer. The proposed models are based on dividing...

Full text available to download

Investigating Feature Spaces for Isolated Word Recognition

Publication

P. Treigys
G. Korvel
G. Tamulevicius
J. Bernataviciene
B. Kostek

- Year 2020

The study addresses the issues related to the appropriateness of a two-dimensional representation of speech signal for speech recognition tasks based on deep learning techniques. The approach combines Convolutional Neural Networks (CNNs) and time-frequency signal representation converted to the investigated feature spaces. In particular, waveforms and fractal dimension features of the signal were chosen for the time domain, and...

Full text to download in external service

Detection and localization of selected acoustic events in 3D acoustic field for smart surveillance applications

Publication

- Communications in Computer and Information Science - Year 2011

A method for automatic determination of position of chosen sound events such as speech signals and impulse sounds in 3-dimensional space is presented. The events are localized in the presence of sound reflections employing acoustic vector sensors. Human voice and impulsive sounds are detected using adaptive detectors based on modified peak-valley difference (PVD) parameter and sound pressure level. Localization based on signals...

Full text to download in external service

Detection and localization of selected acoustic events in acoustic field for smart surveillance applications

Publication

- MULTIMEDIA TOOLS AND APPLICATIONS - Year 2014

A method for automatic determination of position of chosen sound events such as speech signals and impulse sounds in 3-dimensional space is presented. The evens are localized in the presence of sound reflections employing acoustic vector sensors. Human voice and impulsive sounds are detected using adaptive detectors based on modified peak-valley difference (PVD) parameter and sound pressure level. Localization based on signals...

Full text available to download

Auditory-visual attention stimulator

Publication

- Year 2013

New approach to lateralization irregularities formation was proposed. The emphasis is put on the relationship between visual and auditory attention stimulation. In this approach hearing is stimulated using time scale modified speech and sight is stimulated by rendering the text of the currently heard speech. Moreover, displayed text is modified using several techniques i.e. zooming, highlighting etc. In the experimental part of...

Full text to download in external service

Subjective and Objective Comparative Study of DAB+ Broadcast System

Publication

- Archives of Acoustics - Year 2017

Broadcasting services seek to optimize their use of bandwidth in order to maximize user’s quality of experience. They aim to transmit high-quality digital speech and music signals at the lowest bitrate. They intend to offer the best quality under available conditions. Due to bandwidth limitations, audio quality is in conflict with the number of transmitted radio programs. This paper analyzes whether the quality of real-time digital...

Full text available to download

An audio-visual corpus for multimodal automatic speech recognition

Publication

- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Year 2017

review of available audio-visual speech corpora and a description of a new multimodal corpus of English speech recordings is provided. The new corpus containing 31 hours of recordings was created specifically to assist audio-visual speech recognition systems (AVSR) development. The database related to the corpus includes high-resolution, high-framerate stereoscopic video streams from RGB cameras, depth imaging stream utilizing Time-of-Flight...

Full text available to download

Noise profiling for speech enhancement employing machine learning models

Publication

K. Kąkol
G. Korvel
B. Kostek

- Journal of the Acoustical Society of America - Year 2022

This paper aims to propose a noise profiling method that can be performed in near real-time based on machine learning (ML). To address challenges related to noise profiling effectively, we start with a critical review of the literature background. Then, we outline the experiment performed consisting of two parts. The first part concerns the noise recognition model built upon several baseline classifiers and noise signal features...

Full text available to download

Multimodal English corpus for automatic speech recognition

Publication

- Year 2013

A multimodal corpus developed for research of speech recognition based on audio-visual data is presented. Besides usual video and sound excerpts, the prepared database contains also thermovision images and depth maps. All streams were recorded simultaneously, therefore the corpus enables to examine the importance of the information provided by different modalities. Based on the recordings, it is also possible to develop a speech...

Comparative analysis of various transformation techniques for voiceless consonants modeling

Publication

G. Korvel
B. Kostek
O. Kurasova

- International Journal of Computers Communications & Control - Year 2018

In this paper, a comparison of various transformation techniques, namely Discrete Fourier Transform (DFT), Discrete Cosine Transform (DCT) and Discrete Walsh Hadamard Transform (DWHT) are performed in the context of their application to voiceless consonant modeling. Speech features based on these transformation techniques are extracted. These features are mean and derivative values of cepstrum coefficients, derived from each transformation....

Full text available to download

Distortion of speech signals in the listening area: its mechanism and measurements

Publication

- Year 2014

The paper deals with a problem of the influence of the number and distribution of loudspeakers in speech reinforcement systems on the quality of publicly addressed voice messages, namely on speech intelligibility in the listening area. Linear superposition of time-shifted broadband waves of a same form and slightly different magnitudes that reach a listener from numerous coherent sources, is accompanied by interference effects...

Full text to download in external service

Speech Analytics Based on Machine Learning

Publication

- Year 2019

In this chapter, the process of speech data preparation for machine learning is discussed in detail. Examples of speech analytics methods applied to phonemes and allophones are shown. Further, an approach to automatic phoneme recognition involving optimized parametrization and a classifier belonging to machine learning algorithms is discussed. Feature vectors are built on the basis of descriptors coming from the music information...

Full text to download in external service

Performance Analysis of the OpenCL Environment on Mobile Platforms

Publication

- Year 2022

Today’s smartphones have more and more features that so far were only assigned to personal computers. Every year these devices are composed of better and more efficient components. Everything indicates that modern smartphones are replacing ordinary computers in various activities. High computing power is required for tasks such as image processing, speech recognition and object detection. This paper analyses the performance of...

Full text to download in external service

Modeling and Designing Acoustical Conditions of the Interior – Case Study

Publication

- Archives of Acoustics - Year 2016

The primary aim of this research study was to model acoustic conditions of the Courtyard of the Gdańsk University of Technology Main Building, and then to design a sound reinforcement system for this interior. First, results of measurements of the parameters of the acoustic field are presented. Then, the comparison between measured and predicted values using the ODEON program is shown. Collected data indicate a long reverberation...

Full text available to download

Ultrawideband transmission in physical channels: a broadband interference view

Publication

- HYDROACOUSTICS - Year 2014

The superposition of multipath components (MPC) of an emitted wave, formed by reflections from limiting surfaces and obstacles in the propagation area, strongly affects communication signals. In the case of modern wideband systems, the effect should be seen as a broadband counterpart of classical interference which is the cause of fading in narrowband systems. This paper shows that in wideband communications, the time- and frequency-domain...

Full text available to download

Threshold photoelectron studies of isoxazole over the energy range 9.9-30 eV

Publication

M. Dampc
B. Mielewska
M. Siggel-King
G. C. King
B. Sivaraman
S. Ptasińska
N. J. Mason
M. Zubek

- CHEMICAL PHYSICS - Year 2010

The threshold photoelectron spectrum of the isoxazole molecule, C3H3NO has been measured over the photon energy range 9.9-30 eV with the use of synchrotron radiation. In the 9.9-10.8 eV range, corresponding to photoionization from the highest occupied molecular orbital 3a"(π3), seven well resolved vibrational series have been observed and their modes are tentatively assigned. A strong adiabatic ionization, with an energy of 11.132...

Full text to download in external service

Automated Classifier Development Process for Recognizing Book Pages from Video Frames

Publication

- Communications in Computer and Information Science - Year 2020

One of the latest developments made by publishing companies is introducing mixed and augmented reality to their printed media (e.g. to produce augmented books). An important computer vision problem that they are facing is classification of book pages from video frames. The problem is non-trivial, especially considering that typical training data is limited to only one digital original per book page, while the trained classifier...

Full text to download in external service

Flock behavior and control

Publication

K. Radziszewski
A. Krężlik

- Year 2016

In this paper we present the results of the Flock Behaviour and Control workshop cluster during “Shapes of Logic Conference 2015”. During the event, students got familiar with the techniques of both visual and sound real-time data processing. The second topic presented for students was behaviourbased approach of design process, mainly based on the mathematical rules set up by Craig Raynolds on the swarm behaviour. The aim of the...

A Novel Approach to the Assessment of Cough Incidence

Publication

- Year 2013

In this paper we consider the problem of identication of cough events in patients suffering from chronic respiratory diseases. The information about frequency of cough events is necessary to medical treatment. The proposed approach is based on bidirectional processing of a measured vibration signal - cough events are localized by combining the results of forward-time and backward-time analysis. The signal is at rst transformed...

Full text to download in external service

Quality Evaluation of Novel DTD Algorithm Based on Audio Watermarking

Publication

- Year 2011

Echo cancellers typically employ a doubletalk detection (DTD) algorithm in order to keep the adaptive filter from diverging in the presence of near-end speech signal or other disruptive sounds in the microphone signal. A novel doubletalk detection algorithm based on techniques similar to those used for audio signal watermarking was introduced by the authors. The application of the described DTD algorithm within acoustic echo cancellation...

Full text to download in external service

Determining the noise impact on hearing using psychoacoustical noise dosimeter

Publication

- Archives of Acoustics - Year 2007

This research study presents the designed noise dosimeter based on psychoacoustical properties of the human hearing system and, at the same time. evaluation of time and frequency characteristics of noise. The designed noise dosimeter enables assessing temporary threshold shift (TTS) in critical hands in real time. In this way it is possible monitoring the hearing threshold shift continuously for people who stay in the harmful noise...

Full text available to download

Hardware-Software Implementation of Basic Principles Simulator of Nuclear Reactor Processes

Publication

- Acta Energetica - Year 2016

The paper presents implementation process of basic principle simulators of a nuclear reactor processes. Simulators are based on point-models of processes: kinetics of neutrons, heat generation and exchange, poisoning and burning-up nuclear fuel. Reference simulator was developed in MATLAB/Simulink without taking into account real-time operation. Second simulator was built using the toolbox xPC with hard real-time requirements....

Full text available to download

Neural network based algorithm for hand gesture detection in a low-cost microprocessor applications

Publication

- Year 2020

In this paper the simple architecture of neural network for hand gesture classification was presented. The network classifies the previously calculated parameters of EMG signals. The main goal of this project was to develop simple solution that is not computationally complex and can be implemented on microprocessors in low-cost 3D printed prosthetic arms. As the part of conducted research the data set EMG signals corresponding...

Full text to download in external service

Examining Feature Vector for Phoneme Recognition / Analiza parametrów w kontekście automatycznej klasyfikacji fonemów

Publication

- Year 2017

The aim of this paper is to analyze usability of descriptors coming from music information retrieval to the phoneme analysis. The case study presented consists in several steps. First, a short overview of parameters utilized in speech analysis is given. Then, a set of time and frequency domain-based parameters is selected and discussed in the context of stop consonant acoustical characteristics. A toolbox created for this purpose...

Wulkanizaty kauczuku naturalnego otrzymane z użyciem plastyfikatorów pochodzenia roślinnego

Publication

- Elastomery - Year 2019

Celem niniejszej pracy było zbadanie wpływu epoksydowanych olejów roślinnych na wybrane właściwości wul-kanizatów kauczuku naturalnego. Jako plastyfikatorów użyto epoksydowanego oleju sojowego oraz epoksydo-wanego oleju palmowego. Wpływ olejów naturalnych porównano także z wulkanizatami przygotowanymi bez użycia zmiękczaczy, jak i zawierającymi olej maszynowy, pochodzący z przerobu ropy naftowej. Zbadano wpływ rodzaju...

Full text available to download

Search

Filters

Catalog

Category

Year

Options

Search results for: REAL-TIME SPEECH STRETCHING