Search results for: SPEECH STRETCHING

Search results for: SPEECH STRETCHING

results on page:
embed this view on your website

Displayed results came from alternative search method.

Filters

total: 1948

clear all filters disabled

displaying 1000 best results Help

Improved method for real-time speech stretching
Publication
- A. Kupryjanow
- A. Czyżewski
- Year 2012
n algorithm for real-time speech stretching is presented. It was designed to modify input signal dependently on its content and on its relation with the historical input data. The proposed algorithm is a combination of speech signal analysis algorithms, i.e. voice, vowels/consonants, stuttering detection and SOLA (Synchronous-Overlap-and-Add) based speech stretching algorithm. This approach enables stretching input speech signal...

Full text to download in external service
A Method of Real-Time Non-uniform Speech Stretching
Publication
- A. Kupryjanow
- A. Czyżewski
- Year 2012
Developed method of real-time non-uniform speech stretching is presented.The proposed solution is based on the well-known SOLA algorithm(Synchronous Overlap and Add). Non-uniform time-scale modification isachieved by the adjustment of time scaling factor values in accordance with thesignal content. Dependently on the speech unit (vowels/consonants), instantaneousrate of speech (ROS), and speech signal presence, values of the scalingfactor...

Full text to download in external service
A non-uniform real-time speech time-scale stretching method
Publication
- A. Kupryjanow
- A. Czyżewski
- Year 2011
An algorithm for non-uniform real-time speech stretching is presented. It provides a combination of typical SOLA algorithm (Synchronous Overlap and Add ) with the vowels, consonants and silence detectors. Based on the information about the content and the estimated value of the rate of speech (ROS), the algorithm adapts the scaling factor value. The ability of real-time speech stretching and the resultant quality of voice were...
Real-time speech streching for supporting hearing impaired schoolchildren
Publication
- A. Kupryjanow
- A. Czyżewski
- Elektronika : konstrukcje, technologie, zastosowania - Year 2010
A study of time scale modification algorithms applied to support hearing impaired schoolchildren is presented. Variety of algorithms are considered, namely: overlap-and add, two variations of synchronous overlapand- add, and the phase vocoder. Their effectiveness as well as real-time processing capabilities are examined.

Full text to download in external service
Test Stand for Multi-option Stretching of Soft Tissues
Publication
- G. Rotta
- S. Grymek
- Year 2020
The paper presents the genesis of the design and possibilities of the test stand for stretching soft tissues as well as examples of tests carried out on this device.

Full text to download in external service
Test Stand for Multi-option Stretching of Soft Tissues
Publication
- G. Rotta
- S. Grymek
- Year 2020
The paper presents the genesis of the design and possibilities of the test stand for stretching soft tissues as well as examples of tests carried out on this device.

Full text to download in external service
Unusual Influence of Fluorinated Anions on the Stretching Vibrations of Liquid Water
Publication
- M. Śmiechowski
- JOURNAL OF PHYSICAL CHEMISTRY B - Year 2018
Infrared (IR) spectroscopy is a commonly used and invaluable tool in the studies of solvation phenomena in aqueous solutions. Concurrently, ab initio molecular dynamics (AIMD) simulations deliver the solvation shell picture at a molecular detail level and allow for a consistent decomposition of the theoretical IR spectrum into underlying spatial correlations. Here, we demonstrate how the novel spectral decomposition techniques...

Full text available to download
Speech and Drama

Journals

ISSN: 0038-7142
LANGUAGE AND SPEECH

Journals

ISSN: 0023-8309 , eISSN: 1756-6053
Evaluation of Lombard Speech Models in the Context of Speech in Noise Enhancement
Publication
- G. Korvel
- K. Kąkol
- O. Kurasova
- B. Kostek
- IEEE Access - Year 2020
The Lombard effect is one of the most well-known effects of noise on speech production. Speech with the Lombard effect is more easily recognizable in noisy environments than normal natural speech. Our previous investigations showed that speech synthesis models might retain Lombard-effect characteristics. In this study, we investigate several speech models, such as harmonic, source-filter, and sinusoidal, applied to Lombard speech...

Full text available to download
High Speed Milling vibration surveillance with optimal spindle speed based on optimal speeds map
Publication
- Key Engineering Materials - Year 2014
The paper presents the method of the surveillance of the self-excited chatter vibration. At first, the workpiece modal parameters are estimated based on experimental data which leads to verification of computational model. Then, for selected surface points optimal spindle speeds are calculated. By considering sufficient amount of points it is possible to build a map of optimal spindle speeds. Experimental results show that this...

Full text to download in external service
Emotions in polish speech recordings
Open Research Data
open access
- M. Mięsikowska
- D. Świsulski
The data set presents emotions recorded in sound files that are expressions of Polish speech. Statements were made by people aged 21-23, young voices of 5 men. Each person said the following words / nie – no, oddaj - give back, podaj – pass, stop - stop, tak - yes, trzymaj -hold / five times representing a specific emotion - one of three - anger (a),...
Estimation of the excitation variances of speech and noise AR-models for enhanced speech coding
Publication
- M. Kuropatwinski
- W. Kleijn
- M. Kuropatwiński
- Year 2001
Full text to download in external service
Impact and stretching standardized tests as useful tools for assessment of viscoelastic behavior for highly rubberized asphalt binder
Publication
- X. Yu
- W. Yang
- L. Zhang
- K. Formela
- S. Wang
- CONSTRUCTION AND BUILDING MATERIALS - Year 2022
Asphalt binder is generally identified as a brittle material at low service temperature or under high-speed load, and the brittleness becomes serious after weathering aging. Improving the toughness of asphalt binder through adding high-content of crumb tire rubber is an efficient method to solve this problem. Devulcanized rubber modified asphalt binder (DRMA) with different contents (15–40%) of devulcanized rubber (DR) were prepared...

Full text to download in external service
COMPUTER SPEECH AND LANGUAGE

Journals

ISSN: 0885-2308 , eISSN: 1095-8363
SEMINARS IN SPEECH AND LANGUAGE

Journals

ISSN: 0734-0478 , eISSN: 1098-9056
Speech and Language Technology

Journals

ISSN: 1895-0434
Speech Language and Hearing

Journals

ISSN: 1361-3286 , eISSN: 2050-5728
Quarterly Journal of Speech

Journals

ISSN: 0033-5630 , eISSN: 1479-5779
SpringerBriefs in Speech Technology

Journals

ISSN: 2191-737X , eISSN: 2191-7388
Audiology and Speech Research

Journals

ISSN: 2635-5019 , eISSN: 2635-5027
Voice and Speech Review

Journals

ISSN: 2326-8263 , eISSN: 2326-8271
Optimizing Medical Personnel Speech Recognition Models Using Speech Synthesis and Reinforcement Learning
Publication
- A. Czyżewski
- Journal of the Acoustical Society of America - Year 2023
Text-to-Speech synthesis (TTS) can be used to generate training data for building Automatic Speech Recognition models (ASR). Access to medical speech data is because it is sensitive data that is difficult to obtain for privacy reasons; TTS can help expand the data set. Speech can be synthesized by mimicking different accents, dialects, and speaking styles that may occur in a medical language. Reinforcement Learning (RL), in the...

Full text available to download
Speech Intelligibility Measurements in Auditorium
Publication
- K. Leo
- ACTA PHYSICA POLONICA A - Year 2010
Speech intelligibility was measured in Auditorium Novum on Technical University of Gdansk (seating capacity 408, volume 3300 m3). Articulation tests were conducted; STI and Early Decay Time EDT coefficients were measured. Negative noise contribution to speech intelligibility was taken into account. Subjective measurements and objective tests reveal high speech intelligibility at most seats in auditorium. Correlation was found between...

Full text available to download
Intelligent processing of stuttered speech.
Publication
- A. Czyżewski
- A. Kaczmarek
- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Year 2003
W artykule zaprezentowano kilka metod analizy i automatycznego zliczania potknięć artykulacyjnych, związanych z jąkaniem się, opartych na wykorzystaniu algorytmów uczących się sztucznych sieci neuronowych i zbiorów przybliżonych.
Language Models in Speech Recognition
Publication
- J. Daciuk
- Year 2022
This chapter describes language models used in speech recognition, It starts by indicating the role and the place of language models in speech recognition. Mesures used to compare language models follow. An overview of n-gram, syntactic, semantic, and neural models is given. It is accompanied by a list of popular software.

Full text to download in external service
Comparison of Language Models Trained on Written Texts and Speech Transcripts in the Context of Automatic Speech Recognition
Publication
- S. Dziadzio
- A. Nabożny
- A. Smywiński-Pohl
- B. Ziółko
- Year 2015
Full text to download in external service
Novel approaches to wideband speech coding
Publication
- M. Kulesza
- A. Czyżewski
- Year 2008
Dwie metoda kodowania szerokopasmowego mowy zostały zaprezentowane. W pierwszej metodzie wykorzystano algorytm kompresji i ekspansji czasowej sygnału mowy, pozwalający na kodowanie szerokopasmowe sygnału mowy z wykorzystaniem ustandaryzowanych kodeków. Metoda ta jest przewidziana do zastosowania w adaptacyjnych algorytmach kodowania mowy. Drugie z proponowanych rozwiazan dotyczy nowej metody estymacji obwiedni widma sygnalu mowy...

Full text to download in external service
Transient detection for speech coding applications
Publication
- International Journal of Computer Science and Network Security - Year 2006
Signal quality in speech codecs may be improved by selecting transients from speech signal and encoding them using a suitable method. This paper presents an algorithm for transient detection in speech signal. This algorithm operates in several frequency bands. Transient detection functions are calculated from energy measured in short frames of the signal. The final selection of transient frames is based on results of detection...

Full text to download in external service
Broadband interference in speech reinforcement systems
Publication
- H. Lasota
- R. Mazurek
- Year 2008
Artykuł podejmuje niedoceniany problem wpływu liczby i rozkładu głośników w systemach nagłośnienia, na jakość przekazu głosowego, czyli na zrozumiałość mowy w audytoriach. Superpozycji przesuniętych w czasie szerokopasmowych sygnałów o tym samym kształcie i lekko różnych wielkościach, które docierają do słuchacza z licznych spójnych źródeł, towarzyszy zjawisko interferencji prowadzące do głębokiej modyfikacji odbieranych sygnałów...
A system for multitask noisy speech enhancement.
Publication
- A. Czyżewski
- A. Kaczmarek
- J. Kotus
- A. Pawlik
- A. Rypulak
- P. Żwan
- Year 2004
W artykule przedstawiono ogolną charakterystyke opracowanego systemu rejestracji i rekonstrukcji mowy. Artykuł zawiera opis składników systemu, ktory jest oprogramowaniem zawierającym zaawansowane narzędzia służące poprawie zrozumiałości mowy. Zaimplementowane narzędzia systemu umożliwiają wyszukiwanie nagrań dźwiękowych i ich obróbkę przy pomocy zaimplementowanych pluginów. W artykule przedstawione wykorzystane w systemie algorytmy...
Multitask Noisy Speech Enhancement System
Publication
- A. Czyżewski
- J. Kotus
- G. Szwoch
- M. Dziubiński
- A. Rypulak
- A. Pawlik
- Year 2005
W referacie opisano Wielozadaniowy System Poprawy Jakości Sygnału Mowy. Jest to wyspecjalizowany pakiet oprogramowania przeznaczony do rejestrowania sygnału mowy i do poprawy jego jakości oraz zrozumiałości mowy, przy użyciu zaawansowanych procedur cyfrowego przetwarzania sygnału. Pakiet oprogramowania składa się z programów: Rejestrator, Przeglądarka oraz Rekonstruktor. Oprogramowanie to może być użyte w przypadkach, gdy zrozumiałość...
Integration of speech enhancement and coding techniques
Publication
- M. Kuropatwinski
- D. Leckschat
- K. Kroschel
- A. Czyzewski
- M. Kuropatwiński
- Year 1999
Full text to download in external service
Speech synthesis controlled by eye gazing
Publication
- A. Czyżewski
- K. Łopatka
- B. Kunka
- R. Rybacki
- B. Kostek
- Year 2010
A method of communication based on eye gaze controlling is presented. Investigations of using gaze tracking have been carried out in various context applications. The solution proposed in the paper could be referred to as ''talking by eyes'' providing an innovative approach in the domain of speech synthesis. The application proposed is dedicated to disabled people, especially to persons in a so-called locked-in syndrome who cannot...
Speech Analytics Based on Machine Learning
Publication
- Year 2019
In this chapter, the process of speech data preparation for machine learning is discussed in detail. Examples of speech analytics methods applied to phonemes and allophones are shown. Further, an approach to automatic phoneme recognition involving optimized parametrization and a classifier belonging to machine learning algorithms is discussed. Feature vectors are built on the basis of descriptors coming from the music information...

Full text to download in external service
Results of tests on speech intelligibility in reverberant conditions
Open Research Data
open access
The dataset contains the results of tests that aimed to provide a relationship between the rate of speech (RoS) and reverberation conditions characterized by the Speech Transmission Index (STI).
International Journal of Speech Technology

Journals

ISSN: 1381-2416 , eISSN: 1572-8110
Journal of Monolingual and Bilingual Speech

Journals

ISSN: 2631-8407 , eISSN: 2631-8415
Real-time speech-rate modification experiments
Publication
- A. Kupryjanow
- A. Czyżewski
- Year 2010
An algorithm designed for real-time speech time scale modification (stretching) is proposed, providing a combination of typical synchronous overlap and add based time scale modification algorithm and signal redundancy detection algorithms that allow to remove parts of the speech signal and replace them with the stretched speech signal fragments. Effectiveness of signal processing algorithms are examined experimentally together...

Full text to download in external service
Silence/noise detection for speech and music signals
Publication
- M. Papaj
- Year 2008
This paper introduces a novel off-line algorithm for silence/noise detection in noisy signals. The main concept of the proposed algorithm is to provide noise patterns for further signals processing i.e. noise reduction for speech enhancement. The algorithm is based on frequency domain characteristics of signals. The examples of different types of noisy signals are presented.
Transient detection algorithms for speech coding applications
Publication
- G. Szwoch
- M. Kulesza
- A. Czyzewski
- Journal of the Acoustical Society of America - Year 2006
Full text to download in external service
New generation speech aid for stuttering people
Publication
- P. Odya
- A. Czyżewski
- Year 2008
Współczesne Cyfrowe Procesory Sygnałowe (ang. DSP) mają niewielkie wymiary, ale są w stanie re-alizować złożone algorytmy. Ich dodatkową zaletą jest łatwość wymiany oprogramowania, a co za tym idzie łatwość zmiany dziedziny zastosowań. Wykorzystując możliwości procesów stało się możliwe budowanie miniaturowych protez słuchu i mowy. W referacie skupiono się na zagadnieniach związanych z projekto-wanie i implementacją algorytmów...

Full text available to download
New generation speech aid for stuttering people
Publication
- P. Odya
- A. Czyżewski
- Archives of Acoustics - Year 2008
Współczesne Cyfrowe Procesory Sygnałowe (ang. DSP) mają niewielkie wymiary, ale są w stanie re-alizować złożone algorytmy. Ich dodatkową zaletą jest łatwość wymiany oprogramowania, a co za tym idzie łatwość zmiany dziedziny zastosowań. Wykorzystując możliwości procesów stało się możliwe budowanie miniaturowych protez słuchu i mowy. W referacie skupiono się na zagadnieniach związanych z projekto-wanie i implementacją algorytmów...

Full text available to download
Comprehensive Evaluation of Statistical Speech Waveform Synthesis
Publication
- T. Merritt
- B. Putrycz
- A. Nadolski
- T. Ye
- D. Korzekwa
- W. Dolecki
- T. Drugman
- V. Klimkov
- A. Moinet
- A. Breen... and 3 others
- Year 2018
Full text to download in external service
Methodology and technology for the polymodal allophonic speech transcription
Publication
- Journal of the Acoustical Society of America - Year 2016
A method for automatic audiovisual transcription of speech employing: acoustic and visual speech representations is developed. It adopts a combining of audio and visual modalities, which provide a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e. the changes in the articulatory setting of speech organs for...

Full text to download in external service
Methodology and technology for the polymodal allophonic speech transcription
Publication
- Journal of the Acoustical Society of America - Year 2016
A method for automatic audiovisual transcription of speech employing: acoustic, electromagnetical articulography and visual speech representations is developed. It adopts a combining of audio and visual modalities, which provide a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e., the changes in the articulatory...

Full text to download in external service
Speech recognition system for hearing impaired people.
Publication
- P. Dalka
- A. Czyżewski
- Year 2005
Praca przedstawia wyniki badań z zakresu rozpoznawania mowy. Tworzony system wykorzystujący dane wizualne i akustyczne będzie ułatwiał trening poprawnego mówienia dla osób po operacji transplantacji ślimaka i innych osób wykazujących poważne uszkodzenia słuchu. Active Shape models zostały wykorzystane do wyznaczania parametrów wizualnych na podstawie analizy kształtu i ruchu ust w nagraniach wideo. Parametry akustyczne bazują na...
Applying the Lombard Effect to Speech-in-Noise Communication
Publication
- G. Korvel
- K. Kąkol
- P. Treigys
- B. Kostek
- Electronics - Year 2023
This study explored how the Lombard effect, a natural or artificial increase in speech loudness in noisy environments, can improve speech-in-noise communication. This study consisted of several experiments that measured the impact of different types of noise on synthesizing the Lombard effect. The main steps were as follows: first, a dataset of speech samples with and without the Lombard effect was collected in a controlled setting;...

Full text available to download
Multimodal English corpus for automatic speech recognition
Publication
- Year 2013
A multimodal corpus developed for research of speech recognition based on audio-visual data is presented. Besides usual video and sound excerpts, the prepared database contains also thermovision images and depth maps. All streams were recorded simultaneously, therefore the corpus enables to examine the importance of the information provided by different modalities. Based on the recordings, it is also possible to develop a speech...
Influence of modulation detection threshold on speech intelligibility
Publication
- K. Leo
- ACTA PHYSICA POLONICA A - Year 2011
Full text available to download

Search

Filters

Catalog

Search results for: SPEECH STRETCHING