Search results for: SPEECH PARAMETRIZATION

Search results for: SPEECH PARAMETRIZATION

results on page:
embed this view on your website

Displayed results came from alternative search method.

Filters

total: 1999

clear all filters disabled

displaying 1000 best results Help

Objectivization of phonological evaluation of speech elements by means of audio parametrization
Publication
- Year 2018
This study addresses two issues related to both machine- and subjective-based speech evaluation by investigating five phonological phenomena related to allophone production. Its aim is to use objective parametrization and phonological classification of the recorded allophones. These allophones were selected as specifically difficult for Polish speakers of English: aspiration, final obstruent devoicing, dark lateral /l/, velar nasal...
Analysis of Lombard speech using parameterization and the objective quality indicators in noise conditions
Publication
- K. Kąkol
- G. Korvel
- B. Kostek
- Year 2018
The aim of the work is to analyze Lombard speech effect in recordings and then modify the speech signal in order to obtain an increase in the improvement of objective speech quality indicators after mixing the useful signal with noise or with an interfering signal. The modifications made to the signal are based on the characteristics of the Lombard speech, and in particular on the effect of increasing the fundamental frequency...
Paremetrization of sounds for recognizing hazarodus events
Publication
- Zeszyty Naukowe Wydziału ETI Politechniki Gdańskiej. Technologie Informacyjne - Year 2010
Nowoczesne systemy monitoringu działają na zasadzie automatycznego wykrywania niebezpiecznych zdarzeń na podstawie analizy obrazu z kamer i dźwięku z mikrofonów. W niniejszej publikacji skupiono się na pierwszym etapie rozpoznawania zdarzeń dźwiękowych, jakim jest parametryzacja dźwięku. Podstawą do skutecznego działania systemu jest znalezienie parametrów, których zmienność najlepiej odzwierciedla cechy charakterystyczne dźwięku...
Parametrization and Correlation Analysis Applied to Music Mood Classification .
Publication
- B. Kostek
- M. Piotrowska
- International Journal of Computational Intelligence Studies - Year 2013
The paper presents a study on music mood categorization. First, a review of music mood models is presented. Then, the preparation of a set of music excerpts to be used in the experiments and music parametrization is described. Next, some listening tasks performed to obtain mood descriptors are introduced. Finally,the correlation between mood descriptors and features extracted from parameters is discussed. The paper concludes with...

Full text to download in external service
Analysis of allophones based on audio signal recordings and parameterization
Publication
- Journal of the Acoustical Society of America - Year 2017
The aim of this study is to develop an allophonic description of English plosive consonants based on recordings of 600 specially selected words. Allophonic variations addressed in the study may have two sources: positional and contextual. The former one depends on the syllabic or prosodic position in which a particular phoneme occurs. Contextual allophony is conditioned by the local phonetic environment. Co-articulation overlapping...

Full text to download in external service
Application of the neural networks for developing new parametrization of the Tersoff potential for carbon
Publication
- A. C. Nwachukwu
- S. Winczewski
- TASK Quarterly - Year 2020
Penta-graphene (PG) is a 2D carbon allotrope composed of a layer of pentagons having sp2- and sp3-bonded carbon atoms. A study carried out in 2018 has shown that the parameterization of the Tersoff potential proposed in 2005 by Ehrhart and Able (T05 potential) performs better than other potentials available for carbon, being able to reproduce structural and mechanical properties of the PG. In this work, we tried to improve the...

Full text available to download
Speech and Drama

Journals

ISSN: 0038-7142
LANGUAGE AND SPEECH

Journals

ISSN: 0023-8309 , eISSN: 1756-6053
On geometry parameterization for simulation-driven design closure of antenna structures
Publication
- S. Kozieł
- A. Pietrenko-Dąbrowska
- Scientific Reports - Year 2021
Full-wave electromagnetic (EM) simulation tools have become ubiquitous in antenna design, especially final tuning of geometry parameters. From the reliability standpoint, the recommended realization of EM-driven design is through rigorous numerical optimization. It is a challenging endeavor with the major issues related to the high computational cost of the process, but also the necessity of handling several objectives and constraints...

Full text available to download
Verification of the Parameterization Methods in the Context of Automatic Recognition of Sounds Related to Danger
Publication
- P. Żwan
- A. Czyżewski
- Journal of Digital Forensic Practice - Year 2010
W artykule opisano aplikację, która automatycznie wykrywa zdarzenia dźwiękowe takie jak: rozbita szyba, wystrzał, wybuch i krzyk. Opisany system składa się z bloku parametryzacji i klasyfikatora. W artykule dokonano porównania parametrów dedykowanych dla tego zastosowania oraz standardowych deskryptorów MPEG-7. Porównano też dwa klasyfikatory: Jeden oparty o Percetron (sieci neuronowe) i drugi oparty o Maszynę wektorów wspierających....

Full text to download in external service
Further developments of parameterization methods of audio stream analysis for secuirty purposes
Publication
- P. Żwan
- A. Czyżewski
- Year 2009
The paper presents an automatic sound recognition algorithm intended for application in an audiovisual security monitoring system. A distributed character of security systems does not allow for simultaneous observation of multiple multimedia streams, thus an automatic recognition algorithm must be introduced. In the paper, a module for the parameterization and automatic detection of audio events is described. The spectral analyses...
Determining Pronunciation Differences in English Allophones Utilizing Audio Signal Parameterization
Publication
- B. Kostek
- M. Piotrowska
- T. Ciszewski
- A. Czyżewski
- Year 2017
An allophonic description of English plosive consonants, based on audio-visual recordings of 600 specially selected words, was developed. First, several speakers were recorded while reading words from a teleprompter. Then, every word was played back from the previously recorded sample read by a phonology expert and each examined speaker repeated a particular word trying to imitate correct pronunciation. The next step consisted...
Evaluation of Lombard Speech Models in the Context of Speech in Noise Enhancement
Publication
- G. Korvel
- K. Kąkol
- O. Kurasova
- B. Kostek
- IEEE Access - Year 2020
The Lombard effect is one of the most well-known effects of noise on speech production. Speech with the Lombard effect is more easily recognizable in noisy environments than normal natural speech. Our previous investigations showed that speech synthesis models might retain Lombard-effect characteristics. In this study, we investigate several speech models, such as harmonic, source-filter, and sinusoidal, applied to Lombard speech...

Full text available to download
Structures for parameterization, meshing and data exchange of topologically related surfaces of a ship hull
Publication
- A. Kniat
- Year 2010
This paper presents proposal of data structures for storage and processing of a parametric three-dimensional model of a midship hull sections. The model consists of coarse surfaces like: decks, frames, girders, stiffeners, brackets, partitions etc. bounded by topological relations. All workshop details are omitted as the model is intended for numeric calculations. Proposed data structures are prepared to facilitate changes in the...
High Speed Milling vibration surveillance with optimal spindle speed based on optimal speeds map
Publication
- Key Engineering Materials - Year 2014
The paper presents the method of the surveillance of the self-excited chatter vibration. At first, the workpiece modal parameters are estimated based on experimental data which leads to verification of computational model. Then, for selected surface points optimal spindle speeds are calculated. By considering sufficient amount of points it is possible to build a map of optimal spindle speeds. Experimental results show that this...

Full text to download in external service
Emotions in polish speech recordings
Open Research Data
open access
- M. Mięsikowska
- D. Świsulski
The data set presents emotions recorded in sound files that are expressions of Polish speech. Statements were made by people aged 21-23, young voices of 5 men. Each person said the following words / nie – no, oddaj - give back, podaj – pass, stop - stop, tak - yes, trzymaj -hold / five times representing a specific emotion - one of three - anger (a),...
Estimation of the excitation variances of speech and noise AR-models for enhanced speech coding
Publication
- M. Kuropatwinski
- W. Kleijn
- M. Kuropatwiński
- Year 2001
Full text to download in external service
COMPUTER SPEECH AND LANGUAGE

Journals

ISSN: 0885-2308 , eISSN: 1095-8363
SEMINARS IN SPEECH AND LANGUAGE

Journals

ISSN: 0734-0478 , eISSN: 1098-9056
Speech and Language Technology

Journals

ISSN: 1895-0434
Speech Language and Hearing

Journals

ISSN: 1361-3286 , eISSN: 2050-5728
Quarterly Journal of Speech

Journals

ISSN: 0033-5630 , eISSN: 1479-5779
SpringerBriefs in Speech Technology

Journals

ISSN: 2191-737X , eISSN: 2191-7388
Audiology and Speech Research

Journals

ISSN: 2635-5019 , eISSN: 2635-5027
Voice and Speech Review

Journals

ISSN: 2326-8263 , eISSN: 2326-8271
Simple gait parameterization and 3D animation for anonymous visual monitoring based on augmented reality
Publication
- P. Szczuko
- MULTIMEDIA TOOLS AND APPLICATIONS - Year 2016
The article presents a method for video anonymization and replacing real human silhouettes with virtual 3D figures rendered on a screen. Video stream is processed to detect and to track objects, whereas anonymization stage employs animating avatars accordingly to behavior of detected persons. Location, movement speed, direction, and person height are taken into account during animation and rendering phases. This approach requires...

Full text available to download
Optimizing Medical Personnel Speech Recognition Models Using Speech Synthesis and Reinforcement Learning
Publication
- A. Czyżewski
- Journal of the Acoustical Society of America - Year 2023
Text-to-Speech synthesis (TTS) can be used to generate training data for building Automatic Speech Recognition models (ASR). Access to medical speech data is because it is sensitive data that is difficult to obtain for privacy reasons; TTS can help expand the data set. Speech can be synthesized by mimicking different accents, dialects, and speaking styles that may occur in a medical language. Reinforcement Learning (RL), in the...

Full text available to download
Language Models in Speech Recognition
Publication
- J. Daciuk
- Year 2022
This chapter describes language models used in speech recognition, It starts by indicating the role and the place of language models in speech recognition. Mesures used to compare language models follow. An overview of n-gram, syntactic, semantic, and neural models is given. It is accompanied by a list of popular software.

Full text to download in external service
Intelligent processing of stuttered speech.
Publication
- A. Czyżewski
- A. Kaczmarek
- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Year 2003
W artykule zaprezentowano kilka metod analizy i automatycznego zliczania potknięć artykulacyjnych, związanych z jąkaniem się, opartych na wykorzystaniu algorytmów uczących się sztucznych sieci neuronowych i zbiorów przybliżonych.
Speech Intelligibility Measurements in Auditorium
Publication
- K. Leo
- ACTA PHYSICA POLONICA A - Year 2010
Speech intelligibility was measured in Auditorium Novum on Technical University of Gdansk (seating capacity 408, volume 3300 m3). Articulation tests were conducted; STI and Early Decay Time EDT coefficients were measured. Negative noise contribution to speech intelligibility was taken into account. Subjective measurements and objective tests reveal high speech intelligibility at most seats in auditorium. Correlation was found between...

Full text available to download
Comparison of Language Models Trained on Written Texts and Speech Transcripts in the Context of Automatic Speech Recognition
Publication
- S. Dziadzio
- A. Nabożny
- A. Smywiński-Pohl
- B. Ziółko
- Year 2015
Full text to download in external service
Integration of speech enhancement and coding techniques
Publication
- M. Kuropatwinski
- D. Leckschat
- K. Kroschel
- A. Czyzewski
- M. Kuropatwiński
- Year 1999
Full text to download in external service
Novel approaches to wideband speech coding
Publication
- M. Kulesza
- A. Czyżewski
- Year 2008
Dwie metoda kodowania szerokopasmowego mowy zostały zaprezentowane. W pierwszej metodzie wykorzystano algorytm kompresji i ekspansji czasowej sygnału mowy, pozwalający na kodowanie szerokopasmowe sygnału mowy z wykorzystaniem ustandaryzowanych kodeków. Metoda ta jest przewidziana do zastosowania w adaptacyjnych algorytmach kodowania mowy. Drugie z proponowanych rozwiazan dotyczy nowej metody estymacji obwiedni widma sygnalu mowy...

Full text to download in external service
Transient detection for speech coding applications
Publication
- International Journal of Computer Science and Network Security - Year 2006
Signal quality in speech codecs may be improved by selecting transients from speech signal and encoding them using a suitable method. This paper presents an algorithm for transient detection in speech signal. This algorithm operates in several frequency bands. Transient detection functions are calculated from energy measured in short frames of the signal. The final selection of transient frames is based on results of detection...

Full text to download in external service
Speech synthesis controlled by eye gazing
Publication
- A. Czyżewski
- K. Łopatka
- B. Kunka
- R. Rybacki
- B. Kostek
- Year 2010
A method of communication based on eye gaze controlling is presented. Investigations of using gaze tracking have been carried out in various context applications. The solution proposed in the paper could be referred to as ''talking by eyes'' providing an innovative approach in the domain of speech synthesis. The application proposed is dedicated to disabled people, especially to persons in a so-called locked-in syndrome who cannot...
Broadband interference in speech reinforcement systems
Publication
- H. Lasota
- R. Mazurek
- Year 2008
Artykuł podejmuje niedoceniany problem wpływu liczby i rozkładu głośników w systemach nagłośnienia, na jakość przekazu głosowego, czyli na zrozumiałość mowy w audytoriach. Superpozycji przesuniętych w czasie szerokopasmowych sygnałów o tym samym kształcie i lekko różnych wielkościach, które docierają do słuchacza z licznych spójnych źródeł, towarzyszy zjawisko interferencji prowadzące do głębokiej modyfikacji odbieranych sygnałów...
Multitask Noisy Speech Enhancement System
Publication
- A. Czyżewski
- J. Kotus
- G. Szwoch
- M. Dziubiński
- A. Rypulak
- A. Pawlik
- Year 2005
W referacie opisano Wielozadaniowy System Poprawy Jakości Sygnału Mowy. Jest to wyspecjalizowany pakiet oprogramowania przeznaczony do rejestrowania sygnału mowy i do poprawy jego jakości oraz zrozumiałości mowy, przy użyciu zaawansowanych procedur cyfrowego przetwarzania sygnału. Pakiet oprogramowania składa się z programów: Rejestrator, Przeglądarka oraz Rekonstruktor. Oprogramowanie to może być użyte w przypadkach, gdy zrozumiałość...
A system for multitask noisy speech enhancement.
Publication
- A. Czyżewski
- A. Kaczmarek
- J. Kotus
- A. Pawlik
- A. Rypulak
- P. Żwan
- Year 2004
W artykule przedstawiono ogolną charakterystyke opracowanego systemu rejestracji i rekonstrukcji mowy. Artykuł zawiera opis składników systemu, ktory jest oprogramowaniem zawierającym zaawansowane narzędzia służące poprawie zrozumiałości mowy. Zaimplementowane narzędzia systemu umożliwiają wyszukiwanie nagrań dźwiękowych i ich obróbkę przy pomocy zaimplementowanych pluginów. W artykule przedstawione wykorzystane w systemie algorytmy...
Speech Analytics Based on Machine Learning
Publication
- Year 2019
In this chapter, the process of speech data preparation for machine learning is discussed in detail. Examples of speech analytics methods applied to phonemes and allophones are shown. Further, an approach to automatic phoneme recognition involving optimized parametrization and a classifier belonging to machine learning algorithms is discussed. Feature vectors are built on the basis of descriptors coming from the music information...

Full text to download in external service
Robust continuous-time controller design via structural Youla-Kučera parameterization with application to predictive control.
Publication
- Z. Kowalczuk
- P. Suchomski
- Year 2004
Praca dotyczy projektowania ciągłoczasowego sterowania liniowymi układami SISO o niepewnych parametrach w celu uzyskania nominalnej i odpornej stabilności oraz nominalnej i odpornej jakości regulacji. Specyficzne zastosowanie koncepcji parametryzacji Youli-Kučery (YK) prowadzi do nowego zastosowania obserwatorowych struktur sterowania. Metoda ta została połączona z nominalną strategią uogólnionego sterowania predykcyjnego CGPC,...
Results of tests on speech intelligibility in reverberant conditions
Open Research Data
open access
The dataset contains the results of tests that aimed to provide a relationship between the rate of speech (RoS) and reverberation conditions characterized by the Speech Transmission Index (STI).
International Journal of Speech Technology

Journals

ISSN: 1381-2416 , eISSN: 1572-8110
Journal of Monolingual and Bilingual Speech

Journals

ISSN: 2631-8407 , eISSN: 2631-8415
Comparison of the exponential thermal transient parameterization methods with the SMTP method in the unipedicled DIEP flap computer modelling and simulation
Publication
- M. Moderhak
- QIRT Journal - Year 2018
The aim of this paper is to compare the spatial contrast of the image descriptors obtained via three different thermal transient parameterization methods in Active Dynamic Thermography. The thermal constants and amplitude values of the one- and two- exponential parametrization are compared to the Simplified Magnitude-Temporal Parametrization method (SMTP). The comparison is performed using the data obtained by simulating the cold...

Full text to download in external service
Silence/noise detection for speech and music signals
Publication
- M. Papaj
- Year 2008
This paper introduces a novel off-line algorithm for silence/noise detection in noisy signals. The main concept of the proposed algorithm is to provide noise patterns for further signals processing i.e. noise reduction for speech enhancement. The algorithm is based on frequency domain characteristics of signals. The examples of different types of noisy signals are presented.
New generation speech aid for stuttering people
Publication
- P. Odya
- A. Czyżewski
- Year 2008
Współczesne Cyfrowe Procesory Sygnałowe (ang. DSP) mają niewielkie wymiary, ale są w stanie re-alizować złożone algorytmy. Ich dodatkową zaletą jest łatwość wymiany oprogramowania, a co za tym idzie łatwość zmiany dziedziny zastosowań. Wykorzystując możliwości procesów stało się możliwe budowanie miniaturowych protez słuchu i mowy. W referacie skupiono się na zagadnieniach związanych z projekto-wanie i implementacją algorytmów...

Full text available to download
New generation speech aid for stuttering people
Publication
- P. Odya
- A. Czyżewski
- Archives of Acoustics - Year 2008
Współczesne Cyfrowe Procesory Sygnałowe (ang. DSP) mają niewielkie wymiary, ale są w stanie re-alizować złożone algorytmy. Ich dodatkową zaletą jest łatwość wymiany oprogramowania, a co za tym idzie łatwość zmiany dziedziny zastosowań. Wykorzystując możliwości procesów stało się możliwe budowanie miniaturowych protez słuchu i mowy. W referacie skupiono się na zagadnieniach związanych z projekto-wanie i implementacją algorytmów...

Full text available to download
Multimodal English corpus for automatic speech recognition
Publication
- Year 2013
A multimodal corpus developed for research of speech recognition based on audio-visual data is presented. Besides usual video and sound excerpts, the prepared database contains also thermovision images and depth maps. All streams were recorded simultaneously, therefore the corpus enables to examine the importance of the information provided by different modalities. Based on the recordings, it is also possible to develop a speech...
Transient detection algorithms for speech coding applications
Publication
- G. Szwoch
- M. Kulesza
- A. Czyzewski
- Journal of the Acoustical Society of America - Year 2006
Full text to download in external service
Comprehensive Evaluation of Statistical Speech Waveform Synthesis
Publication
- T. Merritt
- B. Putrycz
- A. Nadolski
- T. Ye
- D. Korzekwa
- W. Dolecki
- T. Drugman
- V. Klimkov
- A. Moinet
- A. Breen... and 3 others
- Year 2018
Full text to download in external service

Search

Filters

Catalog

Search results for: SPEECH PARAMETRIZATION