Wyniki wyszukiwania dla: TWO-DIMENSIONAL REPRESENTATION OF SPEECH SIGNAL

Wyniki wyszukiwania dla: TWO-DIMENSIONAL REPRESENTATION OF SPEECH SIGNAL

wyników na stronę:
osadź ten widok na swojej stronie

Filtry

wszystkich: 892

wyczyść wszystkie filtry niedostępne

Assessment of Left Atrial Function in Patients with Paroxysmal, Persistent, and Permanent Atrial Fibrillation Using Two-Dimensional Strain.
Publikacja
- L. Aleksandra
- K. Magdalena
- D. Leszek
- K. Klaudia
- S. Monika
- P. Prof.
- O. Prof.
- L. Drabik
- Journal of Atrial Fibrillation - Rok 2019
Pełny tekst do pobrania w serwisie zewnętrznym
Detection of DNA Replication Intermediates after Two-Dimensional Agarose Gel Electrophoresis Using a Fluorescein-Labeled Probe
Publikacja
- S. Śrutkowska
- R. Caspi
- M. Gabig
- G. Węgrzyn
- ANALYTICAL BIOCHEMISTRY - Rok 1999
Pełny tekst do pobrania w serwisie zewnętrznym
Enantioselective comprehensive two-dimensional gas chromatography. A route to elucidate the authenticity and origin ofRosa damascena Milleressential oils
Publikacja
- J. Krupčík
- R. Gorovenko
- I. Špánik
- P. Sandra
- D. Armstrong
- M. Kaykhaii
- Journal of Separation Science - Rok 2015
Pełny tekst do pobrania w serwisie zewnętrznym
Solution conformational study of Scyliorhinin I analogues with conformational constraints by two-dimensional NMR and theoretical conformational analysis
Publikacja
- S. Rodziewicz-Motowidło
- A. Łegowska
- X. Qi
- C. Czaplewski
- A. Liwo
- K. Rolka
- P. Sowiński
- W. Mozga
- J. Olczak
- J. Zabrocki
- C. Czaplewski
- JOURNAL OF PEPTIDE RESEARCH - Rok 2000
Pełny tekst do pobrania w serwisie zewnętrznym
Method of reconstructing two-dimensional velocity fields on the basis of temperature field values measured with a thermal imaging camera
Publikacja
- INTERNATIONAL JOURNAL OF HEAT AND MASS TRANSFER - Rok 2022
This paper describes a novel numerical reconstruction procedure (NRP) of the velocity field during natural convective heat transfer from a two-sided, isothermal, heated vertical plate based only on the known temperature field obtained, e.g. with a thermal imaging camera. It has been demonstrated that with a knowledge of temperature distributions, the NRP enables the reconstruction of velocity fields by solving the Navier-Stokes...

Pełny tekst do pobrania w portalu
Diagnostic Test Accuracy of Artificial Intelligence in Detecting Periapical Periodontitis on Two-Dimensional Radiographs: A Retrospective Study and Literature Review
Publikacja
- J. Issa
- M. Jaber
- I. Rifai
- P. Mozdziak
- B. Kempisty
- M. Dyszkiewicz-Konwińska
- Medicina - Rok 2023
Pełny tekst do pobrania w serwisie zewnętrznym
Cascading transitions toward unconventional charge density wave states in the quasi-two-dimensional monophosphate tungsten bronze P4W16O56
Publikacja
- E. Duverger-Nédellec
- A. Pautrat
- K. Kolincio
- L. Hervé
- O. Pérez
- IUCrJ - Rok 2020
Single crystals of the m = 8 member of the low-dimensional monophosphate tungsten bronzes (PO2)4(WO3)2m family were grown by chemical vapour transport technique and the high crystalline quality obtained allowed a reinvestigation of the physical and structural properties. Resistivity measurements revealed three anomalies at TC1 = 258 K, TC2 = 245 K and TC3 = 140 K, never observed until now. Parallel X-ray diffraction investigations...

Pełny tekst do pobrania w portalu
Examples of numerical simulations of two-dimensional unsaturated flow with VS2DI code using different interblock conductivity averaging schemes
Publikacja
- GEOLOGOS - Rok 2015
Flow in unsaturated porous media is commonly described by the Richards equation. This equation is strongly nonlinear due to interrelationships between water pressure head (negative in unsaturated conditions), water content and hydraulic conductivity. The accuracy of numerical solution of the Richards equation often depends on the method used to estimate average hydraulic conductivity between neighboring nodes or cells of the numerical...

Pełny tekst do pobrania w portalu
Qualitative characteristics and comparison of volatile fraction of vodkas made from different botanical materials by comprehensive two-dimensional gas chromatography and the electronic nose based on the technology of ultra-fast gas chromatography
Publikacja
- JOURNAL OF THE SCIENCE OF FOOD AND AGRICULTURE - Rok 2017
BACKGROUND Vodka is a spirit-based beverage made from ethyl alcohol of agricultural origin. At present, increasingly more vodka brands have labels that specify the botanical origin of the product. Until now, the techniques for distinguishing between vodkas of different botanical origin have been costly, time-consuming and insufficient for making a distinction between vodka produced from similar raw materials. Therefore, it is...

Pełny tekst do pobrania w serwisie zewnętrznym
Quantification of DNA Modifications Using Two-Dimensional Ultraperformance Liquid Chromatography Tandem Mass Spectrometry (2D-UPLC-MS/MS)
Publikacja
- M. Starczak
- M. Gawronski
- R. Olinski
- D. Gackowski
- Rok 2021
Pełny tekst do pobrania w serwisie zewnętrznym
Characteristics of Cucumis metuliferus, Actinidia deliciosa and Musa paradisica fragrance profiles using a comprehensive two-dimensional gas chromatography with time-of-flight mass spectrometric detection (GC×GC-TOF MS)
Publikacja
- Rok 2017
Comprehensive two-dimensional gas chromatography with time-of-flight mass spectrometric detection (GC×GC-TOF-MS) is a modern analytical technique used in many fields. This technique enables an effective separation of volatile chemical compounds [1]. In recent years, the topic of healthy food and healthy living has become very popular. For this reason, more and more scientific publications related to the analysis of food products...

Pełny tekst do pobrania w serwisie zewnętrznym
One-Step Synergistic Effect to Produce Two-Dimensional N-Doped Hierarchical Porous Carbon Nanosheets for High-Performance Flexible Supercapacitors
Publikacja
- X. Liu
- Y. Wen
- X. Chen
- A. Dymerska
- R. Wróbel
- J. Zhu
- X. Wen
- Z. Liu
- E. Mijowska
- ACS Applied Energy Materials - Rok 2020
Pełny tekst do pobrania w serwisie zewnętrznym
Studies on origin of Polish honeys by two-dimensional gas chromatography Ocena pochodzenia surowcowego polskich miodów przy użyciu dwuwymiarowej chromatografii gazowej
Publikacja
- Przemysł Chemiczny - Rok 2014
Polish acacia, linden, rapessed and buckwheat-derived honeys and a honydew were analysed for presence of hydrocarbons, alcs., ketones and esters by 2-dimensional gas chromatog. to establish the markers for the honey origin. PrOH was found characteristic for acacia honey, Me(CH2)11OH for linden honey, Me(CH2)6CHOMe for honeydew and Me(CH2)8COOEt for the rapeseed honey. Wykorzystano technikę dwuwymiarowej chromatografii gazowej sprzężonej...
Improving signal quality in speech codec using hybrid perceptual-parametric algorithm. [Poprawa jakości sygnału w kodekach mowy przy użyciu hybrydowego, parametryczno-perceptualnego algorytmu kodowania]
Publikacja
- Rok 2006
Przedstawiono hybrydową, parametryczno-perceptualną architekturę kodeka. Podstawowa struktura kodeka parametrycznego CELP została wzbogacona o kodowanie perceptualne. Celem hybrydyzacji kodeka jest uzyskanie znaczącej poprawy subiektywnej jakości zdekodowanego sygnału. Zaproponowano dwie hybrydowe struktury. Pierwsza polega na perceptualnym kodowaniu dźwięcznych elementów sygnału rezydualnego kodeka CELP. Druga metoda dzieli sygnał...
Badanie rozkładów parametrów sygnału mowy w zastosowaniach do prognozowania prawdopodobieństwa popełnienia błędów w systemach identyfikacji mówców = Examining distribution of speech signal parameters for the prognosis of error probability in speaker verification systems
Publikacja
- A. Kaczmarek
- Rok 2010
Przedmiotem pracy jest system identyfikacji mówców w sposób zależny od tekstu ("text dependent''). Dokonano analizy wielu różnych wypowiedzi kilkudziesięciu mówców. Zastosowana metoda parametryzacji to metoda oparta na wynikach analizy cepstralnej sygnału mowy. Zdefiniowane zostały nowe parametry skojarzone z elementarnymi zdarzeniami w procesie weryfikacji mówców. Na tej podstawie dokonano estymacji funkcji gęstości prawdopodobieństwa...
Selected results of signal measurements in a ship power station with two generators working in parallel with the use of the Estimator/Analyzer instrument
Dane Badawcze
open access
- R. Maśnicki
- D. Hallmann
- J. Mindykowski
- T. Tarasiuk
- M. Górniak
- M. Szweda
- B. Pałczyńska
The presented dataset is part of research focusing on the assessment of metrological properties of the instrument, Estimator/ Analyser (A/E v.2), developed and made at the Faculty of Electrical Engineering, Department of Marine Electrical Power Engineering, of Gdynia Maritime University. The instrument performs a set of measurement functions that...
Accurate, Direct, and High-Throughput Analyses of a Broad Spectrum of Endogenously Generated DNA Base Modifications with Isotope-Dilution Two-Dimensional Ultraperformance Liquid Chromatography with Tandem Mass Spectrometry: Possible Clinical Implication
Publikacja
- D. Gackowski
- M. Starczak
- E. Zarakowska
- M. Modrzejewska
- A. Szpila
- Z. Banaszkiewicz
- R. Olinski
- ANALYTICAL CHEMISTRY - Rok 2016
Pełny tekst do pobrania w serwisie zewnętrznym
IEEE International Conference on Acoustics, Speech and Signal Processing

Konferencje
Improving the quality of speech in the conditions of noise and interference
Publikacja
- B. Kostek
- K. Kąkol
- Journal of the Acoustical Society of America - Rok 2018
The aim of the work is to present a method of intelligent modification of the speech signal with speech features expressed in noise, based on the Lombard effect. The recordings utilized sets of words and sentences as well as disturbing signals, i.e., pink noise and the so-called babble speech. Noise signal, calibrated to various levels at the speaker's ears, was played over two loudspeakers located 2 m away from the speaker. In...

Pełny tekst do pobrania w serwisie zewnętrznym
Metoda i algorytmy modyfikacji sygnału do celu wspomagania rozumienia mowy przez osoby z pogorszoną rozdzielczością czasową słuchu
Publikacja
- A. Kupryjanow
- Rok 2013
Przedmiotem badań przeprowadzonych w ramach rozprawy są metody modyfikacji czasu trwania sygnału (ang. Time Scale Modification –TSM) mowy operujące w czasie rzeczywistym oraz ocena ich wpływu na rozumienie wypowiedzi przez osoby z pogorszoną rozdzielczością czasową słuchu. Pogorszona rozdzielczość słuchu jest jednym z symptomów związanych z ośrodkowymi zaburzeniami słuchu (ang. Cetnral Auditory Processing Disorder – CAPD). W odróżnieniu...
An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics
Publikacja
- G. Korvel
- O. Kurasova
- B. Kostek
- Rok 2019
The speech with the Lombard effect has been extensively studied in the context of speech recognition or speech enhancement. However, few studies have investigated the Lombard effect in the context of speech synthesis. The aim of this paper is to create a mathematical model that allows for retaining the Lombard effect. These models could be used as a basis of a formant speech synthesizer. The proposed models are based on dividing...

Pełny tekst do pobrania w portalu
Human voice modification using instantaneous complex frequency
Publikacja
- M. Kaniewska
- Rok 2010
The paper presents the possibilities of changing human voice by modifying instantaneous complex frequency (ICF) of the speech signal. The proposed method provides a flexible way of altering voice without the necessity of finding fundamental frequency and formants' positions or detecting voiced and unvoiced fragments of speech. The algorithm is simple and fast. Apart from ICF it uses signal factorization into two factors: one fully...
Improving Objective Speech Quality Indicators in Noise Conditions
Publikacja
- K. Kąkol
- G. Korvel
- B. Kostek
- Rok 2020
This work aims at modifying speech signal samples and test them with objective speech quality indicators after mixing the original signals with noise or with an interfering signal. Modifications that are applied to the signal are related to the Lombard speech characteristics, i.e., pitch shifting, utterance duration changes, vocal tract scaling, manipulation of formants. A set of words and sentences in Polish, recorded in silence,...

Pełny tekst do pobrania w serwisie zewnętrznym
Transient detection for speech coding applications
Publikacja
- International Journal of Computer Science and Network Security - Rok 2006
Signal quality in speech codecs may be improved by selecting transients from speech signal and encoding them using a suitable method. This paper presents an algorithm for transient detection in speech signal. This algorithm operates in several frequency bands. Transient detection functions are calculated from energy measured in short frames of the signal. The final selection of transient frames is based on results of detection...

Pełny tekst do pobrania w serwisie zewnętrznym
Real-time speech-rate modification experiments
Publikacja
- A. Kupryjanow
- A. Czyżewski
- Rok 2010
An algorithm designed for real-time speech time scale modification (stretching) is proposed, providing a combination of typical synchronous overlap and add based time scale modification algorithm and signal redundancy detection algorithms that allow to remove parts of the speech signal and replace them with the stretched speech signal fragments. Effectiveness of signal processing algorithms are examined experimentally together...

Pełny tekst do pobrania w serwisie zewnętrznym
Analysis of Lombard speech using parameterization and the objective quality indicators in noise conditions
Publikacja
- K. Kąkol
- G. Korvel
- B. Kostek
- Rok 2018
The aim of the work is to analyze Lombard speech effect in recordings and then modify the speech signal in order to obtain an increase in the improvement of objective speech quality indicators after mixing the useful signal with noise or with an interfering signal. The modifications made to the signal are based on the characteristics of the Lombard speech, and in particular on the effect of increasing the fundamental frequency...
Methods of Improving Speech Intelligibility for Listeners with Hearing Resolution Deficit
Publikacja
- A. Kupryjanow
- A. Czyżewski
- Diagnostic Pathology - Rok 2012
Methods developed for real-time time scale modification (TSM) of speech signal are presented. They are based onthe non-uniform, speech rate depended SOLA algorithm (Synchronous Overlap and Add). Influence of theproposed method on the intelligibility of speech was investigated for two separate groups of listeners, i.e. hearingimpaired children and elderly listeners. It was shown that for the speech with average rate equal to or...

Pełny tekst do pobrania w portalu
Improved method for real-time speech stretching
Publikacja
- A. Kupryjanow
- A. Czyżewski
- Rok 2012
n algorithm for real-time speech stretching is presented. It was designed to modify input signal dependently on its content and on its relation with the historical input data. The proposed algorithm is a combination of speech signal analysis algorithms, i.e. voice, vowels/consonants, stuttering detection and SOLA (Synchronous-Overlap-and-Add) based speech stretching algorithm. This approach enables stretching input speech signal...

Pełny tekst do pobrania w serwisie zewnętrznym
Jan Daciuk dr hab. inż.

Osoby

Wydział Elektroniki, Telekomunikacji i Informatyki, Katedra Inteligentnych Systemów Interaktywnych

Jan Daciuk uzyskał tytuł zawodowy magistra na Wydziale Elektroniki Politechniki Gdańskiej w 1986 roku, a doktorat na wydziale Elektroniki, Telekomunikacji i Informatyki PG w 1999. Pracuje na Wydziale od 1988 roku. Jego zainteresowania naukowe obejmują zastosowania automatów skończonych w przetwarzaniu języka naturalnego i przetwarzaniu mowy. Spędził ponad cztery lata w europejskich uniwersytetach i instytutach naukowych, takich...
Comparison of Acoustic and Visual Voice Activity Detection for Noisy Speech Recognition
Publikacja
- Rok 2016
The problem of accurate differentiating between the speaker utterance and the noise parts in a speech signal is considered. The influence of utilizing a voice activity detection in speech signals on the accuracy of the automatic speech recognition (ASR) system is presented. The examined methods of voice activity detection are based on acoustic and visual modalities. The problem of detecting the voice activity in clean and noisy...
POPRAWA OBIEKTYWNYCH WSKAŹNIKÓW JAKOŚCI MOWY W WARUNKACH HAŁASU
Publikacja
- K. Kąkol
- B. Kostek
- Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki Politechniki Gdańskiej - Rok 2018
Celem pracy jest modyfikacja sygnału mowy, aby uzyskać zwiększenie poprawy obiektywnych wskaźników jakości mowy po zmiksowaniu sygnału użytecznego z szumem bądź z sygnałem zakłócającym. Wykonane modyfikacje sygnału bazują na cechach mowy lombardzkiej, a w szczególności na efekcie podniesienia częstotliwości podstawowej F0. Sesja nagraniowa obejmowała zestawy słów i zdań w języku polskim, nagrane w warunkach ciszy, jak również w...

Pełny tekst do pobrania w portalu
Methodology and technology for the polymodal allophonic speech transcription
Publikacja
- Journal of the Acoustical Society of America - Rok 2016
A method for automatic audiovisual transcription of speech employing: acoustic and visual speech representations is developed. It adopts a combining of audio and visual modalities, which provide a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e. the changes in the articulatory setting of speech organs for...

Pełny tekst do pobrania w serwisie zewnętrznym
Methodology and technology for the polymodal allophonic speech transcription
Publikacja
- Journal of the Acoustical Society of America - Rok 2016
A method for automatic audiovisual transcription of speech employing: acoustic, electromagnetical articulography and visual speech representations is developed. It adopts a combining of audio and visual modalities, which provide a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e., the changes in the articulatory...

Pełny tekst do pobrania w serwisie zewnętrznym
Investigating Noise Interference on Speech Towards Applying the Lombard Effect Automatically
Publikacja
- G. Korvel
- K. Kąkol
- P. Treigys
- B. Kostek
- Rok 2022
The aim of this study is two-fold. First, we perform a series of experiments to examine the interference of different noises on speech processing. For that purpose, we concentrate on the Lombard effect, an involuntary tendency to raise speech level in the presence of background noise. Then, we apply this knowledge to detecting speech with the Lombard effect. This is for preparing a dataset for training a machine learning-based...

Pełny tekst do pobrania w portalu
AUTOMATYCZNA KLASYFIKACJA MOWY PATOLOGICZNEJ
Publikacja
- M. Włoszczyńska
- B. Kostek
- Rok 2023
Aplikacja przedstawiona w niniejszym rozdziale służy do automatycznego wykrywania mowy patologicznej na podstawie bazy nagrań. W pierwszej kolejności przedstawiono założenia leżące u podstaw przeprowadzonych badan wraz z wyborem bazy mowy patologicznej. Zaprezentowano również zastosowane algorytmy oraz cechy sygnału mowy, które pozwalają odróżnić mowę niezaburzoną od mowy patologicznej. Wytrenowane sieci neuronowe zostały następnie...

Pełny tekst do pobrania w serwisie zewnętrznym
A Method of Real-Time Non-uniform Speech Stretching
Publikacja
- A. Kupryjanow
- A. Czyżewski
- Rok 2012
Developed method of real-time non-uniform speech stretching is presented.The proposed solution is based on the well-known SOLA algorithm(Synchronous Overlap and Add). Non-uniform time-scale modification isachieved by the adjustment of time scaling factor values in accordance with thesignal content. Dependently on the speech unit (vowels/consonants), instantaneousrate of speech (ROS), and speech signal presence, values of the scalingfactor...

Pełny tekst do pobrania w serwisie zewnętrznym
Time-domain prosodic modifications for text-to-speech synthesizer
Publikacja
- J. Łopatka
- P. Suchomski
- A. Czyżewski
- Rok 2010
An application of prosodic speech processing algorithms to Text-To-Speech synthesis is presented. Prosodic modifications that improve the naturalness of the synthesized signal are discussed. The applied method is based on the TD-PSOLA algorithm. The developed Text-To-Speech Synthesizer is used in applications employing multimodal computer interfaces.
Improvement of speech intelligibility in the presence of noise interference using the Lombard effect and an automatic noise interference profiling based on deep learning
Publikacja
- K. Kąkol
- Rok 2023
The Lombard effect is a phenomenon that results in speech intelligibility improvement when applied to noise. There are many distinctive features of Lombard speech that were recalled in this dissertation. This work proposes the creation of a system capable of improving speech quality and intelligibility in real-time measured by objective metrics and subjective tests. This system consists of three main components: speech type detection,...

Pełny tekst do pobrania w portalu
Instantaneous complex frequency for pipeline pitch estimation
Publikacja
- M. [. Kaniewska
- Rok 2010
In the paper a pipeline algorithm for estimating the pitch of speech signal is proposed. The algorithm uses instantaneous complex frequencies estimated for four waveforms obtained by filtering the original speech signal through four bandpass complex Hilbert filters. The imaginary parts of ICFs from each channel give four candidates for pitch estimates. The decision regarding the final estimate is made based on the real parts of...
Properties and interpretation of Instantaneous Complex Frequency
Publikacja
- K. Świder
- T. Bandurski
- R. Studański
- POLISH JOURNAL OF ENVIRONMENTAL STUDIES - Rok 2011
The concept of Instantaneous Complex Frequency (ICF) was first defined by Lindon and developed mainly in works of two authors S. Hahn and M. Rojewski. Although it is not widely used in signal analysis, ICF was already used as a complex signal representation in the verification of handwritten signatures, pitch estimation, symbol timing recovery in PSK receiver and in detection of anomalies in data transmission. It should be noted,...

Pełny tekst do pobrania w serwisie zewnętrznym
DAB+ Coverage Analysis: a New Look at Network Planning using GIS Tools
Publikacja
- Rok 2018
For many years, the matter of designing a transmitter network, optimized for best signal coverage, has been a subject of intense research. In the last decade, numerous researchers and institutions used GIS and spatial analysis tools for network planning, especially transmitter location. Currently, many existing systems operate in a strictly two-dimensional manner, not taking into account the three-dimensional nature of the analyzed...
A Novel Method for Intelligibility Assessment of Nonlinearly Processed Speech in Spaces Characterized by Long Reverberation Times
Publikacja
- SENSORS - Rok 2022
Objective assessment of speech intelligibility is a complex task that requires taking into account a number of factors such as different perception of each speech sub-bands by the human hearing sense or different physical properties of each frequency band of a speech signal. Currently, the state-of-the-art method used for assessing the quality of speech transmission is the speech transmission index (STI). It is a standardized way...

Pełny tekst do pobrania w portalu
Mechanics of materials (PG_00057378), W/Ć, BM, II stop., sem1,3, lato, 2023/24
Kursy Online
- B. Rozmarynowski
- P. M. Bielski
The aim of this course is providing knowledge in the field of analysis and solving problems of mechanics and strength of one-dimensional systems (bars, beams, frames) and selected two-dimensional systems (shields, plates); preparing the student to solve problems involving complex cases of material strength; developing the ability to assess the stability of structural elements (forms of stability loss, critical forces); consolidation...
Strategie treningu neuronowego estymatora częstotliwości tonu krtaniowego z użyciem generatora syntetycznych samogłosek
Publikacja
- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Rok 2022
W wielu zastosowaniach telekomunikacyjnych pojawia się problem przetwarzania lub analizy sygnału mowy, w ramach którego, często w obszarze podstawowych algorytmów, stosuje się estymator częstotliwości tonu krtaniowego. Estymator rozpatrywany w tej pracy bazuje na neuronowym klasyfikatorze podejmującym decyzje na podstawie częstotliwości oraz mocy chwilowej wyznaczanych w podpasmach analizowanego sygnału mowy. W pracy rozważamy...

Pełny tekst do pobrania w portalu
Natalia Sokół dr inż.

Osoby

Katedra Urbanistyki i Planowania Regionalnego

BACKGROUND Master of Science in Light and Lighting (2008-2009/11) The UCL Bartlett School of Graduate Studies, Faculty of the Built Environment, London, UK, www.bartlett.ucl.ac.uk MA Degree in Interior Architecture (1999-2004), The Academy of Fine Arts, Poznan, Poland, www.uap.edu.pl MA Degree in Art Education (1997-2002), Academy of Fine Arts, Poznan, Poland, www.uap.edu.pl MAIN RESEARCH AREAS · ...
Audio-visual aspect of the Lombard effect and comparison with recordings depicting emotional states.
Publikacja
- Rok 2018
In this paper an analysis of audio-visual recordings of the Lombard effect is shown. First, audio signal is analyzed indicating the presence of this phenomenon in the recorded sessions. The principal aim, however, was to discuss problems related to extracting differences caused by the Lombard effect, present in the video , i.e. visible as tension and work of facial muscles aligned to an increase in the intensity of the articulated...

Pełny tekst do pobrania w serwisie zewnętrznym
A Comparison of STI Measured by Direct and Indirect Methods for Interiors Coupled with Sound Reinforcement Systems
Publikacja
- Rok 2018
This paper presents a comparison of STI (Speech Transmission Index) coefficient measurement results carried out by direct and indirect methods. First, acoustic parameters important in the context of public address and sound reinforcement systems are recalled. A measurement methodology is presented that employs various test signals to determine impulse responses. The process of evaluating sound system performance, signals enabling...

Pełny tekst do pobrania w serwisie zewnętrznym
Rediscovering Automatic Detection of Stuttering and Its Subclasses through Machine Learning—The Impact of Changing Deep Model Architecture and Amount of Data in the Training Set
Publikacja
- P. Filipowicz
- B. Kostek
- Applied Sciences-Basel - Rok 2023
This work deals with automatically detecting stuttering and its subclasses. An effective classification of stuttering along with its subclasses could find wide application in determining the severity of stuttering by speech therapists, preliminary patient diagnosis, and enabling communication with the previously mentioned voice assistants. The first part of this work provides an overview of examples of classical and deep learning...

Pełny tekst do pobrania w portalu
Marking the Allophones Boundaries Based on the DTW Algorithm
Publikacja
- J. Rafałko
- Rok 2018
The paper presents an approach to marking the boundaries of allophones in the speech signal based on the Dynamic Time Warping (DTW) algorithm. Setting and marking of allophones boundaries in continuous speech is a difficult issue due to the mutual influence of adjacent phonemes on each other. It is this neighborhood on the one hand that creates variants of phonemes that is allophones, and on the other hand it affects that the border...
Enhanced voice user interface employing spatial filtration of signals from acoustic vector sensor
Publikacja
- Rok 2015
Spatial filtration of sound is introduced to enhance speech recognition accuracy in noisy conditions. An acoustic vector sensor (AVS) is employed. The signals from the AVS probe are processed in order to attenuate the surrounding noise. As a result the signal to noise ratio is increased. An experiment is featured in which speech signals are disturbed by babble noise. The signals before and after spatial filtration are processed...

Pełny tekst do pobrania w serwisie zewnętrznym

Wyszukiwarka

Filtry

Katalog

Wyniki wyszukiwania dla: TWO-DIMENSIONAL REPRESENTATION OF SPEECH SIGNAL

Jan Daciuk dr hab. inż.

Natalia Sokół dr inż.