Search results for: SPEECH REINFORCEMENT SYSTEMS

Search results for: SPEECH REINFORCEMENT SYSTEMS

results on page:
embed this view on your website

Filters

total: 6614

clear all filters disabled

displaying 1000 best results Help

Tensor Decomposition for Imagined Speech Discrimination in EEG
Publication
- J. S. Garcia Salinas
- L. Villaseñor-Pineda
- C. A. Reyes-Garćia
- A. A. Torres-García
- LECTURE NOTES IN COMPUTER SCIENCE - Year 2018
Most of the researches in Electroencephalogram(EEG)-based Brain-Computer Interfaces (BCI) are focused on the use of motor imagery. As an attempt to improve the control of these interfaces, the use of language instead of movement has been recently explored, in the form of imagined speech. This work aims for the discrimination of imagined words in electroencephalogram signals. For this purpose, the analysis of multiple variables...

Full text to download in external service
Methods of Improving Speech Intelligibility for Listeners with Hearing Resolution Deficit
Publication
- A. Kupryjanow
- A. Czyżewski
- Diagnostic Pathology - Year 2012
Methods developed for real-time time scale modification (TSM) of speech signal are presented. They are based onthe non-uniform, speech rate depended SOLA algorithm (Synchronous Overlap and Add). Influence of theproposed method on the intelligibility of speech was investigated for two separate groups of listeners, i.e. hearingimpaired children and elderly listeners. It was shown that for the speech with average rate equal to or...

Full text available to download
Multimodal English corpus for automatic speech recognition
Publication
- Year 2013
A multimodal corpus developed for research of speech recognition based on audio-visual data is presented. Besides usual video and sound excerpts, the prepared database contains also thermovision images and depth maps. All streams were recorded simultaneously, therefore the corpus enables to examine the importance of the information provided by different modalities. Based on the recordings, it is also possible to develop a speech...
Influence of effective width of flange on calculation and reinforcement dimensioning of beam of reinforced concrete frame
Publication
- M. T. Solarczyk
- Budownictwo i Architektura - Year 2021
The paper analyses the influence of modelling the cross-section of a beam in two-storey reinforced concrete frame of industrial warehouse with dimensions: 18.0 m × 32.0 m using bar elements on the results of bending moments, the value of elastic deflection and the dimensioning of reinforcement due to bending. Six options were considered: a beam as a rectangular section and five T-beam variants with different definitions of effective...

Full text available to download
Investigating Noise Interference on Speech Towards Applying the Lombard Effect Automatically
Publication
- G. Korvel
- K. Kąkol
- P. Treigys
- B. Kostek
- Year 2022
The aim of this study is two-fold. First, we perform a series of experiments to examine the interference of different noises on speech processing. For that purpose, we concentrate on the Lombard effect, an involuntary tendency to raise speech level in the presence of background noise. Then, we apply this knowledge to detecting speech with the Lombard effect. This is for preparing a dataset for training a machine learning-based...

Full text available to download
Elimination of clicks from archive speech signals using sparse autoregressive modeling
Publication
- M. Niedźwiecki
- M. Ciołek
- Year 2012
This paper presents a new approach to elimination of impulsivedisturbances from archive speech signals. The proposedsparse autoregressive (SAR) signal representation is given ina factorized form - the model is a cascade of the so-called formantfilter and pitch filter. Such a technique has been widelyused in code-excited linear prediction (CELP) systems, as itguarantees model stability. After detection of noise pulses usinglinear...

Full text to download in external service
Comparison of Acoustic and Visual Voice Activity Detection for Noisy Speech Recognition
Publication
- Year 2016
The problem of accurate differentiating between the speaker utterance and the noise parts in a speech signal is considered. The influence of utilizing a voice activity detection in speech signals on the accuracy of the automatic speech recognition (ASR) system is presented. The examined methods of voice activity detection are based on acoustic and visual modalities. The problem of detecting the voice activity in clean and noisy...
Ranking Speech Features for Their Usage in Singing Emotion Classification
Publication
- S. Zaporowski
- B. Kostek
- Year 2020
This paper aims to retrieve speech descriptors that may be useful for the classification of emotions in singing. For this purpose, Mel Frequency Cepstral Coefficients (MFCC) and selected Low-Level MPEG 7 descriptors were calculated based on the RAVDESS dataset. The database contains recordings of emotional speech and singing of professional actors presenting six different emotions. Employing the algorithm of Feature Selection based...

Full text available to download
System Supporting Speech Perception in Special Educational Needs Schoolchildren
Publication
- A. Kupryjanow
- P. Suchomski
- P. Odya
- A. Czyżewski
- Year 2012
The system supporting speech perception during the classes is presented in the paper. The system is a combination of portable device, which enables real-time speech stretching, with the workstation designed in order to perform hearing tests. System was designed to help children suffering from Central Auditory Processing Disorders.

Full text to download in external service
High quality speech codec employing sines+noise+transients model
Publication
- Archives of Acoustics - Year 2006
A method of high quality wideband speech signal representation employing sines+transients+noise model is presented. The need for a wideband speech coding approach as well as various methods for analysis and synthesis of sines, residual and transient states of speech signal is discussed. The perceptual criterion is applied in the proposed approach during encoding of sines amplitudes in order to reduce bandwidth requirements and...

Full text available to download
Silence/noise detection for speech and music signals
Publication
- M. Papaj
- Year 2008
This paper introduces a novel off-line algorithm for silence/noise detection in noisy signals. The main concept of the proposed algorithm is to provide noise patterns for further signals processing i.e. noise reduction for speech enhancement. The algorithm is based on frequency domain characteristics of signals. The examples of different types of noisy signals are presented.
Analysis of Lombard speech using parameterization and the objective quality indicators in noise conditions
Publication
- K. Kąkol
- G. Korvel
- B. Kostek
- Year 2018
The aim of the work is to analyze Lombard speech effect in recordings and then modify the speech signal in order to obtain an increase in the improvement of objective speech quality indicators after mixing the useful signal with noise or with an interfering signal. The modifications made to the signal are based on the characteristics of the Lombard speech, and in particular on the effect of increasing the fundamental frequency...
A Novel Method for Intelligibility Assessment of Nonlinearly Processed Speech in Spaces Characterized by Long Reverberation Times
Publication
- SENSORS - Year 2022
Objective assessment of speech intelligibility is a complex task that requires taking into account a number of factors such as different perception of each speech sub-bands by the human hearing sense or different physical properties of each frequency band of a speech signal. Currently, the state-of-the-art method used for assessing the quality of speech transmission is the speech transmission index (STI). It is a standardized way...

Full text available to download
Database of speech and facial expressions recorded with optimized face motion capture settings
Publication
- A. Czyżewski
- M. Kawaler
- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Year 2019
The broad objective of the present research is the analysis of spoken English employing a multiplicity of modalities. An important stage of this process, discussed in the paper, is creating a database of speech accompanied with facial expressions. Recordings of speakers were made using an advanced system for capturing facial muscle motion. A brief historical outline, current applications, limitations and the ways of capturing face...

Full text available to download
Artur Gańcza mgr inż.

People

Department of Marine Electronic Systems

I received the M.Sc. degree from the Gdańsk University of Technology (GUT), Gdańsk, Poland, in 2019. I am currently a Ph.D. student at GUT, with the Department of Automatic Control, Faculty of Electronics, Telecommunications and Informatics. My professional interests include speech recognition, system identification, adaptive signal processing and linear algebra.
An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics
Publication
- G. Korvel
- O. Kurasova
- B. Kostek
- Year 2019
The speech with the Lombard effect has been extensively studied in the context of speech recognition or speech enhancement. However, few studies have investigated the Lombard effect in the context of speech synthesis. The aim of this paper is to create a mathematical model that allows for retaining the Lombard effect. These models could be used as a basis of a formant speech synthesizer. The proposed models are based on dividing...

Full text available to download
Basic sensitivity analysis of a telecommunication tower complementing standard reinforcement design process
Publication
- K. Winkelmann
- S. Duch
- AIP Conference Proceedings - Year 2019
This paper presents straightforward sensitivity assessment of a telecommunication tower. The analysis is set toidentify the elements of the tower which may be reinforced with the greatest structural advantage. As current expertopin ions on structural redesign of similar structures due to a planned addition of extra loads are mainly based ondeterministic computations or engineering intuition,...

Full text available to download
Corrupted speech intelligibility improvement using adaptive filter based algorithm
Publication
- D. Ellwart
- A. Czyżewski
- Year 2010
A technique for improving the quality of speech signals recorded in strong noise is presented. The proposed algorithmemploying adaptive filtration is described and additional possibilities of speech intelligibility improvement arediscussed. Results of the tests are presented.
Force transfer and stress distribution in short cantilever deep beams loaded throughout the depth with a various reinforcement
Publication
- A. Kopańska
- K. Nagrodzka-Godycka
- Year 2019
Deep beams used as the main reinforced concrete structural elements which taking over the load and stiffening construction are often found in high-rise buildings. The architecture of these buildings is sometimes sophisticated and varied, arouse the admiration of the majority of recipients, and thus causing an engineering challenge to correctly design the structural system and force transfer. In such structures is important to shape...

Full text to download in external service
Jarosław Sadowski dr hab. inż.

People

Department of Radiocommunication Systems and Networks
SYNTHESIZING MEDICAL TERMS – QUALITY AND NATURALNESS OF THE DEEP TEXT-TO-SPEECH ALGORITHM
Publication
- B. Kostek
- B. Szyca
- Journal of the Acoustical Society of America - Year 2023
The main purpose of this study is to develop a deep text-to-speech (TTS) algorithm designated for an embedded system device. First, a critical literature review of state-of-the-art speech synthesis deep models is provided. The algorithm implementation covers both hardware and algorithmic solutions. The algorithm is designed for use with the Raspberry Pi 4 board. 80 synthesized sentences were prepared based on medical and everyday...

Full text available to download
A non-uniform real-time speech time-scale stretching method
Publication
- A. Kupryjanow
- A. Czyżewski
- Year 2011
An algorithm for non-uniform real-time speech stretching is presented. It provides a combination of typical SOLA algorithm (Synchronous Overlap and Add ) with the vowels, consonants and silence detectors. Based on the information about the content and the estimated value of the rate of speech (ROS), the algorithm adapts the scaling factor value. The ability of real-time speech stretching and the resultant quality of voice were...
Problems of reinforcement designing for plates
Publication
- M. Cichocki
- Czasopismo Techniczne - Year 2002
Przedstawiono problem projektowania zbrojenia nietrajektorialnego płyt w aspekcie ich odkształcalności. Na podstawie niektórych wyników badań doświadczalnych, przeprowadzonych na żelbetowych płytach skręcanych, zweryfikowano procedury wymiarowania. Analiza wykazuje, że pomimo formalnego zapewnienia nośności przekroju płyt nietrajektorialnie zbrojonych, ich odkształcalność znacznie wzrasta. Aby zapewnić im sztywność na poziomie...
Emotions in polish speech recordings
Open Research Data
open access
- M. Mięsikowska
- D. Świsulski
The data set presents emotions recorded in sound files that are expressions of Polish speech. Statements were made by people aged 21-23, young voices of 5 men. Each person said the following words / nie – no, oddaj - give back, podaj – pass, stop - stop, tak - yes, trzymaj -hold / five times representing a specific emotion - one of three - anger (a),...
Intelligent processing of stuttered speech.
Publication
- A. Czyżewski
- A. Kaczmarek
- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Year 2003
W artykule zaprezentowano kilka metod analizy i automatycznego zliczania potknięć artykulacyjnych, związanych z jąkaniem się, opartych na wykorzystaniu algorytmów uczących się sztucznych sieci neuronowych i zbiorów przybliżonych.
A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces
Publication
- G. Tamulevicius
- G. Korvel
- A. B. Yayak
- P. Treigys
- J. Bernataviciene
- B. Kostek
- Electronics - Year 2020
In this research, a study of cross-linguistic speech emotion recognition is performed. For this purpose, emotional data of different languages (English, Lithuanian, German, Spanish, Serbian, and Polish) are collected, resulting in a cross-linguistic speech emotion dataset with the size of more than 10.000 emotional utterances. Despite the bi-modal character of the databases gathered, our focus is on the acoustic representation...

Full text available to download
Study on Speech Transmission under Varying QoS Parameters in a OFDM Communication System
Publication
- M. Zamłyńska
- P. Falkowski-Gilski
- G. Debita
- B. Miedziński
- Year 2021
Although there has been an outbreak of multiple multimedia platforms worldwide, speech communication is still the most essential and important type of service. With the spoken word we can exchange ideas, provide descriptive information, as well as aid to another person. As the amount of available bandwidth continues to shrink, researchers focus on novel types of transmission, based most often on multi-valued modulations, multiple...

Full text to download in external service
Communication Platform for Evaluation of Transmitted Speech Quality
Publication
- A. Ciarkowski
- A. Czyżewski
- Journal of Telecommunications and Information Technology - Year 2011
A voice communication system designed and implemented is described. The purpose of the presented platform was to enable a series of experiments related to the quality assessment of algorithms used in the coding and transmitting of speech. The system is equipped with tools for recording signals at each stage of processing, making it possible to subject them to subjective assessments by listening tests or, objective evaluation employing...

Full text available to download
Verification of selected calculation methods regarding shear strength in beams without web reinforcement
Publication
- M. Hirsz
- K. Nagrodzka-Godycka
- MATEC Web of Conferences - Year 2018
The purpose of the article was to compare selected calculation methods regarding shear strength in reinforced concrete beams without web reinforcement. Several calculation methods were tested. This included codes: PN-EN 1992-1-1:2008, ACI 318-14 and fib Model Code for Concrete Structures 2010. The analysis also consists of authorial methods published in technical literature. Calculations of shear strengths were made based on experimental...

Full text available to download
Transfer learning in imagined speech EEG-based BCIs
Publication
- J. S. Garcia Salinas
- L. Villaseñor-Pineda
- C. A. Reyes-Garćia
- A. A. Torres-García
- Biomedical Signal Processing and Control - Year 2019
The Brain–Computer Interfaces (BCI) based on electroencephalograms (EEG) are systems which aim is to provide a communication channel to any person with a computer, initially it was proposed to aid people with disabilities, but actually wider applications have been proposed. These devices allow to send messages or to control devices using the brain signals. There are different neuro-paradigms which evoke brain signals of interest...

Full text available to download
Results of tests on speech intelligibility in reverberant conditions
Open Research Data
open access
The dataset contains the results of tests that aimed to provide a relationship between the rate of speech (RoS) and reverberation conditions characterized by the Speech Transmission Index (STI).
Pitch estimation of narrowband-filtered speech signal using instantaneous complex frequency
Publication
- Elektronika : konstrukcje, technologie, zastosowania - Year 2008
In this paper we propose a novel method of pitch estimation, based on instantaneous complex frequency (ICF). New iterative algorithm for analysis of ICF of speech signal in presented. Obtained results are compared with commonly used methods to prove its accuracy and connection between ICF and pitch, particularly for narrowband-filtered speech signal.
Pitch estimation of narrowband-filtered speech signal using instantaneous complex frequency
Publication
- T. Bandurski
- Ł. Hamerski
- M. Papaj
- A. Paruzel
- K. Świder
- Year 2007
In this paper we propose a novel method of pitch estimation, based on instantaneous complex frequency (ICF). New iterative algorithm for analysis of ICF of speech signal in presented. Obtained results are compared with commonly used methods to prove its accuracy and connection between ICF and pitch, particularly for narrowband-filtered speech signal.
Automated detection of pronunciation errors in non-native English speech employing deep learning
Publication
- D. Korzekwa
- Year 2023
Despite significant advances in recent years, the existing Computer-Assisted Pronunciation Training (CAPT) methods detect pronunciation errors with a relatively low accuracy (precision of 60% at 40%-80% recall). This Ph.D. work proposes novel deep learning methods for detecting pronunciation errors in non-native (L2) English speech, outperforming the state-of-the-art method in AUC metric (Area under the Curve) by 41%, i.e., from...

Full text available to download
Weakly-Supervised Word-Level Pronunciation Error Detection in Non-Native English Speech
Publication
- D. Korzekwa
- J. Lorenzo-trueba
- T. Drugman
- S. Calamaro
- B. Kostek
- Year 2021
We propose a weakly-supervised model for word-level mispronunciation detection in non-native (L2) English speech. To train this model, phonetically transcribed L2 speech is not required and we only need to mark mispronounced words. The lack of phonetic transcriptions for L2 speech means that the model has to learn only from a weak signal of word-level mispronunciations. Because of that and due to the limited amount of mispronounced...

Full text available to download
Experiments and calibration of a bond-slip relation and efficiency factors for textile reinforcement in concrete
Publication
- A. Ścięgaj
- F. Larsson
- K. Lundgren
- CEMENT & CONCRETE COMPOSITES - Year 2022
Textile reinforcement yarns consist of many filaments, which can slip relative each other. At modelling of the global structural behaviour, interfilament slip in the yarns, and slip between the yarns and the concrete can be considered by efficiency factors for the stiffness and strength of the yarns, and by applying a bond-slip relation between yarns and concrete. In this work, an effective and robust method for calibration of...

Full text available to download
Mowa nienawiści (hate speech) a odpowiedzialność dostawców usług internetowych w orzecznictwie sądów europejskich
Publication
- K. Kowalik-Bańczyk
- Year 2015
The article analyses the phenomenon of hate speech in the Internet contrasted with the problem of responsability of Internet Service Providers for cases of such abuses of freedom of expression. The text provides an analysis of jurisprudence of two European Courts. On the one hand it presents the position of the European Court of Human Rights on the problem of hate speech: its definition and the liability for it as an exception...
The effect of multiaxial geocomposite reinforcement on fatigue performance and crack propagation delay in double-layered asphalt beams
Publication
- P. Jaskuła
- D. Ryś
- C. Szydłowski
- M. Golos
- K. Kornacka
- J. Żółtko
- J. Kawalec
- M. Stienss
- Road Materials and Pavement Design - Year 2023
The presented study investigates the effect of a recently developed multiaxial geocomposite made of polypropylene geogrid and non-woven fabric on the delay of crack propagation, based on four-point bending tests of large asphalt concrete beams – both for reinforced and non-reinforced specimens. Several approaches are described in this study, including analysis of stiffness modulus decrease and analysis of crack propagation using...

Full text available to download
Structure and Randomness in Planning and Reinforcement Learning
Publication
- K. Czechowski
- P. Januszewski
- P. Kozakowski
- Ł. Kuciński
- P. Miłoś
- Year 2021
Planning in large state spaces inevitably needs to balance the depth and breadth of the search. It has a crucial impact on the performance of a planner and most manage this interplay implicitly. We present a novel method \textit{Shoot Tree Search (STS)}, which makes it possible to control this trade-off more explicitly. Our algorithm can be understood as an interpolation between two celebrated search mechanisms: MCTS and random...

Full text to download in external service
Visual Lip Contour Detection for the Purpose of Speech Recognition
Publication
- Year 2014
A method for visual detection of lip contours in frontal recordings of speakers is described and evaluated. The purpose of the method is to facilitate speech recognition with visual features extracted from a mouth region. Different Active Appearance Models are employed for finding lips in video frames and for lip shape and texture statistical description. Search initialization procedure is proposed and error measure values are...
Objectivization of phonological evaluation of speech elements by means of audio parametrization
Publication
- Year 2018
This study addresses two issues related to both machine- and subjective-based speech evaluation by investigating five phonological phenomena related to allophone production. Its aim is to use objective parametrization and phonological classification of the recorded allophones. These allophones were selected as specifically difficult for Polish speakers of English: aspiration, final obstruent devoicing, dark lateral /l/, velar nasal...
TELECOMMUNICATION SYSTEMS

Journals

ISSN: 1018-4864 , eISSN: 1572-9451
Piotr Szczuko dr hab. inż.

People

Department of Multimedia Systems

Piotr Szczuko received his M.Sc. degree in 2002. His thesis was dedicated to examination of correlation phenomena between perception of sound and vision for surround sound and digital image. He finished Ph.D. studies in 2007 and one year later completed a dissertation "Application of Fuzzy Rules in Computer Character Animation" that received award of Prime Minister of Poland. His interests include: processing of audio and video, computer...
Human-computer interactions in speech therapy using a blowing interface
Publication
- Year 2014
In this paper we present a new human-computer interface for the quantitative measurement of blowing activities. The interface can measure the air flow and air pressure during the blowing activity. The measured values are stored and used to control the state of the graphical objects in the graphical user interface. In speech therapy children will find easier to play attractive therapeutic games than to perform repetitive and tedious,...

Full text to download in external service
Speech and Drama

Journals

ISSN: 0038-7142
LANGUAGE AND SPEECH

Journals

ISSN: 0023-8309 , eISSN: 1756-6053
Recognition of Emotions in Speech Using Convolutional Neural Networks on Different Datasets
Publication
- Electronics - Year 2022
Artificial Neural Network (ANN) models, specifically Convolutional Neural Networks (CNN), were applied to extract emotions based on spectrograms and mel-spectrograms. This study uses spectrograms and mel-spectrograms to investigate which feature extraction method better represents emotions and how big the differences in efficiency are in this context. The conducted studies demonstrated that mel-spectrograms are a better-suited...

Full text available to download
Cellulose Nanofibers Isolated from the Cuscuta Reflexa Plant as a Green Reinforcement of Natural Rubber
Publication
- M. Dominic C.D.
- R. Joseph
- P. S. Begum
- M. Joseph
- D. Padmanabhan
- L. A. Morris
- A. S. Kumar
- K. Formela
- Polymers - Year 2020
In the present work, we used the steam explosion method for the isolation of cellulose nanofiber (CNF) from Cuscuta reflexa, a parasitic plant commonly seen in Kerala and we evaluated its reinforcing efficiency in natural rubber (NR). Fourier Transform Infrared Spectroscopy (FTIR), X-Ray Diffraction (XRD), Scanning Electron Microscopy (SEM), Transmission Electron Microscopy (TEM), and Thermogravimetric analysis (TGA) techniques...

Full text available to download
Minimal transverse reinforcement of reinforced concrete members
Publication
- T. Godycki-Ćwirko
- M. Wesołowski
- Year 2005
W pierwszej części pracy omówiono zagadnienia dotyczące minimalnego zbrojenia na ścinanie elementów żelbetowych w kontekście norm europejskich oraz pozaeuropejskich. W drugiej części pracy dokonano analizy wyników badań eksperymentalnych dotyczących nośności elementów bez zbrojenia poprzecznego, które stanowią podstawę do weryfikacji zaleceń normowych w zakresie minimalnego zbrojenia na ścinanie.
Estimation of the short-term predictor parameters of speech under noisy conditions
Publication
- M. Kuropatwinski
- W. Kleijn
- M. Kuropatwiński
- IEEE Transactions on Audio Speech and Language Processing - Year 2006
Full text to download in external service

Search

Filters

Catalog

Search results for: SPEECH REINFORCEMENT SYSTEMS

Artur Gańcza mgr inż.

Jarosław Sadowski dr hab. inż.

Piotr Szczuko dr hab. inż.