Wyniki wyszukiwania dla: DYSARTHRIA DETECTION, SPEECH RECOGNITION, SPEECH SYNTHESIS, INTERPRETABLE DEEP LEARNING MODELS

Wyniki wyszukiwania dla: DYSARTHRIA DETECTION, SPEECH RECOGNITION, SPEECH SYNTHESIS, INTERPRETABLE DEEP LEARNING MODELS

wyników na stronę:
osadź ten widok na swojej stronie

Filtry

wszystkich: 412

wyczyść wszystkie filtry niedostępne

Optimized Deep Learning Model for Flood Detection Using Satellite Images
Publikacja
- A. Stateczny
- H. D. Praveena
- R. H. Krishnappa
- K. R. Chythanya
- B. B. Babysarojam
- Remote Sensing - Rok 2023
The increasing amount of rain produces a number of issues in Kerala, particularly in urban regions where the drainage system is frequently unable to handle a significant amount of water in such a short duration. Meanwhile, standard flood detection results are inaccurate for complex phenomena and cannot handle enormous quantities of data. In order to overcome those drawbacks and enhance the outcomes of conventional flood detection...

Pełny tekst do pobrania w portalu
Audiovisual speech recognition for training hearing impaired patients
Publikacja
- Rok 2006
Praca przedstawia system rozpoznawania izolowanych głosek mowy wykorzystujący dane wizualne i akustyczne. Modele Active Shape Models zostały wykorzystane do wyznaczania parametrów wizualnych na podstawie analizy kształtu i ruchu ust w nagraniach wideo. Parametry akustyczne bazują na współczynnikach melcepstralnych. Sieć neuronowa została użyta do rozpoznawania wymawianych głosek na podstawie wektora cech zawierającego oba typy...
Automatic Image and Speech Recognition Based on Neural Network
Publikacja
- D. Król
- B. Szlachetko
- Journal of Information Technology Research - Rok 2010
Pełny tekst do pobrania w serwisie zewnętrznym
LDNet: A Robust Hybrid Approach for Lie Detection Using Deep Learning Techniques
Publikacja
- S. A. Prome
- M. R. Islam
- D. Asirvatham
- N. A. Ragavan
- C. Sanín
- E. Szczerbicki
- CMC-Computers Materials & Continua - Rok 2024
Deception detection is regarded as a concern for everyone in their daily lives and affects social interactions. The human face is a rich source of data that offers trustworthy markers of deception. The deception or lie detection systems are non-intrusive, cost-effective, and mobile by identifying facial expressions. Over the last decade, numerous studies have been conducted on deception detection using several advanced techniques....

Pełny tekst do pobrania w serwisie zewnętrznym
Transient detection algorithms for speech coding applications
Publikacja
- G. Szwoch
- M. Kulesza
- A. Czyzewski
- Journal of the Acoustical Society of America - Rok 2006
Pełny tekst do pobrania w serwisie zewnętrznym
Influence of modulation detection threshold on speech intelligibility
Publikacja
- K. Leo
- ACTA PHYSICA POLONICA A - Rok 2011
Pełny tekst do pobrania w portalu
Deep learning-based waste detection in natural and urban environments
Publikacja
- S. Majchrowska
- A. Mikołajczyk-Bareła
- M. Ferlin
- Z. Klawikowska
- M. A. Plantykow
- A. Kwasigroch
- K. Majek
- WASTE MANAGEMENT - Rok 2022
Waste pollution is one of the most significant environmental issues in the modern world. The importance of recycling is well known, both for economic and ecological reasons, and the industry demands high efficiency. Current studies towards automatic waste detection are hardly comparable due to the lack of benchmarks and widely accepted standards regarding the used metrics and data. Those problems are addressed in this article by...

Pełny tekst do pobrania w portalu
Comprehensive Evaluation of Statistical Speech Waveform Synthesis
Publikacja
- T. Merritt
- B. Putrycz
- A. Nadolski
- T. Ye
- D. Korzekwa
- W. Dolecki
- T. Drugman
- V. Klimkov
- A. Moinet
- A. Breen... i 3 innych
- Rok 2018
Pełny tekst do pobrania w serwisie zewnętrznym
Transfer learning in imagined speech EEG-based BCIs
Publikacja
- J. S. Garcia Salinas
- L. Villaseñor-Pineda
- C. A. Reyes-Garćia
- A. A. Torres-García
- Biomedical Signal Processing and Control - Rok 2019
The Brain–Computer Interfaces (BCI) based on electroencephalograms (EEG) are systems which aim is to provide a communication channel to any person with a computer, initially it was proposed to aid people with disabilities, but actually wider applications have been proposed. These devices allow to send messages or to control devices using the brain signals. There are different neuro-paradigms which evoke brain signals of interest...

Pełny tekst do pobrania w portalu
Auditory-model based robust feature selection for speech recognition
Publikacja
- C. Koniaris
- M. Kuropatwinski
- W. Kleijn
- M. Kuropatwiński
- Journal of the Acoustical Society of America - Rok 2010
Pełny tekst do pobrania w serwisie zewnętrznym
Detection of dialogue in movie soundtrack for speech intelligibility enhancement
Publikacja
- K. Łopatka
- Rok 2014
A method for detecting dialogue in 5.1 movie soundtrack based on interchannel spectral disparity is presented. The front channel signals (left, right, center) are analyzed in the frequency domain. The selected partials in the center channel signal, which yield high disparity with left and right channels, are detected as dialogue. Subsequently, the dialogue frequency components are boosted to achieve increased dialogue intelligibility....

Pełny tekst do pobrania w serwisie zewnętrznym
Using Isolation Forest and Alternative Data Products to Overcome Ground Truth Data Scarcity for Improved Deep Learning-based Agricultural Land Use Classification Models
Publikacja
- A. Pereira García
- L. Porwol
- A. Ojo
- Rok 2023
High-quality labelled datasets represent a cornerstone in the development of deep learning models for land use classification. The high cost of data collection, the inherent errors introduced during data mapping efforts, the lack of local knowledge, and the spatial variability of the data hinder the development of accurate and spatially-transferable deep learning models in the context of agriculture. In this paper, we investigate...

Pełny tekst do pobrania w serwisie zewnętrznym
Combining visual and acoustic modalities to ease speech recognition by hearing impaired people
Publikacja
- B. Kostek
- P. Dalka
- Rok 2005
Artykuł prezentuje system, którego celem działania jest ułatwienie procesu treningu poprawnej wymowy dla osób z poważnymi wadami słuchu. W analizie mowy wykorzystane zostały parametry akutyczne i wizualne. Do wyznaczenia parametrów wizualnych na podstawie kształtu i ruchu ust zostały wykorzystane modele Active Shape Models. Parametry akustyczne bazują na współczynnikach melcepstralnych. Do klasyfikacji wypowiadanych głosek została...
Real-Time Facial Features Detection from Low Resolution Thermal Images with Deep Classification Models
Publikacja
- Journal of Medical Imaging and Health Informatics - Rok 2018
Deep networks have already shown a spectacular success for object classification and detection for various applications from everyday use cases to advanced medical problems. The main advantage of the classification models over the detection models is less time and effort needed for dataset preparation, because classification networks do not require bounding box annotations, but labels at the image level only. Yet, after passing...

Pełny tekst do pobrania w serwisie zewnętrznym
Improving platelet‐RNA‐based diagnostics: a comparative analysis of machine learning models for cancer detection and multiclass classification
Publikacja
- M. A. Jopek
- K. Pastuszak
- M. Sieczczyński
- S. Cygert
- A. J. Żaczek
- M. T. Rondina
- A. Supernat
- Molecular Oncology - Rok 2024
Liquid biopsy demonstrates excellent potential in patient management by providing a minimally invasive and cost-effective approach to detecting and monitoring cancer, even at its early stages. Due to the complexity of liquid biopsy data, machine-learning techniques are increasingly gaining attention in sample analysis, especially for multidimensional data such as RNA expression profiles. Yet, there is no agreement in the community...

Pełny tekst do pobrania w portalu
AGAR a Microbial Colony Dataset for Deep Learning Detection
Publikacja
- S. Majchrowska
- J. Pawlowski
- G. Gula
- T. Bonus
- A. Hanas
- A. Loch
- A. Pawlak
- J. Roszkowiak
- T. Golan
- Z. Drulis-Kawa
- Rok 2021
Pełny tekst do pobrania w serwisie zewnętrznym
Bożena Kostek prof. dr hab. inż.

Osoby

Laboratorium Akustyki Fonicznej
Deep learning-based waste detection in natural and urban environments
Publikacja
- S. Majchrowska
- A. Mikołajczyk
- M. Ferlin
- Z. Klawikowska
- M. Plantykow
- A. Kwasigroch
- K. Majek
- WASTE MANAGEMENT - Rok 2022
Pełny tekst do pobrania w serwisie zewnętrznym
Influence of Thermal Imagery Resolution on Accuracy of Deep Learning based Face Recognition
Publikacja
- Rok 2019
Human-system interactions frequently require a retrieval of the key context information about the user and the environment. Image processing techniques have been widely applied in this area, providing details about recognized objects, people and actions. Considering remote diagnostics solutions, e.g. non-contact vital signs estimation and smart home monitoring systems that utilize person’s identity, security is a very important factor....

Pełny tekst do pobrania w portalu
High-resolution synthesis of high-density breast mammograms: Application to improved fairness in deep learning based mass detection
Publikacja
- L. Garrucho
- K. Kushibar
- R. Osuala
- O. Diaz
- A. Catanese
- J. del
- M. Bobowicz
- F. Strand
- L. Igual
- K. Lekadir
- Frontiers in Oncology - Rok 2023
Pełny tekst do pobrania w serwisie zewnętrznym
DEEP LEARNING BASED ON X-RAY IMAGING IMPROVES COXARTHROSIS DETECTION
Publikacja
- M. Maj
- J. Borkowski
- J. Wasilewski
- S. Hrynowiecka
- A. Kastrau
- M. Liksza
- P. Jasik
- M. Treder
- Rok 2022
Objective: The purpose of the study was to create an Artificial Neural Network (ANN) based on X-ray images of the pelvis, as an additional tool to automate and improve the diagnosis of coxarthrosis. The research is focused on joint space narrowing, which is a radiological symptom showing the thinning of the articular cartilage layer, which is translucent to X-rays. It is the first and the most important of the radiological signs...

Pełny tekst do pobrania w serwisie zewnętrznym
Andrzej Czyżewski prof. dr hab. inż.

Osoby

Katedra Systemów Multimedialnych

Prof. zw. dr hab. inż. Andrzej Czyżewski jest absolwentem Wydziału Elektroniki PG (studia magisterskie ukończył w 1982 r.). Pracę doktorską na temat związany z dźwiękiem cyfrowym obronił z wyróżnieniem na Wydziale Elektroniki PG w roku 1987. W 1992 r. przedstawił rozprawę habilitacyjną pt.: „Cyfrowe operacje na sygnałach fonicznych”. Jego kolokwium habilitacyjne zostało przyjęte jednomyślnie w czerwcu 1992 r. w Akademii Górniczo-Hutniczej...
Data-driven Models for Predicting Compressive Strength of 3D-printed Fiber-Reinforced Concrete using Interpretable Machine Learning Algorithms
Publikacja
- M. Arif
- F. Jan
- A. Rezzoug
- M. A. Afridi
- M. Luqman
- W. A. Khan
- M. Kujawa
- H. Alabduljabbar
- M. Khan
- Case Studies in Construction Materials - Rok 2024
3D printing technology is growing swiftly in the construction sector due to its numerous benefits, such as intricate designs, quicker construction, waste reduction, environmental friendliness, cost savings, and enhanced safety. Nevertheless, optimizing the concrete mix for 3D printing is a challenging task due to the numerous factors involved, requiring extensive experimentation. Therefore, this study used three machine learning...

Pełny tekst do pobrania w serwisie zewnętrznym
Orientation-aware ship detection via a rotation feature decoupling supported deep learning approach
Publikacja
- X. Chen
- H. Wu
- B. Han
- W. Liu
- J. Montewka
- R. W. Liu
- ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE - Rok 2023
Ship imaging position plays an important role in visual navigation, and thus significant focuses have been paid to accurately extract ship imaging positions in maritime videos. Previous studies are mainly conducted in the horizontal ship detection manner from maritime image sequences. This can lead to unsatisfied ship detection performance due to that some background pixels maybe wrongly identified as ship contours. To address...

Pełny tekst do pobrania w serwisie zewnętrznym
A Novel Spatio–Temporal Deep Learning Vehicle Turns Detection Scheme Using GPS-Only Data
Publikacja
- M. A. Rahim
- S. D. Khan
- S. Khan
- M. Rashid
- R. Ullah
- H. Tariq
- S. Czapp
- IEEE Access - Rok 2023
Whether the computer is driving your car or you are, advanced driver assistance systems (ADAS) come into play on all levels, from weather monitoring to safety. These modern-day ADASs use various assisting tools for drivers to keep the journey safe; these sophisticated tools provide early signals of numerous events, such as road conditions, emerging traffic scenarios, and weather warnings. Many urban applications, such as car-sharing...

Pełny tekst do pobrania w portalu
Introduction to the special issue on machine learning in acoustics
Publikacja
- Z. Michalopoulou
- P. Gerstoft
- B. Kostek
- M. A. Roch
- Journal of the Acoustical Society of America - Rok 2021
When we started our Call for Papers for a Special Issue on “Machine Learning in Acoustics” in the Journal of the Acoustical Society of America, our ambition was to invite papers in which machine learning was applied to all acoustics areas. They were listed, but not limited to, as follows: • Music and synthesis analysis • Music sentiment analysis • Music perception • Intelligent music recognition • Musical source separation • Singing...

Pełny tekst do pobrania w portalu
Modeling of medium flow processes in transportation pipelines - the synthesis of their state-space models and the analysis of the mathematical properties of the models for leak detection purposes
Publikacja
- M. S. Tatara
- Rok 2019
The dissertation concerns the issue of modeling the pipeline flow process under incompressible and isothermal conditions, with a target application to the leak detection and isolation systems. First, an introduction to the model-based process diagnostics is provided, where its basic terminology, tools, and methods are described. In the following chapter, a review of the state of the art in the field of leak detection and isolation...
User Orientation Detection in Relation to Antenna Geometry in Ultra-Wideband Wireless Body Area Networks Using Deep Learning
Publikacja
- S. Urwan
- K. Cwalina
- SENSORS - Rok 2024
In this paper, the issue of detecting a user’s position in relation to the antenna geometry in ultra-wideband (UWB) off-body wireless body area network (WBAN) communication using deep learning methods is presented. To measure the impulse response of the channel, a measurement stand consisting of EVB1000 devices and DW1000 radio modules was developed and indoor static measurement scenarios were performed. It was proven that for...

Pełny tekst do pobrania w portalu
Jan Daciuk dr hab. inż.

Osoby

Katedra Inteligentnych Systemów Interaktywnych

Jan Daciuk uzyskał tytuł zawodowy magistra na Wydziale Elektroniki Politechniki Gdańskiej w 1986 roku, a doktorat na wydziale Elektroniki, Telekomunikacji i Informatyki PG w 1999. Pracuje na Wydziale od 1988 roku. Jego zainteresowania naukowe obejmują zastosowania automatów skończonych w przetwarzaniu języka naturalnego i przetwarzaniu mowy. Spędził ponad cztery lata w europejskich uniwersytetach i instytutach naukowych, takich...
Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention
Publikacja
- D. Korzekwa
- R. Barra-Chicote
- S. Zaporowski
- G. Beringer
- J. Lorenzo-trueba
- A. Serafinowicz
- J. Droppo
- T. Drugman
- B. Kostek
- Rok 2021
This paper describes two novel complementary techniques that improve the detection of lexical stress errors in non-native (L2) English speech: attention-based feature extraction and data augmentation based on Neural Text-To-Speech (TTS). In a classical approach, audio features are usually extracted from fixed regions of speech such as the syllable nucleus. We propose an attention-based deep learning model that automatically de...

Pełny tekst do pobrania w portalu
Analysis-by-synthesis paradigm evolved into a new concept
Publikacja
- B. Kostek
- Journal of the Acoustical Society of America - Rok 2022
This work aims at showing how the well-known analysis-by-synthesis paradigm has recently been evolved into a new concept. However, in contrast to the original idea stating that the created sound should not fail to pass the foolproof synthesis test, the recent development is a consequence of the need to create new data. Deep learning models are greedy algorithms requiring a vast amount of data that, in addition, should be correctly...

Pełny tekst do pobrania w serwisie zewnętrznym
Predicting the Purchase of Electricity Prices for Renewable Energy Sources Based on Polish Power Grids Data Using Deep Learning Models for Controlling Small Hybrid PV Microinstallations
Publikacja
- M. Pikus
- J. Wąs
- Rok 2023
Pełny tekst do pobrania w serwisie zewnętrznym
IEEE Automatic Speech Recognition and Understanding Workshop

Konferencje
ISCA Tutorial and Research Workshop Automatic Speech Recognition

Konferencje
Biometria i przetwarzanie mowy 2023
Kursy Online
- J. Daciuk
{mlang pl} Celem kursu jest zapoznanie studentów z: metodami ustalania i potwierdzania tożsamości ludzi na podstawie mierzalnych cech organizmu cechami mowy ludzkiej, w szczególności polskiej metodami rozpoznawania mowy metodami syntezy mowy {mlang} {mlang en} The aim of the course is to familiarize the students with: methods of identification and verification of identity of people based on measurable features of their...
Biometria i przetwarzanie mowy 2024
Kursy Online
- J. Daciuk
{mlang pl} Celem kursu jest zapoznanie studentów z: metodami ustalania i potwierdzania tożsamości ludzi na podstawie mierzalnych cech organizmu cechami mowy ludzkiej, w szczególności polskiej metodami rozpoznawania mowy metodami syntezy mowy {mlang} {mlang en} The aim of the course is to familiarize the students with: methods of identification and verification of identity of people based on measurable features of their...
Material for Automatic Phonetic Transcription of Speech Recorded in Various Conditions
Publikacja
- Rok 2016
Automatic speech recognition (ASR) is under constant development, especially in cases when speech is casually produced or it is acquired in various environment conditions, or in the presence of background noise. Phonetic transcription is an important step in the process of full speech recognition and is discussed in the presented work as the main focus in this process. ASR is widely implemented in mobile devices technology, but...

Pełny tekst do pobrania w serwisie zewnętrznym
Mispronunciation Detection in Non-Native (L2) English with Uncertainty Modeling
Publikacja
- D. Korzekwa
- J. Lorenzo-trueba
- S. Zaporowski
- S. Calamaro
- T. Drugman
- B. Kostek
- Rok 2021
A common approach to the automatic detection of mispronunciation in language learning is to recognize the phonemes produced by a student and compare it to the expected pronunciation of a native speaker. This approach makes two simplifying assumptions: a) phonemes can be recognized from speech with high accuracy, b) there is a single correct way for a sentence to be pronounced. These assumptions do not always hold, which can result...

Pełny tekst do pobrania w serwisie zewnętrznym
Time-domain prosodic modifications for text-to-speech synthesizer
Publikacja
- J. Łopatka
- P. Suchomski
- A. Czyżewski
- Rok 2010
An application of prosodic speech processing algorithms to Text-To-Speech synthesis is presented. Prosodic modifications that improve the naturalness of the synthesized signal are discussed. The applied method is based on the TD-PSOLA algorithm. The developed Text-To-Speech Synthesizer is used in applications employing multimodal computer interfaces.
Speech codec enhancements utilizing time compression and perceptual coding
Publikacja
- M. Kulesza
- A. Czyżewski
- Rok 2007
A method for encoding wideband speech signal employing standardized narrowband speech codecs is presented as well as experimental results concerning detection of tonal spectral components. The speech signal sampled with a higher sampling rate than it is suitable for narrowband coding algorithm is compressed in order to decrease the amount of samples. Next, the time-compressed representation of a signal is encoded using a narrowband...
Enhanced voice user interface employing spatial filtration of signals from acoustic vector sensor
Publikacja
- Rok 2015
Spatial filtration of sound is introduced to enhance speech recognition accuracy in noisy conditions. An acoustic vector sensor (AVS) is employed. The signals from the AVS probe are processed in order to attenuate the surrounding noise. As a result the signal to noise ratio is increased. An experiment is featured in which speech signals are disturbed by babble noise. The signals before and after spatial filtration are processed...

Pełny tekst do pobrania w serwisie zewnętrznym
KORPUS MOWY ANGIELSKIEJ DO CELÓW MULTIMODALNEGO AUTOMATYCZNEGO ROZPOZNAWANIA MOWY
Publikacja
- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Rok 2016
W referacie zaprezentowano audiowizualny korpus mowy zawierający 31 godzin nagrań mowy w języku angielskim. Korpus dedykowany jest do celów automatycznego audiowizualnego rozpoznawania mowy. Korpus zawiera nagrania wideo pochodzące z szybkoklatkowej kamery stereowizyjnej oraz dźwięk zarejestrowany przez matrycę mikrofonową i mikrofon komputera przenośnego. Dzięki uwzględnieniu nagrań zarejestrowanych w warunkach szumowych korpus...
Marking the Allophones Boundaries Based on the DTW Algorithm
Publikacja
- J. Rafałko
- Rok 2018
The paper presents an approach to marking the boundaries of allophones in the speech signal based on the Dynamic Time Warping (DTW) algorithm. Setting and marking of allophones boundaries in continuous speech is a difficult issue due to the mutual influence of adjacent phonemes on each other. It is this neighborhood on the one hand that creates variants of phonemes that is allophones, and on the other hand it affects that the border...
Methodology and technology for the polymodal allophonic speech transcription
Publikacja
- Journal of the Acoustical Society of America - Rok 2016
A method for automatic audiovisual transcription of speech employing: acoustic and visual speech representations is developed. It adopts a combining of audio and visual modalities, which provide a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e. the changes in the articulatory setting of speech organs for...

Pełny tekst do pobrania w serwisie zewnętrznym
Methodology and technology for the polymodal allophonic speech transcription
Publikacja
- Journal of the Acoustical Society of America - Rok 2016
A method for automatic audiovisual transcription of speech employing: acoustic, electromagnetical articulography and visual speech representations is developed. It adopts a combining of audio and visual modalities, which provide a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e., the changes in the articulatory...

Pełny tekst do pobrania w serwisie zewnętrznym
Investigating Noise Interference on Speech Towards Applying the Lombard Effect Automatically
Publikacja
- G. Korvel
- K. Kąkol
- P. Treigys
- B. Kostek
- Rok 2022
The aim of this study is two-fold. First, we perform a series of experiments to examine the interference of different noises on speech processing. For that purpose, we concentrate on the Lombard effect, an involuntary tendency to raise speech level in the presence of background noise. Then, we apply this knowledge to detecting speech with the Lombard effect. This is for preparing a dataset for training a machine learning-based...

Pełny tekst do pobrania w portalu
Improved method for real-time speech stretching
Publikacja
- A. Kupryjanow
- A. Czyżewski
- Rok 2012
n algorithm for real-time speech stretching is presented. It was designed to modify input signal dependently on its content and on its relation with the historical input data. The proposed algorithm is a combination of speech signal analysis algorithms, i.e. voice, vowels/consonants, stuttering detection and SOLA (Synchronous-Overlap-and-Add) based speech stretching algorithm. This approach enables stretching input speech signal...

Pełny tekst do pobrania w serwisie zewnętrznym
High quality speech codec employing sines+noise+transients model
Publikacja
- Archives of Acoustics - Rok 2006
A method of high quality wideband speech signal representation employing sines+transients+noise model is presented. The need for a wideband speech coding approach as well as various methods for analysis and synthesis of sines, residual and transient states of speech signal is discussed. The perceptual criterion is applied in the proposed approach during encoding of sines amplitudes in order to reduce bandwidth requirements and...

Pełny tekst do pobrania w portalu
Applying the Lombard Effect to Speech-in-Noise Communication
Publikacja
- G. Korvel
- K. Kąkol
- P. Treigys
- B. Kostek
- Electronics - Rok 2023
This study explored how the Lombard effect, a natural or artificial increase in speech loudness in noisy environments, can improve speech-in-noise communication. This study consisted of several experiments that measured the impact of different types of noise on synthesizing the Lombard effect. The main steps were as follows: first, a dataset of speech samples with and without the Lombard effect was collected in a controlled setting;...

Pełny tekst do pobrania w portalu
Real-time speech-rate modification experiments
Publikacja
- A. Kupryjanow
- A. Czyżewski
- Rok 2010
An algorithm designed for real-time speech time scale modification (stretching) is proposed, providing a combination of typical synchronous overlap and add based time scale modification algorithm and signal redundancy detection algorithms that allow to remove parts of the speech signal and replace them with the stretched speech signal fragments. Effectiveness of signal processing algorithms are examined experimentally together...

Pełny tekst do pobrania w serwisie zewnętrznym

Wyszukiwarka

Filtry

Katalog

Wyniki wyszukiwania dla: DYSARTHRIA DETECTION, SPEECH RECOGNITION, SPEECH SYNTHESIS, INTERPRETABLE DEEP LEARNING MODELS

Bożena Kostek prof. dr hab. inż.

Andrzej Czyżewski prof. dr hab. inż.

Jan Daciuk dr hab. inż.