Search results for: IMAGINED SPEECH



The displayed results come from an alternative search.


Total: 2633


Showing the top 1000 results

Time-scale modification of speech signals for supporting hearing impaired schoolchildren
Publication
- A. Kupryjanow
- A. Czyżewski
- Year 2009
A study of time-scale modification algorithms applied to supporting hearing-impaired schoolchildren is presented. A variety of algorithms is considered, namely: overlap and add, two variations of synchronized overlap and add, and the phase vocoder. Their effectiveness as well as real-time processing capabilities are examined.
Corrupted speech intelligibility improvement using adaptive filter based algorithm
Publication
- D. Ellwart
- A. Czyżewski
- Year 2010
A technique for improving the quality of speech signals recorded in strong noise is presented. The proposed algorithm employing adaptive filtration is described and additional possibilities of speech intelligibility improvement are discussed. Results of the tests are presented.
Estimation of the short-term predictor parameters of speech under noisy conditions
Publication
- M. Kuropatwinski
- W. Kleijn
- M. Kuropatwiński
- IEEE Transactions on Audio Speech and Language Processing - Year 2006
Full text available for download from an external service
A non-uniform real-time speech time-scale stretching method
Publication
- A. Kupryjanow
- A. Czyżewski
- Year 2011
An algorithm for non-uniform real-time speech stretching is presented. It combines the typical SOLA (Synchronous Overlap and Add) algorithm with vowel, consonant and silence detectors. Based on the information about the content and the estimated value of the rate of speech (ROS), the algorithm adapts the scaling factor value. The ability of real-time speech stretching and the resultant quality of voice were...
Speech formant frequency and pitch estimation using instantaneous complex frequency
Publication
- M. Kaniewska
- Year 2008
The paper describes an algorithm for estimating the fundamental frequency and the center frequencies and bandwidths of speech formants using instantaneous complex frequency. The article also presents the results of the algorithm for Polish vowels.
SYNTHESIZING MEDICAL TERMS – QUALITY AND NATURALNESS OF THE DEEP TEXT-TO-SPEECH ALGORITHM
Publication
- B. Kostek
- B. Szyca
- Journal of the Acoustical Society of America - Year 2023
The main purpose of this study is to develop a deep text-to-speech (TTS) algorithm designated for an embedded system device. First, a critical literature review of state-of-the-art speech synthesis deep models is provided. The algorithm implementation covers both hardware and algorithmic solutions. The algorithm is designed for use with the Raspberry Pi 4 board. 80 synthesized sentences were prepared based on medical and everyday...

Full text available for download in the portal
Analysis of 2D Feature Spaces for Deep Learning-based Speech Recognition
Publication
- G. Korvel
- P. Treigys
- G. Tamulevicius
- J. Bernataviciene
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2018
convolutional neural network (CNN) which is a class of deep, feed-forward artificial neural network. We decided to analyze audio signal feature maps, namely spectrograms, linear and Mel-scale cepstrograms, and chromagrams. The choice was made upon the fact that CNN performs well in 2D data-oriented processing contexts. Feature maps were employed in the Lithuanian word recognition task. The spectral analysis led to the highest word...
Comparison of Acoustic and Visual Voice Activity Detection for Noisy Speech Recognition
Publication
- Year 2016
The problem of accurately differentiating between the speaker utterance and the noise parts in a speech signal is considered. The influence of utilizing voice activity detection in speech signals on the accuracy of the automatic speech recognition (ASR) system is presented. The examined methods of voice activity detection are based on acoustic and visual modalities. The problem of detecting the voice activity in clean and noisy...
Recognition of Emotions in Speech Using Convolutional Neural Networks on Different Datasets
Publication
- Electronics - Year 2022
Artificial Neural Network (ANN) models, specifically Convolutional Neural Networks (CNN), were applied to extract emotions based on spectrograms and mel-spectrograms. This study uses spectrograms and mel-spectrograms to investigate which feature extraction method better represents emotions and how big the differences in efficiency are in this context. The conducted studies demonstrated that mel-spectrograms are a better-suited...

Full text available for download in the portal
Investigating Noise Interference on Speech Towards Applying the Lombard Effect Automatically
Publication
- G. Korvel
- K. Kąkol
- P. Treigys
- B. Kostek
- Year 2022
The aim of this study is two-fold. First, we perform a series of experiments to examine the interference of different noises on speech processing. For that purpose, we concentrate on the Lombard effect, an involuntary tendency to raise speech level in the presence of background noise. Then, we apply this knowledge to detecting speech with the Lombard effect. This is for preparing a dataset for training a machine learning-based...

Full text available for download in the portal
An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics
Publication
- G. Korvel
- O. Kurasova
- B. Kostek
- Year 2019
The speech with the Lombard effect has been extensively studied in the context of speech recognition or speech enhancement. However, few studies have investigated the Lombard effect in the context of speech synthesis. The aim of this paper is to create a mathematical model that allows for retaining the Lombard effect. These models could be used as a basis of a formant speech synthesizer. The proposed models are based on dividing...

Full text available for download in the portal
Pursuing the Deep-Learning-Based Classification of Exposed and Imagined Colors from EEG
Publication
- A. A. Torres-García
- J. S. Garcia Salinas
- L. Villaseñor-Pineda
- LECTURE NOTES IN COMPUTER SCIENCE - Year 2022
EEG-based brain-computer interfaces are systems aiming to integrate disabled people into their environments. Nevertheless, their control could not be intuitive or depend on an active external stimulator to generate the responses for interacting with it. Targeting the second issue, a novel paradigm is explored in this paper, which depends on a passive stimulus by measuring the EEG responses of a subject to the primary colors (red,...

Full text available for download from an external service
Canadian Journal of Speech-Language Pathology and Audiology

Journals

ISSN: 1913-2018
Pitch estimation of narrowband-filtered speech signal using instantaneous complex frequency
Publication
- T. Bandurski
- Ł. Hamerski
- M. Papaj
- A. Paruzel
- K. Świder
- Year 2007
In this paper we propose a novel method of pitch estimation, based on instantaneous complex frequency (ICF). A new iterative algorithm for the analysis of the ICF of a speech signal is presented. The obtained results are compared with commonly used methods to prove its accuracy and the connection between ICF and pitch, particularly for narrowband-filtered speech signals.
Pitch estimation of narrowband-filtered speech signal using instantaneous complex frequency
Publication
- Elektronika : konstrukcje, technologie, zastosowania - Year 2008
In this paper we propose a novel method of pitch estimation, based on instantaneous complex frequency (ICF). A new iterative algorithm for the analysis of the ICF of a speech signal is presented. The obtained results are compared with commonly used methods to prove its accuracy and the connection between ICF and pitch, particularly for narrowband-filtered speech signals.
Improving signal quality of a speech codec using hybrid perceptual-parametric algorithm
Publication
- International Journal of Intelligent Information and Database Systems - Year 2008
The article presents a hybrid parametric-perceptual architecture of a speech codec. It is based on a CELP codec supported by a perceptual codec. The aim of the proposed method is to improve the quality of speech signal coding. Two architectures were studied; in one of them, the voiced parts of the residual signal of the CELP codec are coded perceptually. The second of the proposed codecs performs...

Full text available for download from an external service
Combining visual and acoustic modalities to ease speech recognition by hearing impaired people
Publication
- B. Kostek
- P. Dalka
- Year 2005
The article presents a system intended to facilitate correct-pronunciation training for people with severe hearing impairments. Acoustic and visual parameters were used in the speech analysis. Active Shape Models were used to determine the visual parameters from the shape and movement of the lips. The acoustic parameters are based on mel-cepstral coefficients. For the classification of the uttered speech sounds, ...
A survey of automatic speech recognition deep models performance for Polish medical terms
Publication
- Year 2023
Among the numerous applications of speech-to-text technology is the support of documentation created by medical personnel. There are many available speech recognition systems for doctors. Their effectiveness in languages such as Polish should be verified. In connection with our project in this field, we decided to check how well the popular speech recognition systems work, employing models trained for the general Polish language....

Full text available for download from an external service
Elimination of clicks from archive speech signals using sparse autoregressive modeling
Publication
- M. Niedźwiecki
- M. Ciołek
- Year 2012
This paper presents a new approach to the elimination of impulsive disturbances from archive speech signals. The proposed sparse autoregressive (SAR) signal representation is given in a factorized form: the model is a cascade of the so-called formant filter and pitch filter. Such a technique has been widely used in code-excited linear prediction (CELP) systems, as it guarantees model stability. After detection of noise pulses using linear...

Full text available for download from an external service
Analysis of Lombard speech using parameterization and the objective quality indicators in noise conditions
Publication
- K. Kąkol
- G. Korvel
- B. Kostek
- Year 2018
The aim of the work is to analyze the Lombard speech effect in recordings and then modify the speech signal in order to improve objective speech quality indicators after mixing the useful signal with noise or with an interfering signal. The modifications made to the signal are based on the characteristics of Lombard speech, and in particular on the effect of increasing the fundamental frequency...
Computer-assisted pronunciation training—Speech synthesis is almost all you need
Publication
- D. Korzekwa
- J. Lorenzo-trueba
- T. Drugman
- B. Kostek
- SPEECH COMMUNICATION - Year 2022
The research community has long studied computer-assisted pronunciation training (CAPT) methods in non-native speech. Researchers focused on studying various model architectures, such as Bayesian networks and deep learning methods, as well as on the analysis of different representations of the speech signal. Despite significant progress in recent years, existing CAPT methods are not able to detect pronunciation errors with high...

Full text available for download in the portal
A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces
Publication
- G. Tamulevicius
- G. Korvel
- A. B. Yayak
- P. Treigys
- J. Bernataviciene
- B. Kostek
- Electronics - Year 2020
In this research, a study of cross-linguistic speech emotion recognition is performed. For this purpose, emotional data of different languages (English, Lithuanian, German, Spanish, Serbian, and Polish) are collected, resulting in a cross-linguistic speech emotion dataset with the size of more than 10.000 emotional utterances. Despite the bi-modal character of the databases gathered, our focus is on the acoustic representation...

Full text available for download in the portal
Study on Speech Transmission under Varying QoS Parameters in a OFDM Communication System
Publication
- M. Zamłyńska
- P. Falkowski-Gilski
- G. Debita
- B. Miedziński
- Year 2021
Although there has been an outbreak of multiple multimedia platforms worldwide, speech communication is still the most essential and important type of service. With the spoken word we can exchange ideas, provide descriptive information, as well as aid to another person. As the amount of available bandwidth continues to shrink, researchers focus on novel types of transmission, based most often on multi-valued modulations, multiple...

Full text available for download from an external service
Database of speech and facial expressions recorded with optimized face motion capture settings
Publication
- A. Czyżewski
- M. Kawaler
- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Year 2019
The broad objective of the present research is the analysis of spoken English employing a multiplicity of modalities. An important stage of this process, discussed in the paper, is creating a database of speech accompanied with facial expressions. Recordings of speakers were made using an advanced system for capturing facial muscle motion. A brief historical outline, current applications, limitations and the ways of capturing face...

Full text available for download in the portal
Infancias Imagenes

Journals

ISSN: 1657-9089
Documenting the de-identification process of clinical and imaging data for AI for health imaging projects
Publication
- H. Kondylakis
- R. Catalan
- S. Alabart
- C. Barelle
- P. Bizopoulos
- M. Bobowicz
- J. Bona
- D. Fotiadis
- T. Garcia
- I. Gomez... and 18 others
- Insights into Imaging - Year 2024
Full text available for download from an external service
Fluorescence imaging spectroscopy and microscopy
Publication
- M. Schlegel-Zawadzka
- SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY - Year 1997
Full text available for download from an external service
Problems of 3d breast imaging
Publication
- M. Moderhak
- Year 2008
This article presents the idea of a stereoscopic system for measuring the three-dimensional geometry of an examined object. Processing the measurements in this way will make it possible to simulate temperature distributions on the body surface. Problems related to the construction of such a device are discussed.
Imaging forms in passive sonars
Publication
- A. Raganowicz
- L. Kilian
- Year 2005
The paper continues the topic of the organization of imaging in modern sonar systems, which was presented for active systems in an invited paper at last year's OSA. This time it concerns the organization of imaging in passive systems. In practice, apart from simple systems for listening to and recording underwater sounds, used by researchers of aquatic fauna or by oceanographers, passive systems are used in navies...
IR-THERMAL IMAGING IN CARDIOSURGERY
Publication
- M. Kaczmarek
- Year 2013
A method for monitoring the state of the myocardium during cardiosurgical interventions based on thermal IR imaging is presented. These methods, called Static Thermography and Active Dynamic Thermography (ADT), use information about the distribution of temperature on the surface and an external excitation source to induce thermal transient processes in a tested object. Recording time series of thermograms allows one to calculate parametric...
Underwater Acoustic Imaging of the Sea
Publication
- G. Grelowska
- E. Kozaczka
- Archives of Acoustics - Year 2014
Acoustic waves are a carrier of information mainly in environments where the use of other types of waves, for example electromagnetic waves, is limited. The term acoustical imaging is widely used in ultrasonic engineering for imaging areas in which acoustic waves propagate. In particular, ultrasound is widely used in the visualization of human organs - ultrasonography (Nowicki, 2010). Expanding the concept, acoustical imaging...

Full text available for download in the portal
Radar and Sonar Imaging and Processing
Publication
- A. Stateczny
- W. Kazimierski
- K. Kulpa
- Remote Sensing - Year 2020
The 21 papers (from 61 submitted) published in the Special Issue “Radar and Sonar Imaging Processing” highlighted a variety of topics related to remote sensing with radar and sonar sensors. The sequence of articles included in the SI dealt with a broad profile of aspects of the use of radar and sonar images in line with the latest scientific trends. The latest developments in science, including artificial intelligence, were used.

Full text available for download in the portal
Stochastic Integration and Long Term Predictor Estimation under Noisy Conditions for Speech Enhancement
Publication
- M. Kuropatwinski
- W. Kleijn
- M. Kuropatwiński
- Year 2005
Full text available for download from an external service
Automated detection of pronunciation errors in non-native English speech employing deep learning
Publication
- D. Korzekwa
- Year 2023
Despite significant advances in recent years, the existing Computer-Assisted Pronunciation Training (CAPT) methods detect pronunciation errors with a relatively low accuracy (precision of 60% at 40%-80% recall). This Ph.D. work proposes novel deep learning methods for detecting pronunciation errors in non-native (L2) English speech, outperforming the state-of-the-art method in AUC metric (Area under the Curve) by 41%, i.e., from...

Full text available for download in the portal
Bimodal classification of English allophones employing acoustic speech signal and facial motion capture
Publication
- Journal of the Acoustical Society of America - Year 2018
A method for automatic transcription of English speech into International Phonetic Alphabet (IPA) system is developed and studied. The principal objective of the study is to evaluate to what extent the visual data related to lip reading can enhance recognition accuracy of the transcription of English consonantal and vocalic allophones. To this end, motion capture markers were placed on the faces of seven speakers to obtain lip...

Full text available for download from an external service
Hybrid of Neural Networks and Hidden Markov Models as a modern approach to speech recognition systems
Publication
- P. Sokólski
- T. A. Rutkowski
- Pomiary Automatyka Robotyka - Year 2013
The aim of this paper is to present a hybrid algorithm that combines the advantages of artificial neural networks and hidden Markov models in speech recognition for control purposes. The scope of the paper includes a review of currently used solutions, and a description and analysis of the implementation of selected artificial neural network (NN) structures and hidden Markov models (HMM). The main part of the paper consists of a description...

Full text available for download in the portal
Subjective Quality Evaluation of Speech Signals Transmitted via BPL-PLC Wired System
Publication
- P. Falkowski-Gilski
- G. Debita
- M. Habrych
- B. Miedziński
- P. Jedlikowski
- B. Polnik
- J. Wandzio
- X. Wang
- Year 2020
The broadband over power line – power line communication (BPL-PLC) cable is resistant to electricity stoppage and partial damage of phase conductors. It maintains continuity of transmission in case of an emergency. These features make it an ideal solution for delivering data, e.g. in an underground mine environment, especially clear and easily understandable voice messages. This paper describes a subjective quality evaluation of...

Full text available for download from an external service
Difference in Perceived Speech Signal Quality Assessment Among Monolingual and Bilingual Teenage Students
Publication
- P. Falkowski-Gilski
- Year 2021
The user perceived quality is a mixture of factors, including the background of an individual. The process of auditory perception is discussed in a wide variety of fields, ranging from engineering to medicine. Many studies examine the difference between musicians and non-musicians. Since musical training develops musical hearing and other various auditory capabilities, similar enhancements should be observable in case of bilingual...

Full text available for download from an external service
Weakly-Supervised Word-Level Pronunciation Error Detection in Non-Native English Speech
Publication
- D. Korzekwa
- J. Lorenzo-trueba
- T. Drugman
- S. Calamaro
- B. Kostek
- Year 2021
We propose a weakly-supervised model for word-level mispronunciation detection in non-native (L2) English speech. To train this model, phonetically transcribed L2 speech is not required and we only need to mark mispronounced words. The lack of phonetic transcriptions for L2 speech means that the model has to learn only from a weak signal of word-level mispronunciations. Because of that and due to the limited amount of mispronounced...

Full text available for download in the portal
CANCER IMAGING

Journals

ISSN: 1470-7330 , eISSN: 1740-5025
CLINICAL IMAGING

Journals

ISSN: 0899-7071 , eISSN: 1873-4499
Neurovascular Imaging

Journals

eISSN: 2055-5792
Sensing and Imaging

Journals

ISSN: 1557-2064 , eISSN: 1557-2072
Molecular Imaging

Journals

ISSN: 1535-3508 , eISSN: 1536-0121
Forensic Imaging

Journals

ISSN: 2666-2264 , eISSN: 2666-2256
Imaging in Medicine

Journals

ISSN: 1755-5191
ULTRASONIC IMAGING

Journals

ISSN: 0161-7346 , eISSN: 1096-0910
ABDOMINAL IMAGING

Journals

ISSN: 0942-8925
SPEECH COMMUNICATION

Journals

ISSN: 0167-6393 , eISSN: 1872-7182
Akustyczny obraz słowa na tle mowy etnicznej [The acoustic image of ethnic speech words]
Publication
- K. Wojan
- Year 2002
