Search results for: SPEECH TRANSMISSION INDEX

Total: 2218

Showing the top 1000 results

Speech Analytics Based on Machine Learning
Publication
- Year 2019
In this chapter, the process of speech data preparation for machine learning is discussed in detail. Examples of speech analytics methods applied to phonemes and allophones are shown. Further, an approach to automatic phoneme recognition involving optimized parametrization and a classifier belonging to machine learning algorithms is discussed. Feature vectors are built on the basis of descriptors coming from the music information...

Full text available for download from an external service
Detecting Lombard Speech Using Deep Learning Approach
Publication
- K. Kąkol
- G. Korvel
- G. Tamulevicius
- B. Kostek
- SENSORS - Year 2023
Robust Lombard speech-in-noise detecting is challenging. This study proposes a strategy to detect Lombard speech using a machine learning approach for applications such as public address systems that work in near real time. The paper starts with the background concerning the Lombard effect. Then, assumptions of the work performed for Lombard speech detection are outlined. The framework proposed combines convolutional neural networks...

Full text available for download in the portal
Speech synthesis controlled by eye gazing
Publication
- A. Czyżewski
- K. Łopatka
- B. Kunka
- R. Rybacki
- B. Kostek
- Year 2010
A method of communication based on eye gaze controlling is presented. Investigations of using gaze tracking have been carried out in various context applications. The solution proposed in the paper could be referred to as "talking by eyes" providing an innovative approach in the domain of speech synthesis. The application proposed is dedicated to disabled people, especially to persons in a so-called locked-in syndrome who cannot...
Numerical Modelling for Prediction of Compression Index from Soil Index Properties in Jimma town, Ethiopia
Publication
- W. F. Kabeta
- F. Fufa Fekadu
- K. Feyissa Yerosan
- U.Porto Journal of Engineering - Year 2022
In this study, correlations are developed to predict compression index (Cc) from index parameters so that one can be able to model Jimma soils with compression index using simple laboratory tests. Undisturbed and disturbed soil samples from twelve different locations in Jimma town were collected. Laboratory tests like specific gravity, grain size analysis, Atterberg limit, and one-dimensional consolidation test for a total of twenty-four...

Full text available for download in the portal
Time-domain prosodic modifications for text-to-speech synthesizer
Publication
- J. Łopatka
- P. Suchomski
- A. Czyżewski
- Year 2010
An application of prosodic speech processing algorithms to Text-To-Speech synthesis is presented. Prosodic modifications that improve the naturalness of the synthesized signal are discussed. The applied method is based on the TD-PSOLA algorithm. The developed Text-To-Speech Synthesizer is used in applications employing multimodal computer interfaces.
A Method of Real-Time Non-uniform Speech Stretching
Publication
- A. Kupryjanow
- A. Czyżewski
- Year 2012
The developed method of real-time non-uniform speech stretching is presented. The proposed solution is based on the well-known SOLA algorithm (Synchronous Overlap and Add). Non-uniform time-scale modification is achieved by the adjustment of time scaling factor values in accordance with the signal content. Depending on the speech unit (vowels/consonants), instantaneous rate of speech (ROS), and speech signal presence, values of the scaling factor...

Full text available for download from an external service
Justyna Płotka-Wasylka dr hab. inż.

People

Faculty of Chemistry, Department of Analytical Chemistry

Born in Słupsk (24.03.1986). In 2005 she graduated from I Liceum Ogólnokształcące im. Jana II Sobieskiego in Wejherowo and began her studies at the Faculty of Chemistry of Gdańsk University of Technology. After completing them in 2010, she began research work at the same university, obtaining the degree of doctor of chemical sciences in 2014. The subject of her doctoral dissertation, carried out under the supervision of Prof. Marek Biziuk and Dr Calum Morrison (University of...
Topological extraordinary optical transmission
Publication
- K. Baskourelos
- O. Tsilipakos
- T. Stefański
- S. F. Galata
- E. N. Economou
- M. Kafesaki
- K. L. Tsakmakidis
- Physical Review Research - Year 2022
The incumbent technology for bringing light to the nanoscale, the near-field scanning optical microscope, has notoriously small throughput efficiencies of the order of 10^-4 to 10^-5 or less. We report on a broadband, topological, unidirectionally guiding structure, not requiring adiabatic tapering and, in principle, enabling near-perfect (∼100%) optical transmission through an unstructured single arbitrarily subdiffraction slit at...

Full text available for download in the portal
Interpretable Deep Learning Model for the Detection and Reconstruction of Dysarthric Speech
Publication
- D. Korzekwa
- R. Barra-Chicote
- B. Kostek
- T. Drugman
- M. Łajszczak
- Year 2019
We present a novel deep learning model for the detection and reconstruction of dysarthric speech. We train the model with a multi-task learning technique to jointly solve dysarthria detection and speech reconstruction tasks. The model's key feature is a low-dimensional latent space that is meant to encode the properties of dysarthric speech. It is commonly believed that neural networks are black boxes that solve problems but do not...

Full text available for download in the portal
Examining Influence of Distance to Microphone on Accuracy of Speech Recognition
Publication
- Year 2015
The problem of controlling a machine by the distant-talking speaker without a necessity of handheld or body-worn equipment usage is considered. A laboratory setup is introduced for examination of performance of the developed automatic speech recognition system fed by direct and by distant speech acquired by microphones placed at three different distances from the speaker (0.5 m to 1.5 m). For feature extraction from the voice signal...

Full text available for download from an external service
Comparison of various speech time-scale modification methods
Publication
- A. Kupryjanow
- A. Czyżewski
- Archives of Acoustics - Year 2011
The objective of this work is to investigate the influence of the different time-scale modification (TSM) methods on the quality of the speech stretched up using the designed non-uniform real-time speech time-scale modification algorithm (NU-RTSM). The algorithm provides a combination of the typical TSM algorithm with the vowels, consonants, stutter, transients and silence detectors. Based on the information about the content and...
Tensor Decomposition for Imagined Speech Discrimination in EEG
Publication
- J. S. Garcia Salinas
- L. Villaseñor-Pineda
- C. A. Reyes-García
- A. A. Torres-García
- LECTURE NOTES IN COMPUTER SCIENCE - Year 2018
Most research on Electroencephalogram (EEG)-based Brain-Computer Interfaces (BCI) is focused on the use of motor imagery. As an attempt to improve the control of these interfaces, the use of language instead of movement has been recently explored, in the form of imagined speech. This work aims for the discrimination of imagined words in electroencephalogram signals. For this purpose, the analysis of multiple variables...

Full text available for download from an external service
Methods of Improving Speech Intelligibility for Listeners with Hearing Resolution Deficit
Publication
- A. Kupryjanow
- A. Czyżewski
- Diagnostic Pathology - Year 2012
Methods developed for real-time time scale modification (TSM) of speech signal are presented. They are based on the non-uniform, speech-rate-dependent SOLA algorithm (Synchronous Overlap and Add). Influence of the proposed method on the intelligibility of speech was investigated for two separate groups of listeners, i.e. hearing impaired children and elderly listeners. It was shown that for the speech with average rate equal to or...

Full text available for download in the portal
Multimodal English corpus for automatic speech recognition
Publication
- Year 2013
A multimodal corpus developed for research of speech recognition based on audio-visual data is presented. Besides usual video and sound excerpts, the prepared database contains also thermovision images and depth maps. All streams were recorded simultaneously, therefore the corpus enables to examine the importance of the information provided by different modalities. Based on the recordings, it is also possible to develop a speech...
Material for Automatic Phonetic Transcription of Speech Recorded in Various Conditions
Publication
- Year 2016
Automatic speech recognition (ASR) is under constant development, especially in cases when speech is casually produced or it is acquired in various environment conditions, or in the presence of background noise. Phonetic transcription is an important step in the process of full speech recognition and is discussed in the presented work as the main focus in this process. ASR is widely implemented in mobile devices technology, but...

Full text available for download from an external service
Optical transmission of the Niobium thin films
Research Data
open access
- M. Łapiński
Niobium thin films with a thickness of 200 nm were deposited on a Corning glass substrate by the magnetron sputtering method. The optical transmission spectra in the visible light range were recorded. Investigations showed good optical transmission through the layers for each sample, annealed at various temperatures. For measurements samples annealed at 500,...
An audio-visual corpus for multimodal automatic speech recognition
Publication
- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Year 2017
A review of available audio-visual speech corpora and a description of a new multimodal corpus of English speech recordings are provided. The new corpus containing 31 hours of recordings was created specifically to assist audio-visual speech recognition systems (AVSR) development. The database related to the corpus includes high-resolution, high-framerate stereoscopic video streams from RGB cameras, depth imaging stream utilizing Time-of-Flight...

Full text available for download in the portal
The cohomological span of LS-Conley index
Publication
- J. Maksymiuk
- JOURNAL OF DIFFERENTIAL EQUATIONS - Year 2015
In this paper we introduce a new homotopy invariant – the cohomological span of LS-Conley index. We prove the theorems on the existence of critical points for a class of strongly indefinite functionals with the gradient of the form Lx+K(x), where L is bounded linear and K is completely continuous. We give examples of Hamiltonian systems for which our methods give better results than the Morse inequalities. We also give a formula...

Full text available for download in the portal
Mathematical Modelling of Drive System with an Elastic Coupling Based on Formal Analogy between the Transmission Shaft and the Electric Transmission Line
Publication
- A. Popenda
- M. Lis
- M. Nowak
- K. Blecharz
- ENERGIES - Year 2020
In the paper, the kinematic structure of the transmission shaft between the driving motor and the working mechanism is studied. The analysis is based on electrical and mechanical similarities. The equivalent circuits, typical for electrical systems, are defined for the transmission shaft concerned. Modelling of the transmission shaft based on a formal analogy between the transmission shaft and the electric transmission line is...

Full text available for download in the portal
Cooperative Data Transmission in Wireless Vehicular Networks
Publication
- A. Marczak
- Year 2017
The paper presents issues related to the cooperative transmission in wireless vehicular networks. Cooperative transmission involves the use of mobile terminals as relay stations to improve the transmission quality, improve network performance and reduce energy consumption. The paper presents the methods used to implement cooperative transmission and the types of cooperative networks.
Comparison of Acoustic and Visual Voice Activity Detection for Noisy Speech Recognition
Publication
- Year 2016
The problem of accurate differentiating between the speaker utterance and the noise parts in a speech signal is considered. The influence of utilizing a voice activity detection in speech signals on the accuracy of the automatic speech recognition (ASR) system is presented. The examined methods of voice activity detection are based on acoustic and visual modalities. The problem of detecting the voice activity in clean and noisy...
Morse inequalities via Conley index theory
Publication
- M. Izydorek
- M. Styborski
- Year 2011
The relation known as the Morse inequalities can be extended to a more general setting of flows on locally compact metric spaces (Conley index) as well as dynamical systems on Hilbert spaces (LS-index). This paper is a discourse around this extension. Except for the part concerning the LS-index, the material is self-contained and has the character of a survey.
Ranking Speech Features for Their Usage in Singing Emotion Classification
Publication
- S. Zaporowski
- B. Kostek
- Year 2020
This paper aims to retrieve speech descriptors that may be useful for the classification of emotions in singing. For this purpose, Mel Frequency Cepstral Coefficients (MFCC) and selected Low-Level MPEG 7 descriptors were calculated based on the RAVDESS dataset. The database contains recordings of emotional speech and singing of professional actors presenting six different emotions. Employing the algorithm of Feature Selection based...

Full text available for download in the portal
[ITiT] Transmission Channel in Radio Communication Systems
Online Courses
- S. J. Ambroziak
Discipline: technical informatics and telecommunications. Elective course for 2nd-year PhD students. Academic teacher: dr hab. inż. Sławomir Ambroziak, prof. PG. Total hours of training: 15 teaching hours. Course type: lecture.
System Supporting Speech Perception in Special Educational Needs Schoolchildren
Publication
- A. Kupryjanow
- P. Suchomski
- P. Odya
- A. Czyżewski
- Year 2012
The system supporting speech perception during classes is presented in the paper. The system is a combination of a portable device, which enables real-time speech stretching, with a workstation designed to perform hearing tests. The system was designed to help children suffering from Central Auditory Processing Disorders.

Full text available for download from an external service
High quality speech codec employing sines+noise+transients model
Publication
- Archives of Acoustics - Year 2006
A method of high quality wideband speech signal representation employing sines+transients+noise model is presented. The need for a wideband speech coding approach as well as various methods for analysis and synthesis of sines, residual and transient states of speech signal is discussed. The perceptual criterion is applied in the proposed approach during encoding of sines amplitudes in order to reduce bandwidth requirements and...

Full text available for download in the portal
Silence/noise detection for speech and music signals
Publication
- M. Papaj
- Year 2008
This paper introduces a novel off-line algorithm for silence/noise detection in noisy signals. The main concept of the proposed algorithm is to provide noise patterns for further signals processing i.e. noise reduction for speech enhancement. The algorithm is based on frequency domain characteristics of signals. The examples of different types of noisy signals are presented.
Virtual keyboard controlled by eye gaze employing speech synthesis
Publication
- B. Kunka
- R. Rybacki
- K. Łopatka
- A. Czyżewski
- B. Kostek
- Year 2010
The article presents the speech synthesis integrated into the eye gaze tracking system. This approach can significantly improve the quality of life of physically disabled people who are unable to communicate. The virtual keyboard (QWERTY) is an interface which allows for entering the text for the speech synthesizer. First, this article describes a methodology of determining the fixation point on a computer screen. Then it presents...
Virtual Keyboard controlled by eye gaze employing speech synthesis
Publication
- K. Łopatka
- R. Rybacki
- B. Kunka
- A. Czyżewski
- B. Kostek
- Elektronika : konstrukcje, technologie, zastosowania - Year 2011
The article presents the speech synthesis integrated into the eye gaze tracking system. This approach can significantly improve the quality of life of physically disabled people who are unable to communicate. The virtual keyboard (QWERTY) is an interface which allows for entering the text for the speech synthesizer. First, this article describes a methodology of determining the fixation point on a computer screen. Then it presents...

Full text available for download from an external service
Transmission parameters of underwater communication channels
Publication
- H. Lasota
- I. Kochańska
- HYDROACOUSTICS - Year 2011
The underwater environment is tough and demanding as a communication channel for ultrasonic signals. The channel transmission characteristics in marine and inland waters depend much on local bathymetry and changing weather conditions. The architecture and performance of a reliable underwater acoustic communication (UAC) system should allow real-time adaptation of its transmission parameters to a large variety of possible channel...

Full text available for download in the portal
An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics
Publication
- G. Korvel
- O. Kurasova
- B. Kostek
- Year 2019
The speech with the Lombard effect has been extensively studied in the context of speech recognition or speech enhancement. However, few studies have investigated the Lombard effect in the context of speech synthesis. The aim of this paper is to create a mathematical model that allows for retaining the Lombard effect. These models could be used as a basis of a formant speech synthesizer. The proposed models are based on dividing...

Full text available for download in the portal
Corrupted speech intelligibility improvement using adaptive filter based algorithm
Publication
- D. Ellwart
- A. Czyżewski
- Year 2010
A technique for improving the quality of speech signals recorded in strong noise is presented. The proposed algorithm employing adaptive filtration is described and additional possibilities of speech intelligibility improvement are discussed. Results of the tests are presented.
Distortion of speech signals in the listening area: its mechanism and measurements
Publication
- H. Lasota
- R. Mazurek
- I. Kochańska
- Year 2014
The paper deals with a problem of the influence of the number and distribution of loudspeakers in speech reinforcement systems on the quality of publicly addressed voice messages, namely on speech intelligibility in the listening area. Linear superposition of time-shifted broadband waves of the same form and slightly different magnitudes that reach a listener from numerous coherent sources is accompanied by interference effects...

Full text available for download from an external service
Index de Enfermeria

Journals

ISSN: 1132-1296
Journal of Index Investing

Journals

ISSN: 2154-7238 , eISSN: 2374-135X
SYNTHESIZING MEDICAL TERMS – QUALITY AND NATURALNESS OF THE DEEP TEXT-TO-SPEECH ALGORITHM
Publication
- B. Kostek
- B. Szyca
- Journal of the Acoustical Society of America - Year 2023
The main purpose of this study is to develop a deep text-to-speech (TTS) algorithm designated for an embedded system device. First, a critical literature review of state-of-the-art speech synthesis deep models is provided. The algorithm implementation covers both hardware and algorithmic solutions. The algorithm is designed for use with the Raspberry Pi 4 board. 80 synthesized sentences were prepared based on medical and everyday...

Full text available for download in the portal
Measurements of transmission properties of Acoustic Communication Channels
Publication
- I. Kochańska
- H. Lasota
- HYDROACOUSTICS - Year 2012
Tough transmission properties of shallow water acoustic channels (SWAC) highly limit the use of underwater acoustic communication (UAC) systems. An adaptive matching of modulation and signaling schemes to instantaneous channel conditions is needed for reliable data communications. This creates, however, unique challenges for designers when compared to radio transmission systems. When communication system elements are in motion, the...

Full text available for download in the portal
A survey of automatic speech recognition deep models performance for Polish medical terms
Publication
- Year 2023
Among the numerous applications of speech-to-text technology is the support of documentation created by medical personnel. There are many available speech recognition systems for doctors. Their effectiveness in languages such as Polish should be verified. In connection with our project in this field, we decided to check how well the popular speech recognition systems work, employing models trained for the general Polish language....

Full text available for download from an external service
Topological invariants for equivariant flows: Conley index and degree
Publication
- M. Styborski
- Year 2010
About forty years have passed since Charles Conley defined the homotopy index. Thereby, he generalized the ideas that go back to the calculus of variations work of Marston Morse. Within this long time the Conley index has proved to be a valuable tool in nonlinear analysis and dynamical systems. A significant development of applied methods has been observed. Later, the index theory has evolved to cover such areas as discrete dynamical...
A non-uniform real-time speech time-scale stretching method
Publication
- A. Kupryjanow
- A. Czyżewski
- Year 2011
An algorithm for non-uniform real-time speech stretching is presented. It provides a combination of the typical SOLA algorithm (Synchronous Overlap and Add) with the vowels, consonants and silence detectors. Based on the information about the content and the estimated value of the rate of speech (ROS), the algorithm adapts the scaling factor value. The ability of real-time speech stretching and the resultant quality of voice were...
Building Knowledge for the Purpose of Lip Speech Identification
Publication
- Advances in Intelligent Systems and Computing - Year 2017
Consecutive stages of building knowledge for automatic lip speech identification are shown in this study. The main objective is to prepare audio-visual material for phonetic analysis and transcription. First, approximately 260 sentences of natural English were prepared taking into account the frequencies of occurrence of all English phonemes. Five native speakers from different countries read the selected sentences in front of...

Full text available for download from an external service
Emotions in Polish speech recordings
Research Data
open access
- M. Mięsikowska
- D. Świsulski
The data set presents emotions recorded in sound files that are expressions of Polish speech. Statements were made by people aged 21-23, young voices of 5 men. Each person said the following words / nie - no, oddaj - give back, podaj - pass, stop - stop, tak - yes, trzymaj - hold / five times representing a specific emotion - one of three - anger (a),...
A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces
Publication
- G. Tamulevicius
- G. Korvel
- A. B. Yayak
- P. Treigys
- J. Bernataviciene
- B. Kostek
- Electronics - Year 2020
In this research, a study of cross-linguistic speech emotion recognition is performed. For this purpose, emotional data of different languages (English, Lithuanian, German, Spanish, Serbian, and Polish) are collected, resulting in a cross-linguistic speech emotion dataset with the size of more than 10,000 emotional utterances. Despite the bi-modal character of the databases gathered, our focus is on the acoustic representation...

Full text available for download in the portal
Morse cohomology in a Hilbert space via the Conley index
Publication
- M. Starostka
- Journal of Fixed Point Theory and Applications - Year 2015
The main theorem of this paper states that Morse cohomology groups in a Hilbert space are isomorphic to the cohomological Conley index. It is also shown that calculating the cohomological Conley index does not require finite-dimensional approximations of the vector field. Further directions are discussed.

Full text available for download in the portal
Optimalising Human Development Index with sensitivity analysis
Publication
- M. Kuc-Czarnecka
- Year 2019
Research background: Composite indicators are commonly used not only to measure economic development, the standard of living, competitiveness, fairness, effectiveness but are also willingly implemented into many different fields. However, it seems that in most cases the variable weighting procedure is avoided or erroneous since in most cases so-called “weights by belief” are...
Communication Platform for Evaluation of Transmitted Speech Quality
Publication
- A. Ciarkowski
- A. Czyżewski
- Journal of Telecommunications and Information Technology - Year 2011
A voice communication system designed and implemented is described. The purpose of the presented platform was to enable a series of experiments related to the quality assessment of algorithms used in the coding and transmitting of speech. The system is equipped with tools for recording signals at each stage of processing, making it possible to subject them to subjective assessments by listening tests or objective evaluation employing...

Full text available for download in the portal
Transmission Quality Measurements in DAB+ Broadcast System
Publication
- P. Falkowski-Gilski
- J. Stefański
- Metrology and Measurement Systems - Year 2017
In the age of digital media, delivering broadcast content to customers at an acceptable level of quality is one of the most challenging tasks. The most important factor is the efficient use of available resources, including bandwidth. An appropriate way of managing the digital multiplex is essential for both the economic and technical issues. In this paper we describe transmission quality measurements in the DAB+ broadcast system....

Full text available for download in the portal
Application of OFDM technique to underwater acoustic data transmission
Publication
- I. Kochańska
- H. Lasota
- HYDROACOUSTICS - Year 2011
Performances of underwater acoustic communication (UAC) digital systems are strongly related to specific transmission properties of the underwater channel. Depending on the characteristics of the channel, an architecture and modulation techniques are usually implemented that are known as reliable solutions for data transmission in difficult radio channels. The OFDM technique seems to be the most promising nowadays. The parameters...

Full text available for download in the portal
Pitch estimation of narrowband-filtered speech signal using instantaneous complex frequency
Publication
- T. Bandurski
- Ł. Hamerski
- M. Papaj
- A. Paruzel
- K. Świder
- Year 2007
In this paper we propose a novel method of pitch estimation, based on instantaneous complex frequency (ICF). A new iterative algorithm for analysis of ICF of speech signal is presented. Obtained results are compared with commonly used methods to prove its accuracy and connection between ICF and pitch, particularly for narrowband-filtered speech signal.
Pitch estimation of narrowband-filtered speech signal using instantaneous complex frequency
Publication
- Elektronika : konstrukcje, technologie, zastosowania - Year 2008
In this paper we propose a novel method of pitch estimation, based on instantaneous complex frequency (ICF). A new iterative algorithm for analysis of ICF of speech signal is presented. Obtained results are compared with commonly used methods to prove its accuracy and connection between ICF and pitch, particularly for narrowband-filtered speech signal.
