Filtry
wszystkich: 2257
wyświetlamy 1000 najlepszych wyników Pomoc
Wyniki wyszukiwania dla: SPEECH TRANSMISSION INDEX
-
Study on Speech Transmission under Varying QoS Parameters in a OFDM Communication System
PublikacjaAlthough there has been an outbreak of multiple multimedia platforms worldwide, speech communication is still the most essential and important type of service. With the spoken word we can exchange ideas, provide descriptive information, as well as aid to another person. As the amount of available bandwidth continues to shrink, researchers focus on novel types of transmission, based most often on multi-valued modulations, multiple...
-
Quality Evaluation of Speech Transmission via Two-way BPL-PLC Voice Communication System in an Underground Mine
PublikacjaIn order to design a stable and reliable voice communication system, it is essential to know how many resources are necessary for conveying quality content. These parameters may include objective quality of service (QoS) metrics, such as: available bandwidth, bit error rate (BER), delay, latency as well as subjective quality of experience (QoE) related to user expectations. QoE is expressed as clarity of speech and the ability...
-
Acoustic Sensing Analytics Applied to Speech in Reverberation Conditions
PublikacjaThe paper aims to discuss a case study of sensing analytics and technology in acoustics when applied to reverberation conditions. Reverberation is one of the issues that makes speech in indoor spaces challenging to understand. This problem is particularly critical in large spaces with few absorbing or diffusing surfaces. One of the natural remedies to improve speech intelligibility in such conditions may be achieved through speaking...
-
A Novel Method for Intelligibility Assessment of Nonlinearly Processed Speech in Spaces Characterized by Long Reverberation Times
PublikacjaObjective assessment of speech intelligibility is a complex task that requires taking into account a number of factors such as different perception of each speech sub-bands by the human hearing sense or different physical properties of each frequency band of a speech signal. Currently, the state-of-the-art method used for assessing the quality of speech transmission is the speech transmission index (STI). It is a standardized way...
-
A Comparison of STI Measured by Direct and Indirect Methods for Interiors Coupled with Sound Reinforcement Systems
PublikacjaThis paper presents a comparison of STI (Speech Transmission Index) coefficient measurement results carried out by direct and indirect methods. First, acoustic parameters important in the context of public address and sound reinforcement systems are recalled. A measurement methodology is presented that employs various test signals to determine impulse responses. The process of evaluating sound system performance, signals enabling...
-
Results of tests on speech intelligibility in reverberant conditions
Dane BadawczeThe dataset contains the results of tests that aimed to provide a relationship between the rate of speech (RoS) and reverberation conditions characterized by the Speech Transmission Index (STI).
-
Improving Objective Speech Quality Indicators in Noise Conditions
PublikacjaThis work aims at modifying speech signal samples and test them with objective speech quality indicators after mixing the original signals with noise or with an interfering signal. Modifications that are applied to the signal are related to the Lombard speech characteristics, i.e., pitch shifting, utterance duration changes, vocal tract scaling, manipulation of formants. A set of words and sentences in Polish, recorded in silence,...
-
Speech codec enhancements utilizing time compression and perceptual coding
PublikacjaA method for encoding wideband speech signal employing standardized narrowband speech codecs is presented as well as experimental results concerning detection of tonal spectral components. The speech signal sampled with a higher sampling rate than it is suitable for narrowband coding algorithm is compressed in order to decrease the amount of samples. Next, the time-compressed representation of a signal is encoded using a narrowband...
-
Applying the Lombard Effect to Speech-in-Noise Communication
PublikacjaThis study explored how the Lombard effect, a natural or artificial increase in speech loudness in noisy environments, can improve speech-in-noise communication. This study consisted of several experiments that measured the impact of different types of noise on synthesizing the Lombard effect. The main steps were as follows: first, a dataset of speech samples with and without the Lombard effect was collected in a controlled setting;...
-
Investigating Noise Interference on Speech Towards Applying the Lombard Effect Automatically
PublikacjaThe aim of this study is two-fold. First, we perform a series of experiments to examine the interference of different noises on speech processing. For that purpose, we concentrate on the Lombard effect, an involuntary tendency to raise speech level in the presence of background noise. Then, we apply this knowledge to detecting speech with the Lombard effect. This is for preparing a dataset for training a machine learning-based...
-
Adaptacja akustyczna pomieszczenia wykładowego - studium przypadku
PublikacjaW niniejszej pracy przedstawiono analizę rozkładu pola akustycznego sali wykładowej znajdującej się w budynku Wydziału Elektroniki i Telekomunikacji Politechniki Gdańskiej. Badania przeprowadzono metodą pomiarową oraz symulacyjną z wykorzystaniem programu Odeon. Wybór parametrów oceny akustyki wnętrz sugerowany jest wymaganiami stawianymi pomieszczeniom lekcyjnym z zaznaczeniem multimedialnego charakteru wykładów prowadzonych...
-
Secured wired BPL voice transmission system
PublikacjaDesigning a secured voice transmission system is not a trivial task. Wired media, thanks to their reliability and resistance to mechanical damage, seem an ideal solution. The BPL (Broadband over Power Line) cable is resistant to electricity stoppage and partial damage of phase conductors, ensuring continuity of transmission in case of an emergency. It seems an appropriate tool for delivering critical data, mostly clear and understandable...
-
Analysis of Lombard speech using parameterization and the objective quality indicators in noise conditions
PublikacjaThe aim of the work is to analyze Lombard speech effect in recordings and then modify the speech signal in order to obtain an increase in the improvement of objective speech quality indicators after mixing the useful signal with noise or with an interfering signal. The modifications made to the signal are based on the characteristics of the Lombard speech, and in particular on the effect of increasing the fundamental frequency...
-
Subjective Quality Evaluation of Speech Signals Transmitted via BPL-PLC Wired System
PublikacjaThe broadband over power line – power line communication (BPL-PLC) cable is resistant to electricity stoppage and partial damage of phase conductors. It maintains continuity of transmission in case of an emergency. These features make it an ideal solution for delivering data, e.g. in an underground mine environment, especially clear and easily understandable voice messages. This paper describes a subjective quality evaluation of...
-
POPRAWA OBIEKTYWNYCH WSKAŹNIKÓW JAKOŚCI MOWY W WARUNKACH HAŁASU
PublikacjaCelem pracy jest modyfikacja sygnału mowy, aby uzyskać zwiększenie poprawy obiektywnych wskaźników jakości mowy po zmiksowaniu sygnału użytecznego z szumem bądź z sygnałem zakłócającym. Wykonane modyfikacje sygnału bazują na cechach mowy lombardzkiej, a w szczególności na efekcie podniesienia częstotliwości podstawowej F0. Sesja nagraniowa obejmowała zestawy słów i zdań w języku polskim, nagrane w warunkach ciszy, jak również w...
-
Study Analysis of Transmission Efficiency in DAB+ Broadcasting System
PublikacjaDAB+ is a very innovative and universal multimedia broadcasting system. Thanks to its updated multimedia technologies and metadata options, digital radio keeps pace with changing consumer expectations and the impact of media convergence. Broadcasting analog and digital radio services does vary, concerning devices on both transmitting and receiving side, as well as content processing mechanisms. However, the biggest difference is...
-
Spectral reflectance and transmission modeling of multi-cavity Fabry-Pérot interferometer with ZnO thin films
PublikacjaIn this paper spectral reflectance and transmission of a low-coherence fiber-optic Fabry-Pérot interferometer with thin ZnO layers is analyzed using a multi-cavity approach. In the investigated setup two standard single-mode optical fibers (SMF-28) with thin ZnO films deposited on their end-faces form an extrinsic Fabry-Pérot interferometer with air cavity. Calculations of the spectral response of the interferometer were performed...
-
Reliability assessment of an OVH HV power line truss transmission tower subjected to seismic loading
PublikacjaThe study focuses on the reliability of a transmission tower OS24 ON150 + 10, an element of an OVH HV power line, under seismic loading. In order to describe the seismic force, the real-life recording of the horizontal component of the El Centro earthquake was adopted. The amplitude and the period of this excitation are assumed random, their variation is described by Weibull distribution. The possible space state of the phenomenon...
-
Studies on optical transmittance of boron-doped nanocrystalline diamond films
PublikacjaThickness is one of the most important parameters in many applications using thin layers. This article describes thickness determination of a boron-doped nanocrystalline diamond (NCD) grown on fused silica glass. A spectroscopic measurement system has been used. A high refractive index (2.3 at 550nm) was achieved for NCD films. The thickness of NCD samples has been determined from the transmission spectrum.
-
Combined Long-Period Fiber Grating and Microcavity In-Line Mach–Zehnder Interferometer for Refractive Index Measurements with Limited Cross-Sensitivity
PublikacjaThis work discusses sensing properties of a long-period grating (LPG) and microcavity in-line Mach–Zehnder interferometer (µIMZI) when both are induced in the same single-mode optical fiber. LPGs were either etched or nanocoated with aluminum oxide (Al2O3) to increase its refractive index (RI) sensitivity up to ≈2000 and 9000 nm/RIU, respectively. The µIMZI was machined using a femtosecond laser as a cylindrical cavity (d = 60...
-
Audio Content and Crowdsourcing: A Subjective Quality Evaluation of Radio Programs Streamed Online
PublikacjaRadio broadcasting has been present in our lives for over 100 years. The transmission of speech and music signals accompanies us from an early age. Broadcasts provide the latest information from home and abroad. They also shape musical tastes and allow many artists to share their creativity. Modern distribution involves transmission over a number of terrestrial systems. The most popular are analog FM (Frequency Modulation) and...
-
Computer support of analysis optical spectra measurements
PublikacjaVerification of measurement errors has a big impact on assessment of accuracy of conducted measurements and obtained results. In many cases computer simulation results are compared with measurement results in order to evaluate measurement errors. The purpose of our research was to check the accuracy of measurements made with Fabry-Perot interferometer working in the transmission mode. In measurement setup, a 1310 nm superluminescent...
-
Computer Support of Analysis of Optical Spectra Measurements
PublikacjaThe verification of measurement errors has a big impact on the assessment of the accuracy of conducted measurements and obtained results. In many cases, computer simulation results are compared with measurement results in order to evaluate measurement errors. The purpose of our research was to check the accuracy of measurements made with a Fabry–Perot interferometer working in the transmission mode. In the measurement setup, a...
-
The Progress in Electron Microscopy Studies of Particulate Matters to Be Used as a Standard Monitoring Method for Air Dust Pollution
PublikacjaThe present article reviews studies on air solid particles carried out with the use of electron microscopy. Particle analysis combining scanning and transmission electron microscopy (SEM and TEM) can be used to derive size-resolved information of the composition, mixing state, morphology, and complex refractive index of atmospheric aerosol particles. It seems that electron microscopy is more widely used in atmospheric particulate...
-
Enhancement of fiber-optic low-coherence Fabry-Pérot interferometer with ZnO ALD films
PublikacjaIn this paper investigation of the enhanced fiber-optic low coherence Fabry-Pérot interferometer with zinc oxide (ZnO) film deposited by atomic layer deposition (ALD) was presented. Model of the interferometer, which was constructed of single-mode optical fiber with applied ZnO ALD films, was built. The interferometer was also examined by means of experiment. Measurements were performed for both reflective and transmission modes,...
-
Application of the Fractional Fourier Transform for dispersion compensation in signals from a fiber-based Fabry-Perot interferometer
PublikacjaOptical methods of measurement do not require contact of a probe and the object under study, and thus have found use in a broad range of applications such as nondestructive testing (NDT), where noninvasive measurement is crucial. Measuring the refractive index of a material can give a valuable insight into its composition. Low‑coherence radiation sources enable measurement of the sample’s properties across a wide spectrum, while...
-
Thermal dewetting as a method of surface modification of the gold thin films for surface plasmon resonance based sensor applications
PublikacjaHere, we report a quick and simple approach with low, optimized production costs to obtain surface plasmon resonance (SPR) based sensors fabricated through a time- and resource-effective method based on thermal dewetting of thin Au films. From the applicative point of view, the method of detection presented here should be easier to implement, since light transmission measurements seem to be much less challenging than light refractive...
-
Computational analysis of power-law fluids for convective heat transfer in permeable enclosures using Darcy effects
PublikacjaNatural convection is a complex environmental phenomenon that typically occurs in engineering settings in porous structures. Shear thinning or shear thickening fuids are characteristics of power-law fuids, which are non-Newtonian in nature and fnd wide-ranging uses in various industrial processes. Non-Newtonian fuid fow in porous media is a difcult problem with important consequences for energy systems and heat transfer. In this...
-
Ellipsometric investigation of nitrogen doped diamond thin films grown in microwave CH4/H2/N2 plasma enhanced chemical vapor deposition
PublikacjaThe influence of N2 concentration (1%–8%) in CH4/H2/N2 plasma on structure and optical properties of nitrogen doped diamond (NDD) films was investigated. Thickness, roughness, and optical properties of the NDD films in the VIS–NIR range were investigated on the silicon substrates using spectroscopic ellipsometry. The samples exhibited relatively high refractive index (2.6 6 0.25 at 550 nm) and extinction coefficient (0.05 6 0.02...
-
Orken Mamyrbayev Professor
Osoby1. Education: Higher. In 2001, graduated from the Abay Almaty State University (now Abay Kazakh National Pedagogical University), in the specialty: Computer science and computerization manager. 2. Academic degree: Ph.D. in the specialty "6D070300-Information systems". The dissertation was defended in 2014 on the topic: "Kazakh soileulerin tanudyn kupmodaldy zhuyesin kuru". Under my supervision, 16 masters, 1 dissertation...
-
Digital Transformation of Terrestrial Radio: An Analysis of Simulcasted Broadcasts in FM and DAB+ for a Smart and Successful Switchover
PublikacjaThe process of digitizing radio is far from over. It is an important interdisciplinary aspect, involving Big Data and AI (Artificial Intelligence) when it comes to classifying and handling content, and an organizational challenge in the Industry 4.0 concept. There exist several methods for delivering audio signals, including terrestrial broadcasting and internet streaming. Among them, the DAB+ (Digital Audio Broadcasting plus)...
-
Cellulose Nanofibers Isolated from the Cuscuta Reflexa Plant as a Green Reinforcement of Natural Rubber
PublikacjaIn the present work, we used the steam explosion method for the isolation of cellulose nanofiber (CNF) from Cuscuta reflexa, a parasitic plant commonly seen in Kerala and we evaluated its reinforcing efficiency in natural rubber (NR). Fourier Transform Infrared Spectroscopy (FTIR), X-Ray Diffraction (XRD), Scanning Electron Microscopy (SEM), Transmission Electron Microscopy (TEM), and Thermogravimetric analysis (TGA) techniques...
-
Improved surface coverage of an optical fibre with nanocrystalline diamond by the application of dip-coating seeding
PublikacjaGrowth processes of diamond thin films on the fused silica optical fibres (10 cm in length) were investigated at various temperatures. Fused silica pre-treatment by dip-coating in a dispersion consisting of detonation nanodiamond (DND) in dimethyl sulfoxide (DMSO) with polyvinyl alcohol (PVA) was applied. Nanocrystalline diamond (NCD) films were deposited on the fibres using the microwave plasma assisted chemical vapour deposition...
-
Computer-assisted pronunciation training—Speech synthesis is almost all you need
PublikacjaThe research community has long studied computer-assisted pronunciation training (CAPT) methods in non-native speech. Researchers focused on studying various model architectures, such as Bayesian networks and deep learning methods, as well as on the analysis of different representations of the speech signal. Despite significant progress in recent years, existing CAPT methods are not able to detect pronunciation errors with high...
-
BPL-PLC Voice Communication System for the Oil and Mining Industry
PublikacjaApplication of a high-efficiency voice communication systems based on broadband over power line-power line communication (BPL-PLC) technology in medium voltage networks, including hazardous areas (like the oil and mining industry), as a redundant mean of wired communication (apart from traditional fiber optics and electrical wires) can be beneficial. Due to the possibility of utilizing existing electrical infrastructure, it can...
-
Evaluation of Lombard Speech Models in the Context of Speech in Noise Enhancement
PublikacjaThe Lombard effect is one of the most well-known effects of noise on speech production. Speech with the Lombard effect is more easily recognizable in noisy environments than normal natural speech. Our previous investigations showed that speech synthesis models might retain Lombard-effect characteristics. In this study, we investigate several speech models, such as harmonic, source-filter, and sinusoidal, applied to Lombard speech...
-
Optimizing Medical Personnel Speech Recognition Models Using Speech Synthesis and Reinforcement Learning
PublikacjaText-to-Speech synthesis (TTS) can be used to generate training data for building Automatic Speech Recognition models (ASR). Access to medical speech data is because it is sensitive data that is difficult to obtain for privacy reasons; TTS can help expand the data set. Speech can be synthesized by mimicking different accents, dialects, and speaking styles that may occur in a medical language. Reinforcement Learning (RL), in the...
-
Chlorine-free extraction and structural characterization of cellulose nanofibers from waste husk of millet (Pennisetum glaucum)
PublikacjaThis study aims to extract cellulose nanofibers (CNFs) from a sustainable source, i.e. millet husk, which is an agro-waste worthy of consideration. Pre-treatments such as mercerisation, steam explosion, and peroxide bleaching (chlorine-free) were applied for the removal of non-cellulosic components. The bleached millet husk pulp was subjected to acid hydrolysis (5% oxalic acid) followed by homogenization to extract CNFs. The extracted...
-
Speech Intelligibility Measurements in Auditorium
PublikacjaSpeech intelligibility was measured in Auditorium Novum on Technical University of Gdansk (seating capacity 408, volume 3300 m3). Articulation tests were conducted; STI and Early Decay Time EDT coefficients were measured. Negative noise contribution to speech intelligibility was taken into account. Subjective measurements and objective tests reveal high speech intelligibility at most seats in auditorium. Correlation was found between...
-
Language Models in Speech Recognition
PublikacjaThis chapter describes language models used in speech recognition, It starts by indicating the role and the place of language models in speech recognition. Mesures used to compare language models follow. An overview of n-gram, syntactic, semantic, and neural models is given. It is accompanied by a list of popular software.
-
Transient detection for speech coding applications
PublikacjaSignal quality in speech codecs may be improved by selecting transients from speech signal and encoding them using a suitable method. This paper presents an algorithm for transient detection in speech signal. This algorithm operates in several frequency bands. Transient detection functions are calculated from energy measured in short frames of the signal. The final selection of transient frames is based on results of detection...
-
Improving the quality of speech in the conditions of noise and interference
PublikacjaThe aim of the work is to present a method of intelligent modification of the speech signal with speech features expressed in noise, based on the Lombard effect. The recordings utilized sets of words and sentences as well as disturbing signals, i.e., pink noise and the so-called babble speech. Noise signal, calibrated to various levels at the speaker's ears, was played over two loudspeakers located 2 m away from the speaker. In...
-
INDEX ON CENSORSHIP
Czasopisma -
Index Comunicacion
Czasopisma -
Constructing a Dataset of Speech Recordingswith Lombard Effect
PublikacjaThepurpose of therecordings was to create a speech corpus based on the ISLEdataset, extended with video and Lombard speech. Selected from a set of 165sentences, 10, evaluatedas having thehighest possibility to occur in the context ofthe Lombard effect,were repeated in the presence of the so-called babble speech to obtain Lombard speech features. Altogether,15speakers were recorded, and speech parameterswere...
-
Improved method for real-time speech stretching
Publikacjan algorithm for real-time speech stretching is presented. It was designed to modify input signal dependently on its content and on its relation with the historical input data. The proposed algorithm is a combination of speech signal analysis algorithms, i.e. voice, vowels/consonants, stuttering detection and SOLA (Synchronous-Overlap-and-Add) based speech stretching algorithm. This approach enables stretching input speech signal...
-
Methodology and technology for the polymodal allophonic speech transcription
PublikacjaA method for automatic audiovisual transcription of speech employing: acoustic and visual speech representations is developed. It adopts a combining of audio and visual modalities, which provide a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e. the changes in the articulatory setting of speech organs for...
-
Methodology and technology for the polymodal allophonic speech transcription
PublikacjaA method for automatic audiovisual transcription of speech employing: acoustic, electromagnetical articulography and visual speech representations is developed. It adopts a combining of audio and visual modalities, which provide a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e., the changes in the articulatory...
-
Real-time speech-rate modification experiments
PublikacjaAn algorithm designed for real-time speech time scale modification (stretching) is proposed, providing a combination of typical synchronous overlap and add based time scale modification algorithm and signal redundancy detection algorithms that allow to remove parts of the speech signal and replace them with the stretched speech signal fragments. Effectiveness of signal processing algorithms are examined experimentally together...
-
E-cohomological Conley index
PublikacjaIn this thesis we continue with developing the E-cohomological Conley index which was introduced by A.Abbondandolo. In particular, we generalize the index to non-gradient flows, we show that it an possesses additional multiplicative structure and we prove the continuation principle. Then, using continuation principle, we show how the computation of the E-cohomological Conley index can be reduced to the computation of the classical...