Wyniki wyszukiwania dla: SPEECH REINFORCEMENT SYSTEMS

Broadband interference in speech reinforcement systems

Publikacja

- Rok 2008

Artykuł podejmuje niedoceniany problem wpływu liczby i rozkładu głośników w systemach nagłośnienia, na jakość przekazu głosowego, czyli na zrozumiałość mowy w audytoriach. Superpozycji przesuniętych w czasie szerokopasmowych sygnałów o tym samym kształcie i lekko różnych wielkościach, które docierają do słuchacza z licznych spójnych źródeł, towarzyszy zjawisko interferencji prowadzące do głębokiej modyfikacji odbieranych sygnałów...

Optimizing Medical Personnel Speech Recognition Models Using Speech Synthesis and Reinforcement Learning

Publikacja

A. Czyżewski

- Journal of the Acoustical Society of America - Rok 2023

Text-to-Speech synthesis (TTS) can be used to generate training data for building Automatic Speech Recognition models (ASR). Access to medical speech data is because it is sensitive data that is difficult to obtain for privacy reasons; TTS can help expand the data set. Speech can be synthesized by mimicking different accents, dialects, and speaking styles that may occur in a medical language. Reinforcement Learning (RL), in the...

Pełny tekst do pobrania w portalu

A Comparison of STI Measured by Direct and Indirect Methods for Interiors Coupled with Sound Reinforcement Systems

Publikacja

- Rok 2018

This paper presents a comparison of STI (Speech Transmission Index) coefficient measurement results carried out by direct and indirect methods. First, acoustic parameters important in the context of public address and sound reinforcement systems are recalled. A measurement methodology is presented that employs various test signals to determine impulse responses. The process of evaluating sound system performance, signals enabling...

Pełny tekst do pobrania w serwisie zewnętrznym

Hybrid of Neural Networks and Hidden Markov Models as a modern approach to speech recognition systems

Publikacja

- Pomiary Automatyka Robotyka - Rok 2013

The aim of this paper is to present a hybrid algorithm that combines the advantages ofartificial neural networks and hidden Markov models in speech recognition for control purpos-es. The scope of the paper includes review of currently used solutions, description and analysis of implementation of selected artificial neural network (NN) structures and hidden Markov mod-els (HMM). The main part of the paper consists of a description...

Pełny tekst do pobrania w portalu

Badanie rozkładów parametrów sygnału mowy w zastosowaniach do prognozowania prawdopodobieństwa popełnienia błędów w systemach identyfikacji mówców = Examining distribution of speech signal parameters for the prognosis of error probability in speaker verification systems

Publikacja

A. Kaczmarek

- Rok 2010

Przedmiotem pracy jest system identyfikacji mówców w sposób zależny od tekstu ("text dependent''). Dokonano analizy wielu różnych wypowiedzi kilkudziesięciu mówców. Zastosowana metoda parametryzacji to metoda oparta na wynikach analizy cepstralnej sygnału mowy. Zdefiniowane zostały nowe parametry skojarzone z elementarnymi zdarzeniami w procesie weryfikacji mówców. Na tej podstawie dokonano estymacji funkcji gęstości prawdopodobieństwa...

PHONEME DISTORTION IN PUBLIC ADDRESS SYSTEMS

Publikacja

- Rok 2015

The quality of voice messages in speech reinforcement and public address systems is often poor. The sound engineering projects of such systems take care of sound intensity and possible reverberation phenomena in public space without, however, considering the influence of acoustic interference related to the number and distribution of loudspeakers. This paper presents the results of measurements and numerical simulations of the...

Distortion of speech signals in the listening area: its mechanism and measurements

Publikacja

- Rok 2014

The paper deals with a problem of the influence of the number and distribution of loudspeakers in speech reinforcement systems on the quality of publicly addressed voice messages, namely on speech intelligibility in the listening area. Linear superposition of time-shifted broadband waves of a same form and slightly different magnitudes that reach a listener from numerous coherent sources, is accompanied by interference effects...

Pełny tekst do pobrania w serwisie zewnętrznym

Modeling and Designing Acoustical Conditions of the Interior – Case Study

Publikacja

- Archives of Acoustics - Rok 2016

The primary aim of this research study was to model acoustic conditions of the Courtyard of the Gdańsk University of Technology Main Building, and then to design a sound reinforcement system for this interior. First, results of measurements of the parameters of the acoustic field are presented. Then, the comparison between measured and predicted values using the ODEON program is shown. Collected data indicate a long reverberation...

Pełny tekst do pobrania w portalu

KORPUS MOWY ANGIELSKIEJ DO CELÓW MULTIMODALNEGO AUTOMATYCZNEGO ROZPOZNAWANIA MOWY

Publikacja

- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Rok 2016

W referacie zaprezentowano audiowizualny korpus mowy zawierający 31 godzin nagrań mowy w języku angielskim. Korpus dedykowany jest do celów automatycznego audiowizualnego rozpoznawania mowy. Korpus zawiera nagrania wideo pochodzące z szybkoklatkowej kamery stereowizyjnej oraz dźwięk zarejestrowany przez matrycę mikrofonową i mikrofon komputera przenośnego. Dzięki uwzględnieniu nagrań zarejestrowanych w warunkach szumowych korpus...

Integracja bezprzewodowych heterogenicznych sieci IP dla poprawy efektywności transmisji danych na morzu

Publikacja

M. Hoeft

- Rok 2023

Wraz ze wzrostem istotności środowiska morskiego w naszym codziennym życiu np. w postaci zwiększonego wolumenu transportu realizowanego drogą morską. czy zintensyfikowanych prac dotyczących obserwacji i monitoringu środowiska morskiego, wzrasta również potrzeba opracowania efektywnych systemów komunikacyjnych dedykowanych dla tego środowiska. Heterogeniczne systemy łączności bezprzewodowej integrowane na poziomie warstwy sieciowej...

Pełny tekst do pobrania w portalu

Koncepcja systemu wspomagania decyzji nawigatora statku opartego na ewolucyjnym planowaniu manewrów antykolizyjnych

Publikacja

R. Szłapczyński

- Logistyka - Rok 2014

Artykuł przedstawia koncepcję systemu wspomagania decyzji nawigatora statku opartego na wątkach badań prowadzonych wcześniej przez autora. System będzie rozszerzał funkcjonalność systemów dotychczasowych o możliwość szczegółowego planowania bezpiecznej trajektorii statku na wodach zamkniętych, z dużą liczbą statków obcych i ograniczeniami toru wodnego. Artykuł zawiera dyskusję możliwych podejść do planowania manewrów, optymalizacji...

Computer-assisted pronunciation training—Speech synthesis is almost all you need

Publikacja

D. Korzekwa
J. Lorenzo-trueba
T. Drugman
B. Kostek

- SPEECH COMMUNICATION - Rok 2022

The research community has long studied computer-assisted pronunciation training (CAPT) methods in non-native speech. Researchers focused on studying various model architectures, such as Bayesian networks and deep learning methods, as well as on the analysis of different representations of the speech signal. Despite significant progress in recent years, existing CAPT methods are not able to detect pronunciation errors with high...

Pełny tekst do pobrania w portalu

Evaluation of Lombard Speech Models in the Context of Speech in Noise Enhancement

Publikacja

G. Korvel
K. Kąkol
O. Kurasova
B. Kostek

- IEEE Access - Rok 2020

The Lombard effect is one of the most well-known effects of noise on speech production. Speech with the Lombard effect is more easily recognizable in noisy environments than normal natural speech. Our previous investigations showed that speech synthesis models might retain Lombard-effect characteristics. In this study, we investigate several speech models, such as harmonic, source-filter, and sinusoidal, applied to Lombard speech...

Pełny tekst do pobrania w portalu

An audio-visual corpus for multimodal automatic speech recognition

Publikacja

- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Rok 2017

review of available audio-visual speech corpora and a description of a new multimodal corpus of English speech recordings is provided. The new corpus containing 31 hours of recordings was created specifically to assist audio-visual speech recognition systems (AVSR) development. The database related to the corpus includes high-resolution, high-framerate stereoscopic video streams from RGB cameras, depth imaging stream utilizing Time-of-Flight...

Pełny tekst do pobrania w portalu

A note on total reinforcement in graphs

Publikacja

M. A. Henning
N. J. Rad
J. Raczek

- DISCRETE APPLIED MATHEMATICS - Rok 2011

In this note we prove a conjecture and inprove some results presendet in a recent paper of N. Sridharan, M.D. Elias, V.S.A. Subramanian, Total reinforcement number of a graph, AKCE Int. J. Graphs Comb. 4 (2) (2007) 197-202.

Pełny tekst do pobrania w portalu

Stan graniczny nośności dźwigara żelbetowego mostu na zginanie według norm PN-EN 1992-2 oraz PN-S-10042:1991

Publikacja

M. Abramski

- Rok 2016

Praca włącza się w bogaty w ostatnich latach w krajowym piśmiennictwie nurt porównań dwóch generacji norm projektowania mostów z betonu: polskiej - wycofanej, aczkolwiek powszechnie stosowanej oraz europejskiej – wciąż jeszcze wdrażanej do praktyki projektowej. Nowością w stosunku do dotychczasowych publikacji polskich jest szersze ujęcie różnic pomiędzy obydwiema generacjami norm. Poza rozpatrywanymi przez wielu autorów różnicami...

Material for Automatic Phonetic Transcription of Speech Recorded in Various Conditions

Publikacja

- Rok 2016

Automatic speech recognition (ASR) is under constant development, especially in cases when speech is casually produced or it is acquired in various environment conditions, or in the presence of background noise. Phonetic transcription is an important step in the process of full speech recognition and is discussed in the presented work as the main focus in this process. ASR is widely implemented in mobile devices technology, but...

Pełny tekst do pobrania w serwisie zewnętrznym

Speech Intelligibility Measurements in Auditorium

Publikacja

K. Leo

- ACTA PHYSICA POLONICA A - Rok 2010

Speech intelligibility was measured in Auditorium Novum on Technical University of Gdansk (seating capacity 408, volume 3300 m3). Articulation tests were conducted; STI and Early Decay Time EDT coefficients were measured. Negative noise contribution to speech intelligibility was taken into account. Subjective measurements and objective tests reveal high speech intelligibility at most seats in auditorium. Correlation was found between...

Pełny tekst do pobrania w portalu

Language Models in Speech Recognition

Publikacja

J. Daciuk

- Rok 2022

This chapter describes language models used in speech recognition, It starts by indicating the role and the place of language models in speech recognition. Mesures used to compare language models follow. An overview of n-gram, syntactic, semantic, and neural models is given. It is accompanied by a list of popular software.

Pełny tekst do pobrania w serwisie zewnętrznym

Detecting Lombard Speech Using Deep Learning Approach

Publikacja

K. Kąkol
G. Korvel
G. Tamulevicius
B. Kostek

- SENSORS - Rok 2023

Robust Lombard speech-in-noise detecting is challenging. This study proposes a strategy to detect Lombard speech using a machine learning approach for applications such as public address systems that work in near real time. The paper starts with the background concerning the Lombard effect. Then, assumptions of the work performed for Lombard speech detection are outlined. The framework proposed combines convolutional neural networks...

Pełny tekst do pobrania w portalu

Filtry

Katalog

Kategoria

Rok

Opcje

Broadband interference in speech reinforcement systems

Optimizing Medical Personnel Speech Recognition Models Using Speech Synthesis and Reinforcement Learning

A Comparison of STI Measured by Direct and Indirect Methods for Interiors Coupled with Sound Reinforcement Systems

Hybrid of Neural Networks and Hidden Markov Models as a modern approach to speech recognition systems

Badanie rozkładów parametrów sygnału mowy w zastosowaniach do prognozowania prawdopodobieństwa popełnienia błędów w systemach identyfikacji mówców = Examining distribution of speech signal parameters for the prognosis of error probability in speaker verification systems

PHONEME DISTORTION IN PUBLIC ADDRESS SYSTEMS

Distortion of speech signals in the listening area: its mechanism and measurements

Modeling and Designing Acoustical Conditions of the Interior – Case Study

KORPUS MOWY ANGIELSKIEJ DO CELÓW MULTIMODALNEGO AUTOMATYCZNEGO ROZPOZNAWANIA MOWY

Integracja bezprzewodowych heterogenicznych sieci IP dla poprawy efektywności transmisji danych na morzu

Koncepcja systemu wspomagania decyzji nawigatora statku opartego na ewolucyjnym planowaniu manewrów antykolizyjnych

Computer-assisted pronunciation training—Speech synthesis is almost all you need

Evaluation of Lombard Speech Models in the Context of Speech in Noise Enhancement

An audio-visual corpus for multimodal automatic speech recognition

A note on total reinforcement in graphs

Stan graniczny nośności dźwigara żelbetowego mostu na zginanie według norm PN-EN 1992-2 oraz PN-S-10042:1991

Material for Automatic Phonetic Transcription of Speech Recorded in Various Conditions

Speech Intelligibility Measurements in Auditorium

Language Models in Speech Recognition

Detecting Lombard Speech Using Deep Learning Approach

Wyszukiwarka

Filtry

Katalog

Kategoria

Rok

Opcje

Wyniki wyszukiwania dla: SPEECH REINFORCEMENT SYSTEMS