Wyniki wyszukiwania dla: SPEECH PROCESSING

BPL-PLC Voice Communication System for the Oil and Mining Industry

Publikacja

G. Debita
P. Falkowski-Gilski
M. Habrych
G. Wiśniewski
B. Miedziński
P. Jedlikowski
A. Waniewska
J. Wandzio
B. Polnik

- ENERGIES - Rok 2020

Application of a high-efficiency voice communication systems based on broadband over power line-power line communication (BPL-PLC) technology in medium voltage networks, including hazardous areas (like the oil and mining industry), as a redundant mean of wired communication (apart from traditional fiber optics and electrical wires) can be beneficial. Due to the possibility of utilizing existing electrical infrastructure, it can...

Pełny tekst do pobrania w portalu

Waveguide model of the hearing aid earmold system

Publikacja

- Rok 2006

Background The earmold system of the Behind-The-Ear hearing aid is an acoustic system that modifies the spectrum of the propagated sound waves. Improper selection of the earmold system may result in deterioration of sound quality and speech intelligibility. Computer modeling methods may be useful in the process of hearing aid fitting, allowing physician to examine various earmold system configurations and choose the optimum one...

Pełny tekst do pobrania w serwisie zewnętrznym

Waveguide model of the hearing aid earmold system

Publikacja

- Diagnostic Pathology - Rok 2006

Background The earmold system of the Behind-The-Ear hearing aid is an acoustic system that modifies the spectrum of the propagated sound waves. Improper selection of the earmold system may result in deterioration of sound quality and speech intelligibility. Computer modeling methods may be useful in the process of hearing aid fitting, allowing physician to examine various earmold system configurations and choose the optimum one...

Pełny tekst do pobrania w portalu

Multimodal learning application with interactive animated character. [Multimodalna aplikacja edukacyjna wykorzystująca interaktywną animowaną postać]

Publikacja

P. Szczuko

- Rok 2006

The aim of this study is to design a computer application that may assist teachers and therapists in multimodal manner in their work with impaired or disabled children. The application can be operated in many different ways, giving to a child with special educational needs a possibility to learn and train many skills or treat speech disorders. The main stress in this research is on the creation of animated character that will serve...

Trzej prorocy: Sołżenicyn, Friedman, Dugin. Część pierwsza: Sołżenicyn

Publikacja

Z. Kaźmierczyk

- Rok 2023

Artykuł przedstawia na tle biograficznym dzieło i myśl profetyczną Aleksandra Sołżenicyna. Podstawą jej analizy jest mowa z okazji przyznania autorowi Oddziału chorych na raka literackiej Nagrody Nobla oraz jego wykład na temat stanu cywilizacji Zachodu wygłoszony na Uniwersytecie Harvarda – zatytułowany Zmierzch odwagi. Proroctwa Sołżenicyna dotyczące Zachodu pokazane są w kontekście jego pracy Jak odbudować Rosję? W artykule...

Pełny tekst do pobrania w serwisie zewnętrznym

Voice command recognition using hybrid genetic algorithm

Publikacja

- TASK Quarterly - Rok 2010

Abstract: Speech recognition is a process of converting the acoustic signal into a set of words, whereas voice command recognition consists in the correct identification of voice commands, usually single words. Voice command recognition systems are widely used in the military, control systems, electronic devices, such as cellular phones, or by people with disabilities (e.g., for controlling a wheelchair or operating a computer...

Pełny tekst do pobrania w portalu

New Applications of Multimodal Human-Computer Interfaces

Publikacja

A. Czyżewski

- Rok 2012

Multimodal computer interfaces and examples of their applications to education software and for the disabled people are presented. The proposed interfaces include the interactive electronic whiteboard based on video image analysis, application for controlling computers with gestures and the audio interface for speech stretching for hearing impaired and stuttering people. Application of the eye-gaze tracking system to awareness...

Improving listeners' experience for movie playback through enhancing dialogue clarity in soundtracks

Publikacja

- DIGITAL SIGNAL PROCESSING - Rok 2016

This paper presents a method for improving users' quality of experience through processing of movie soundtracks. The dialogue clarity enhancement algorithms were introduced for detecting dialogue in movie soundtrack mixes and then for amplifying the dialogue components. The front channel signals (left, right, center) are analyzed in the frequency domain. The selected partials in the center channel signal, which yield high disparity...

Pełny tekst do pobrania w serwisie zewnętrznym

Detection of dialogue in movie soundtrack for speech intelligibility enhancement

Publikacja

K. Łopatka

- Rok 2014

A method for detecting dialogue in 5.1 movie soundtrack based on interchannel spectral disparity is presented. The front channel signals (left, right, center) are analyzed in the frequency domain. The selected partials in the center channel signal, which yield high disparity with left and right channels, are detected as dialogue. Subsequently, the dialogue frequency components are boosted to achieve increased dialogue intelligibility....

Pełny tekst do pobrania w serwisie zewnętrznym

Comparative Study of Self-Organizing Maps vs. Subjective Evaluation of Quality of Allophone Pronunciation for Nonnative English Speakers

Publikacja

- Rok 2017

The purpose of this study was to apply Self-Organizing Maps to differentiate between the correct and the incorrect allophone pronunciations and to compare the results with subjective evaluation. Recordings of a list of target words, containing selected allophones of English plosive consonants, the velar nasal and the lateral consonant, were made twice. First, the target words were read from the list by 9 non-native speakers and...

Comparison of the Ability of Neural Network Model and Humans to Detect a Cloned Voice

Publikacja

- Electronics - Rok 2023

The vulnerability of the speaker identity verification system to attacks using voice cloning was examined. The research project assumed creating a model for verifying the speaker’s identity based on voice biometrics and then testing its resistance to potential attacks using voice cloning. The Deep Speaker Neural Speaker Embedding System was trained, and the Real-Time Voice Cloning system was employed based on the SV2TTS, Tacotron,...

Pełny tekst do pobrania w portalu

Filtry

Katalog

BPL-PLC Voice Communication System for the Oil and Mining Industry

Waveguide model of the hearing aid earmold system

Waveguide model of the hearing aid earmold system

Multimodal learning application with interactive animated character. [Multimodalna aplikacja edukacyjna wykorzystująca interaktywną animowaną postać]

Trzej prorocy: Sołżenicyn, Friedman, Dugin. Część pierwsza: Sołżenicyn

Voice command recognition using hybrid genetic algorithm

New Applications of Multimodal Human-Computer Interfaces

Improving listeners' experience for movie playback through enhancing dialogue clarity in soundtracks

Detection of dialogue in movie soundtrack for speech intelligibility enhancement

Comparative Study of Self-Organizing Maps vs. Subjective Evaluation of Quality of Allophone Pronunciation for Nonnative English Speakers

Comparison of the Ability of Neural Network Model and Humans to Detect a Cloned Voice

Wyszukiwarka

Filtry

Katalog

Wyniki wyszukiwania dla: SPEECH PROCESSING