Wyniki wyszukiwania dla: voice detection

Voice Maps - portable, dedicated GIS for supporting street navigtion and self-dependent movement of the blind

Publikacja

- Zeszyty Naukowe Wydziału ETI Politechniki Gdańskiej. Technologie Informacyjne - Rok 2010

The concept and the prototype application of the system supporting the street navigation and independent, outdoor movement of the blind is presented. The system utilises the GIS database of geometric network of the pedestrian paths in the city and is capable of finding the route from the indicated source to destination. Subsequently, the system supports the movement of the blind along the found route. The information on the user's...

Cross-Lingual Knowledge Distillation via Flow-Based Voice Conversion for Robust Polyglot Text-to-Speech

Publikacja

D. Piotrowski
R. Korzeniowski
A. Falai
S. Cygert
K. Pokora
G. Tinchev
Z. Zhang
K. Yanagisawa

- Rok 2023

In this work, we introduce a framework for cross-lingual speech synthesis, which involves an upstream Voice Conversion (VC) model and a downstream Text-To-Speech (TTS) model. The proposed framework consists of 4 stages. In the first two stages, we use a VC model to convert utterances in the target locale to the voice of the target speaker. In the third stage, the converted data is combined with the linguistic features and durations...

Pełny tekst do pobrania w serwisie zewnętrznym

502 - Diagnosis of dementia and post-diagnostic support – voice of people with dementia living in Poland

Publikacja

M. Maćkowiak
M. Ciułkowicz
M. Duda-Sikuła
D. Szcześniak
J. Rymaszewska

- International Psychogeriatrics - Rok 2021

Pełny tekst do pobrania w serwisie zewnętrznym

Detection and localization of selected acoustic events in acoustic field for smart surveillance applications

Publikacja

- MULTIMEDIA TOOLS AND APPLICATIONS - Rok 2014

A method for automatic determination of position of chosen sound events such as speech signals and impulse sounds in 3-dimensional space is presented. The evens are localized in the presence of sound reflections employing acoustic vector sensors. Human voice and impulsive sounds are detected using adaptive detectors based on modified peak-valley difference (PVD) parameter and sound pressure level. Localization based on signals...

Pełny tekst do pobrania w portalu

Detection and localization of selected acoustic events in 3D acoustic field for smart surveillance applications

Publikacja

- Communications in Computer and Information Science - Rok 2011

A method for automatic determination of position of chosen sound events such as speech signals and impulse sounds in 3-dimensional space is presented. The events are localized in the presence of sound reflections employing acoustic vector sensors. Human voice and impulsive sounds are detected using adaptive detectors based on modified peak-valley difference (PVD) parameter and sound pressure level. Localization based on signals...

Pełny tekst do pobrania w serwisie zewnętrznym

Quality Evaluation of Speech Transmission via Two-way BPL-PLC Voice Communication System in an Underground Mine

Publikacja

P. Falkowski-Gilski
G. Debita

- Archives of Acoustics - Rok 2023

In order to design a stable and reliable voice communication system, it is essential to know how many resources are necessary for conveying quality content. These parameters may include objective quality of service (QoS) metrics, such as: available bandwidth, bit error rate (BER), delay, latency as well as subjective quality of experience (QoE) related to user expectations. QoE is expressed as clarity of speech and the ability...

Pełny tekst do pobrania w portalu

Enhanced voice user interface employing spatial filtration of signals from acoustic vector sensor

Publikacja

- Rok 2015

Spatial filtration of sound is introduced to enhance speech recognition accuracy in noisy conditions. An acoustic vector sensor (AVS) is employed. The signals from the AVS probe are processed in order to attenuate the surrounding noise. As a result the signal to noise ratio is increased. An experiment is featured in which speech signals are disturbed by babble noise. The signals before and after spatial filtration are processed...

Pełny tekst do pobrania w serwisie zewnętrznym

Improved method for real-time speech stretching

Publikacja

- Rok 2012

n algorithm for real-time speech stretching is presented. It was designed to modify input signal dependently on its content and on its relation with the historical input data. The proposed algorithm is a combination of speech signal analysis algorithms, i.e. voice, vowels/consonants, stuttering detection and SOLA (Synchronous-Overlap-and-Add) based speech stretching algorithm. This approach enables stretching input speech signal...

Pełny tekst do pobrania w serwisie zewnętrznym

Biofilm Growth Causes Damage to Silicone Voice Prostheses in Patients after Surgical Treatment of Locally Advanced Laryngeal Cancer

Publikacja

J. Spałek
P. Deptuła
M. Cieśluk
A. Strzelecka
D. Łysik
J. Mystkowska
T. Daniluk
G. Król
S. Góźdź
R. Bucki... i 2 innych

- Pathogens - Rok 2020

Pełny tekst do pobrania w serwisie zewnętrznym

Playback Attack Detection: The Search for the Ultimate Set of Antispoof Features

Publikacja

M. Smiatacz

- Rok 2017

Automatic speaker verification systems are vulnerable to several kinds of spoofing attacks. Some of them can be quite simple – for example, the playback of an eavesdropped recording does not require any specialized equipment nor knowledge, but still may pose a serious threat for a biometric identification module built into an e-banking application. In this paper we follow the recent approach and convert recordings to images, assuming...

Pełny tekst do pobrania w serwisie zewnętrznym

Rediscovering Automatic Detection of Stuttering and Its Subclasses through Machine Learning—The Impact of Changing Deep Model Architecture and Amount of Data in the Training Set

Publikacja

P. Filipowicz
B. Kostek

- Applied Sciences-Basel - Rok 2023

This work deals with automatically detecting stuttering and its subclasses. An effective classification of stuttering along with its subclasses could find wide application in determining the severity of stuttering by speech therapists, preliminary patient diagnosis, and enabling communication with the previously mentioned voice assistants. The first part of this work provides an overview of examples of classical and deep learning...

Pełny tekst do pobrania w portalu

A non-uniform real-time speech time-scale stretching method

Publikacja

- Rok 2011

An algorithm for non-uniform real-time speech stretching is presented. It provides a combination of typical SOLA algorithm (Synchronous Overlap and Add ) with the vowels, consonants and silence detectors. Based on the information about the content and the estimated value of the rate of speech (ROS), the algorithm adapts the scaling factor value. The ability of real-time speech stretching and the resultant quality of voice were...

Automatic singing quality recognition employing artificial neural networks

Publikacja

P. Żwan

- Archives of Acoustics - Rok 2008

Celem artykułu jest udowodnienie możliwości automatycznej oceny jakości technicznej głosów śpiewaczych. Pokrótce zaprezentowano w nim stworzoną bazę danych głosów śpiewaczych oraz zaimplementowane parametry. Przy pomocy sztucznych sieci neuronowych zaprojektowano system decyzyjny, który oceniono w pięciostopniowej skali jakość techniczną głosu. Przy pomocy metod statystycznych udowodniono, że wyniki generowane przez ten system...

Pełny tekst do pobrania w portalu

Creating new voices using normalizing flows

Publikacja

P. Biliński
T. Merritt
A. Ezzerg
K. Pokora
S. Cygert
K. Yanagisawa
R. Barra-Chicote
D. Korzekwa

- Rok 2022

Creating realistic and natural-sounding synthetic speech remains a big challenge for voice identities unseen during training. As there is growing interest in synthesizing voices of new speakers, here we investigate the ability of normalizing flows in text-to-speech (TTS) and voice conversion (VC) modes to extrapolate from speakers observed during training to create unseen speaker identities. Firstly, we create an approach for TTS...

Pełny tekst do pobrania w portalu

Communication Platform for Evaluation of Transmitted Speech Quality

Publikacja

- Journal of Telecommunications and Information Technology - Rok 2011

A voice communication system designed and implemented is described. The purpose of the presented platform was to enable a series of experiments related to the quality assessment of algorithms used in the coding and transmitting of speech. The system is equipped with tools for recording signals at each stage of processing, making it possible to subject them to subjective assessments by listening tests or, objective evaluation employing...

Pełny tekst do pobrania w portalu

Wearable system supporting navigation of the blind

Publikacja

- RED. ZAGR. ANGIELSKI - Rok 2011

Improving blind people comfort of life is a problem ofgreat importance. Fortunately, new technolgies provide us withadditional methods to improve everyday life of the blind and visuallyimpaired. The paper presents experimental system made byresearchers from Department of Geoinformatics of Gdansk Universityof Technology, which is capable of finding the route from theindicated source to chosen destination, using dedicated digital...

Pełny tekst do pobrania w serwisie zewnętrznym

A system for singing training

Publikacja

- Rok 2007

The system proposed is aimed at the vocal students and persons who want to improve emission of their voices. The goal is not to substituite a singing teacher but to provide a tool for automatic teaching of voice emission basics. In this way singers can develop their vocal skills and improve them. By a visual feedback a student can control and modify vocal tract maximas (resonances) of a chosen vowel to match the resonances of the...

PHONEME DISTORTION IN PUBLIC ADDRESS SYSTEMS

Publikacja

- Rok 2015

The quality of voice messages in speech reinforcement and public address systems is often poor. The sound engineering projects of such systems take care of sound intensity and possible reverberation phenomena in public space without, however, considering the influence of acoustic interference related to the number and distribution of loudspeakers. This paper presents the results of measurements and numerical simulations of the...

Client-server Approach in the Navigation System for the Blind

Publikacja

- TransNav - The International Journal on Marine Navigation and Safety of Sea Transportation - Rok 2013

The article presents the client‐server approach in the navigation system for the blind ‐ “Voice Maps”. The authors were among the main creators of the prototype and currently the commercialization phase is being finished. In the implemented prototype only exemplary, limited spatial data were used, therefore they could be stored and analysed (for path-finding process) in the mobile device’s memory without any difficulties. The...

Pełny tekst do pobrania w portalu

Subjective and Objective Quality Evaluation Study of BPL -PLC Wired Medium

Publikacja

G. Debita
P. Falkowski-Gilski
M. Habrych
B. Miedziński
B. Polnik
J. Wandzio
P. Jedlikowski

- Elektronika Ir Elektrotechnika - Rok 2020

This paper presents results of research on the effectiveness of bi-directional voice transmission in a 6 kV mine cable network using BPL-PLC (Broadband over Power Line - Power Line Communication) technology. It concerns both emergency cable state (supply outage with cable shorted at both ends) and loaded with distorted current waveforms. The narrowband (0.5 MHz–15 MHz) and broadband (two different modes, frequency range of 3 MHz–7.5...

Pełny tekst do pobrania w portalu

Filtry

Katalog

Voice Maps - portable, dedicated GIS for supporting street navigtion and self-dependent movement of the blind

Cross-Lingual Knowledge Distillation via Flow-Based Voice Conversion for Robust Polyglot Text-to-Speech

502 - Diagnosis of dementia and post-diagnostic support – voice of people with dementia living in Poland

Detection and localization of selected acoustic events in acoustic field for smart surveillance applications

Detection and localization of selected acoustic events in 3D acoustic field for smart surveillance applications

Quality Evaluation of Speech Transmission via Two-way BPL-PLC Voice Communication System in an Underground Mine

Enhanced voice user interface employing spatial filtration of signals from acoustic vector sensor

Improved method for real-time speech stretching

Biofilm Growth Causes Damage to Silicone Voice Prostheses in Patients after Surgical Treatment of Locally Advanced Laryngeal Cancer

Playback Attack Detection: The Search for the Ultimate Set of Antispoof Features

Rediscovering Automatic Detection of Stuttering and Its Subclasses through Machine Learning—The Impact of Changing Deep Model Architecture and Amount of Data in the Training Set

A non-uniform real-time speech time-scale stretching method

Automatic singing quality recognition employing artificial neural networks

Creating new voices using normalizing flows

Communication Platform for Evaluation of Transmitted Speech Quality

Wearable system supporting navigation of the blind

A system for singing training

PHONEME DISTORTION IN PUBLIC ADDRESS SYSTEMS

Client-server Approach in the Navigation System for the Blind

Subjective and Objective Quality Evaluation Study of BPL -PLC Wired Medium

Wyszukiwarka

Filtry

Katalog

Wyniki wyszukiwania dla: voice detection