Wyniki wyszukiwania dla: SPEECH PROCESSING - MOST Wiedzy

Wyszukiwarka

Wyniki wyszukiwania dla: SPEECH PROCESSING

Wyniki wyszukiwania dla: SPEECH PROCESSING

  • BPL-PLC Voice Communication System for the Oil and Mining Industry

    Publikacja
    • G. Debita
    • P. Falkowski-Gilski
    • M. Habrych
    • G. Wiśniewski
    • B. Miedziński
    • P. Jedlikowski
    • A. Waniewska
    • J. Wandzio
    • B. Polnik

    - ENERGIES - Rok 2020

    Application of a high-efficiency voice communication systems based on broadband over power line-power line communication (BPL-PLC) technology in medium voltage networks, including hazardous areas (like the oil and mining industry), as a redundant mean of wired communication (apart from traditional fiber optics and electrical wires) can be beneficial. Due to the possibility of utilizing existing electrical infrastructure, it can...

    Pełny tekst do pobrania w portalu

  • Waveguide model of the hearing aid earmold system

    Publikacja

    - Rok 2006

    Background The earmold system of the Behind-The-Ear hearing aid is an acoustic system that modifies the spectrum of the propagated sound waves. Improper selection of the earmold system may result in deterioration of sound quality and speech intelligibility. Computer modeling methods may be useful in the process of hearing aid fitting, allowing physician to examine various earmold system configurations and choose the optimum one...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Waveguide model of the hearing aid earmold system

    Publikacja

    Background The earmold system of the Behind-The-Ear hearing aid is an acoustic system that modifies the spectrum of the propagated sound waves. Improper selection of the earmold system may result in deterioration of sound quality and speech intelligibility. Computer modeling methods may be useful in the process of hearing aid fitting, allowing physician to examine various earmold system configurations and choose the optimum one...

    Pełny tekst do pobrania w portalu

  • Multimodal learning application with interactive animated character. [Multimodalna aplikacja edukacyjna wykorzystująca interaktywną animowaną postać]

    Publikacja

    - Rok 2006

    The aim of this study is to design a computer application that may assist teachers and therapists in multimodal manner in their work with impaired or disabled children. The application can be operated in many different ways, giving to a child with special educational needs a possibility to learn and train many skills or treat speech disorders. The main stress in this research is on the creation of animated character that will serve...

  • Trzej prorocy: Sołżenicyn, Friedman, Dugin. Część pierwsza: Sołżenicyn

    Publikacja

    - Rok 2023

    Artykuł przedstawia na tle biograficznym dzieło i myśl profetyczną Aleksandra Sołżenicyna. Podstawą jej analizy jest mowa z okazji przyznania autorowi Oddziału chorych na raka literackiej Nagrody Nobla oraz jego wykład na temat stanu cywilizacji Zachodu wygłoszony na Uniwersytecie Harvarda – zatytułowany Zmierzch odwagi. Proroctwa Sołżenicyna dotyczące Zachodu pokazane są w kontekście jego pracy Jak odbudować Rosję? W artykule...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Voice command recognition using hybrid genetic algorithm

    Publikacja

    Abstract: Speech recognition is a process of converting the acoustic signal into a set of words, whereas voice command recognition consists in the correct identification of voice commands, usually single words. Voice command recognition systems are widely used in the military, control systems, electronic devices, such as cellular phones, or by people with disabilities (e.g., for controlling a wheelchair or operating a computer...

    Pełny tekst do pobrania w portalu

  • New Applications of Multimodal Human-Computer Interfaces

    Publikacja

    - Rok 2012

    Multimodal computer interfaces and examples of their applications to education software and for the disabled people are presented. The proposed interfaces include the interactive electronic whiteboard based on video image analysis, application for controlling computers with gestures and the audio interface for speech stretching for hearing impaired and stuttering people. Application of the eye-gaze tracking system to awareness...

  • Improving listeners' experience for movie playback through enhancing dialogue clarity in soundtracks

    This paper presents a method for improving users' quality of experience through processing of movie soundtracks. The dialogue clarity enhancement algorithms were introduced for detecting dialogue in movie soundtrack mixes and then for amplifying the dialogue components. The front channel signals (left, right, center) are analyzed in the frequency domain. The selected partials in the center channel signal, which yield high disparity...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Detection of dialogue in movie soundtrack for speech intelligibility enhancement

    Publikacja

    - Rok 2014

    A method for detecting dialogue in 5.1 movie soundtrack based on interchannel spectral disparity is presented. The front channel signals (left, right, center) are analyzed in the frequency domain. The selected partials in the center channel signal, which yield high disparity with left and right channels, are detected as dialogue. Subsequently, the dialogue frequency components are boosted to achieve increased dialogue intelligibility....

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Comparative Study of Self-Organizing Maps vs. Subjective Evaluation of Quality of Allophone Pronunciation for Nonnative English Speakers

    Publikacja

    The purpose of this study was to apply Self-Organizing Maps to differentiate between the correct and the incorrect allophone pronunciations and to compare the results with subjective evaluation. Recordings of a list of target words, containing selected allophones of English plosive consonants, the velar nasal and the lateral consonant, were made twice. First, the target words were read from the list by 9 non-native speakers and...

  • Comparison of the Ability of Neural Network Model and Humans to Detect a Cloned Voice

    The vulnerability of the speaker identity verification system to attacks using voice cloning was examined. The research project assumed creating a model for verifying the speaker’s identity based on voice biometrics and then testing its resistance to potential attacks using voice cloning. The Deep Speaker Neural Speaker Embedding System was trained, and the Real-Time Voice Cloning system was employed based on the SV2TTS, Tacotron,...

    Pełny tekst do pobrania w portalu