Search results for: Query by Sketch

The results shown come from an alternative search.

Total results: 678

Speech recognition system for hearing impaired people.
Publication
- P. Dalka
- A. Czyżewski
- Year 2005
The paper presents results of research on speech recognition. The system under development, which uses visual and acoustic data, will facilitate training in correct speech for people after cochlear implant surgery and for others with severe hearing impairment. Active Shape Models were used to determine visual parameters based on analysis of the shape and movement of the lips in video recordings. The acoustic parameters are based on...
Tensor Decomposition for Imagined Speech Discrimination in EEG
Publication
- J. S. Garcia Salinas
- L. Villaseñor-Pineda
- C. A. Reyes-García
- A. A. Torres-García
- LECTURE NOTES IN COMPUTER SCIENCE - Year 2018
Most research on electroencephalogram (EEG)-based Brain-Computer Interfaces (BCI) focuses on the use of motor imagery. As an attempt to improve the control of these interfaces, the use of language instead of movement has recently been explored, in the form of imagined speech. This work aims to discriminate imagined words in electroencephalogram signals. For this purpose, the analysis of multiple variables...

Full text available for download from an external service
Applying the Lombard Effect to Speech-in-Noise Communication
Publication
- G. Korvel
- K. Kąkol
- P. Treigys
- B. Kostek
- Electronics - Year 2023
This study explored how the Lombard effect, a natural or artificial increase in speech loudness in noisy environments, can improve speech-in-noise communication. This study consisted of several experiments that measured the impact of different types of noise on synthesizing the Lombard effect. The main steps were as follows: first, a dataset of speech samples with and without the Lombard effect was collected in a controlled setting;...

Full text available for download in the portal
Building Knowledge for the Purpose of Lip Speech Identification
Publication
- Advances in Intelligent Systems and Computing - Year 2017
Consecutive stages of building knowledge for automatic lip speech identification are shown in this study. The main objective is to prepare audio-visual material for phonetic analysis and transcription. First, approximately 260 sentences of natural English were prepared taking into account the frequencies of occurrence of all English phonemes. Five native speakers from different countries read the selected sentences in front of...

Full text available for download from an external service
Methodology and technology for the polymodal allophonic speech transcription
Publication
- Journal of the Acoustical Society of America - Year 2016
A method for automatic audiovisual transcription of speech employing acoustic and visual speech representations is developed. It combines audio and visual modalities, which provides a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e. the changes in the articulatory setting of speech organs for...

Full text available for download from an external service
Methodology and technology for the polymodal allophonic speech transcription
Publication
- Journal of the Acoustical Society of America - Year 2016
A method for automatic audiovisual transcription of speech employing acoustic, electromagnetic articulography, and visual speech representations is developed. It combines audio and visual modalities, which provides a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e., the changes in the articulatory...

Full text available for download from an external service
Multimodal English corpus for automatic speech recognition
Publication
- Year 2013
A multimodal corpus developed for research on speech recognition based on audio-visual data is presented. Besides the usual video and sound excerpts, the prepared database also contains thermovision images and depth maps. All streams were recorded simultaneously, so the corpus makes it possible to examine the importance of the information provided by different modalities. Based on the recordings, it is also possible to develop a speech...
New generation speech aid for stuttering people
Publication
- P. Odya
- A. Czyżewski
- Archives of Acoustics - Year 2008
Modern Digital Signal Processors (DSPs) are small, yet capable of executing complex algorithms. An additional advantage is the ease of replacing their software, and hence the ease of changing their field of application. By exploiting the capabilities of these processors, it has become possible to build miniature hearing and speech aids. The paper focuses on issues related to the design and implementation of algorithms...

Full text available for download in the portal
Improved method for real-time speech stretching
Publication
- A. Kupryjanow
- A. Czyżewski
- Year 2012
An algorithm for real-time speech stretching is presented. It was designed to modify the input signal depending on its content and on its relation to the historical input data. The proposed algorithm is a combination of speech signal analysis algorithms, i.e. voice, vowels/consonants, and stuttering detection, and a SOLA (Synchronous-Overlap-and-Add) based speech stretching algorithm. This approach enables stretching input speech signal...

Full text available for download from an external service
Influence of modulation detection threshold on speech intelligibility
Publication
- K. Leo
- ACTA PHYSICA POLONICA A - Year 2011
Full text available for download in the portal
Communication Platform for Evaluation of Transmitted Speech Quality
Publication
- A. Ciarkowski
- A. Czyżewski
- Journal of Telecommunications and Information Technology - Year 2011
A designed and implemented voice communication system is described. The purpose of the presented platform was to enable a series of experiments related to the quality assessment of algorithms used in the coding and transmitting of speech. The system is equipped with tools for recording signals at each stage of processing, making it possible to subject them to subjective assessment by listening tests or to objective evaluation employing...

Full text available for download in the portal
Improving the quality of speech in the conditions of noise and interference
Publication
- B. Kostek
- K. Kąkol
- Journal of the Acoustical Society of America - Year 2018
The aim of the work is to present a method of intelligent modification of the speech signal with speech features expressed in noise, based on the Lombard effect. The recordings utilized sets of words and sentences as well as disturbing signals, i.e., pink noise and the so-called babble speech. Noise signal, calibrated to various levels at the speaker's ears, was played over two loudspeakers located 2 m away from the speaker. In...

Full text available for download from an external service
Constructing a Dataset of Speech Recordings with Lombard Effect
Publication
- D. Weber
- S. Zaporowski
- D. Korzekwa
- Year 2020
The purpose of the recordings was to create a speech corpus based on the ISLE dataset, extended with video and Lombard speech. Selected from a set of 165 sentences, 10, evaluated as having the highest possibility to occur in the context of the Lombard effect, were repeated in the presence of the so-called babble speech to obtain Lombard speech features. Altogether, 15 speakers were recorded, and speech parameters were...
Time-domain prosodic modifications for text-to-speech synthesizer
Publication
- J. Łopatka
- P. Suchomski
- A. Czyżewski
- Year 2010
An application of prosodic speech processing algorithms to Text-To-Speech synthesis is presented. Prosodic modifications that improve the naturalness of the synthesized signal are discussed. The applied method is based on the TD-PSOLA algorithm. The developed Text-To-Speech Synthesizer is used in applications employing multimodal computer interfaces.
Investigations of speech signal parameters with regard to articulation influences
Publication
- A. Kaczmarek
- Zeszyty Naukowe Wydziału ETI Politechniki Gdańskiej. Technologie Informacyjne - Year 2008
The paper addresses the parameterization of the speech signal in the context of biometric feature extraction. The analyzed parameters are cepstral parameters (linear cepstrum and mel-cepstrum, i.e. MFCC), linear prediction (LPC) parameters, spectral moments, and the F0 parameter. The analysis was carried out in short, fixed-length signal segments with large overlap, the so-called "implicit segmentation". This made it possible to observe...
Detection of dialogue in movie soundtrack for speech intelligibility enhancement
Publication
- K. Łopatka
- Year 2014
A method for detecting dialogue in 5.1 movie soundtrack based on interchannel spectral disparity is presented. The front channel signals (left, right, center) are analyzed in the frequency domain. The selected partials in the center channel signal, which yield high disparity with left and right channels, are detected as dialogue. Subsequently, the dialogue frequency components are boosted to achieve increased dialogue intelligibility....

Full text available for download from an external service
Visual Lip Contour Detection for the Purpose of Speech Recognition
Publication
- Year 2014
A method for visual detection of lip contours in frontal recordings of speakers is described and evaluated. The purpose of the method is to facilitate speech recognition with visual features extracted from a mouth region. Different Active Appearance Models are employed for finding lips in video frames and for lip shape and texture statistical description. Search initialization procedure is proposed and error measure values are...
System of speech signal processing and visualisation for linguistic purposes
Publication
- K. Wojan
- Archives of Acoustics - Year 2005
Digital analysis of ethnic speech – extraction of information code
Publication
- K. Wojan
- Archives of Acoustics - Year 2003
On the EM algorithm for the estimation of speech AR parameters in noise
Publication
- M. Kuropatwinski
- B. Kleijn
- M. Kuropatwiński
- Year 2014
Full text available for download from an external service
Evaluation and Irony in Text in the Light of Speech Act Theory
Publication
- K. Kukowicz-Zarska
- Forum Filologiczne Ateneum - Year 2020
Full text available for download from an external service
Automatic Image and Speech Recognition Based on Neural Network
Publication
- D. Król
- B. Szlachetko
- Journal of Information Technology Research - Year 2010
Full text available for download from an external service
Audiovisual speech recognition for training hearing impaired patients
Publication
- Year 2006
The paper presents a system for recognizing isolated speech sounds that uses visual and acoustic data. Active Shape Models were used to determine visual parameters based on analysis of the shape and movement of the lips in video recordings. The acoustic parameters are based on mel-cepstral coefficients. A neural network was used to recognize the uttered sounds based on a feature vector containing both types...
New approach to localization of clicks in archive speech signals.
Publication
- M. Niedźwiecki
- A. Sobociński
- Year 2004
The problem of localizing impulsive disturbances in archive speech signals is presented. It is shown that detection based on a two-range autoregressive model combined with bidirectional processing yields a significant improvement over existing disturbance localization methods.
Advanced speech archiving and restoration system for aviation applications
Publication
- A. Czyżewski
- J. Kotus
- A. Kaczmarek
- A. Rypulak
- A. Pawlik
- Year 2005
The paper presents the Speech Recording and Reconstruction System developed for aviation purposes. The system enables simultaneous recording, archiving, and intelligibility improvement of speech signals originating from many different radio communication channels. The main purpose of the system is to record and reconstruct verbal messages exchanged by radio between the aircraft pilot and the flight control station; this is extremely important in...
Application of hybrid signals processors to speech and hearing aids
Publication
- P. Odya
- A. Czyżewski
- Year 2005
Thanks to progress in Digital Signal Processor (DSP) technology, it has become possible to build miniature hearing and speech aids. Despite their small size, these processors are able to execute complex algorithms. An additional advantage is the ease of changing their software, and hence the ease of changing their field of application. The paper focuses on issues related to the design and implementation of algorithms applicable...
Detecting Lombard Speech Using Deep Learning Approach
Publication
- K. Kąkol
- G. Korvel
- G. Tamulevicius
- B. Kostek
- SENSORS - Year 2023
Robust detection of Lombard speech in noise is challenging. This study proposes a strategy to detect Lombard speech using a machine learning approach for applications such as public address systems that work in near real time. The paper starts with the background concerning the Lombard effect. Then, assumptions of the work performed for Lombard speech detection are outlined. The framework proposed combines convolutional neural networks...

Full text available for download in the portal
Transfer learning in imagined speech EEG-based BCIs
Publication
- J. S. Garcia Salinas
- L. Villaseñor-Pineda
- C. A. Reyes-García
- A. A. Torres-García
- Biomedical Signal Processing and Control - Year 2019
Brain–Computer Interfaces (BCI) based on electroencephalograms (EEG) are systems whose aim is to provide a communication channel between any person and a computer. They were initially proposed to aid people with disabilities, but wider applications have since been proposed. These devices make it possible to send messages or control devices using brain signals. There are different neuro-paradigms which evoke brain signals of interest...

Full text available for download in the portal
Examining Influence of Distance to Microphone on Accuracy of Speech Recognition
Publication
- Year 2015
The problem of controlling a machine by a distant-talking speaker without the need for handheld or body-worn equipment is considered. A laboratory setup is introduced for examining the performance of the developed automatic speech recognition system fed by direct and by distant speech acquired by microphones placed at three different distances from the speaker (0.5 m to 1.5 m). For feature extraction from the voice signal...

Full text available for download from an external service
MODEL FOR MEASUREMENT OF FLOW INSTALLATION TIME IN SDN SWITCH
Publication
- S. Kaczmarek
- J. A. Litka
- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Year 2017
SDN is an approach in telecommunication networks that separates the control plane from the data forwarding plane by specifying a single network entity as a controller that defines rules (called flows) of traffic forwarding for the switches connected to it. The time required to install these rules might hinder the overall performance of an SDN network. In the paper, a model for testing and evaluating the influence...

Full text available for download from an external service
A Method of Real-Time Non-uniform Speech Stretching
Publication
- A. Kupryjanow
- A. Czyżewski
- Year 2012
A developed method of real-time non-uniform speech stretching is presented. The proposed solution is based on the well-known SOLA algorithm (Synchronous Overlap and Add). Non-uniform time-scale modification is achieved by the adjustment of time scaling factor values in accordance with the signal content. Depending on the speech unit (vowels/consonants), instantaneous rate of speech (ROS), and speech signal presence, values of the scaling factor...

Full text available for download from an external service
Comparison of various speech time-scale modification methods
Publication
- A. Kupryjanow
- A. Czyżewski
- Archives of Acoustics - Year 2011
The objective of this work is to investigate the influence of different time-scale modification (TSM) methods on the quality of speech stretched using the designed non-uniform real-time speech time-scale modification algorithm (NU-RTSM). The algorithm provides a combination of the typical TSM algorithm with the vowels, consonants, stutter, transients and silence detectors. Based on the information about the content and...
Ranking Speech Features for Their Usage in Singing Emotion Classification
Publication
- S. Zaporowski
- B. Kostek
- Year 2020
This paper aims to retrieve speech descriptors that may be useful for the classification of emotions in singing. For this purpose, Mel Frequency Cepstral Coefficients (MFCC) and selected Low-Level MPEG 7 descriptors were calculated based on the RAVDESS dataset. The database contains recordings of emotional speech and singing of professional actors presenting six different emotions. Employing the algorithm of Feature Selection based...

Full text available for download in the portal
Acoustic Sensing Analytics Applied to Speech in Reverberation Conditions
Publication
- SENSORS - Year 2021
The paper aims to discuss a case study of sensing analytics and technology in acoustics when applied to reverberation conditions. Reverberation is one of the issues that makes speech in indoor spaces challenging to understand. This problem is particularly critical in large spaces with few absorbing or diffusing surfaces. One of the natural remedies to improve speech intelligibility in such conditions may be achieved through speaking...

Full text available for download in the portal
Improving Objective Speech Quality Indicators in Noise Conditions
Publication
- K. Kąkol
- G. Korvel
- B. Kostek
- Year 2020
This work aims to modify speech signal samples and test them with objective speech quality indicators after mixing the original signals with noise or with an interfering signal. Modifications that are applied to the signal are related to the Lombard speech characteristics, i.e., pitch shifting, utterance duration changes, vocal tract scaling, manipulation of formants. A set of words and sentences in Polish, recorded in silence,...

Full text available for download from an external service
An audio-visual corpus for multimodal automatic speech recognition
Publication
- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Year 2017
A review of available audio-visual speech corpora and a description of a new multimodal corpus of English speech recordings are provided. The new corpus containing 31 hours of recordings was created specifically to assist audio-visual speech recognition systems (AVSR) development. The database related to the corpus includes high-resolution, high-framerate stereoscopic video streams from RGB cameras, depth imaging stream utilizing Time-of-Flight...

Full text available for download in the portal
Results of tests on speech intelligibility in reverberant conditions
Research Data
open access
The dataset contains the results of tests that aimed to provide a relationship between the rate of speech (RoS) and reverberation conditions characterized by the Speech Transmission Index (STI).
International Journal of Speech Technology

Journals

ISSN: 1381-2416, eISSN: 1572-8110
Journal of Monolingual and Bilingual Speech

Journals

ISSN: 2631-8407, eISSN: 2631-8415
Virtual keyboard controlled by eye gaze employing speech synthesis
Publication
- B. Kunka
- R. Rybacki
- K. Łopatka
- A. Czyżewski
- B. Kostek
- Year 2010
The article presents the speech synthesis integrated into the eye gaze tracking system. This approach can significantly improve the quality of life of physically disabled people who are unable to communicate. The virtual keyboard (QWERTY) is an interface which allows for entering the text for the speech synthesizer. First, this article describes a methodology of determining the fixation point on a computer screen. Then it presents...
Real-time speech stretching for supporting hearing impaired schoolchildren
Publication
- A. Kupryjanow
- A. Czyżewski
- Elektronika : konstrukcje, technologie, zastosowania - Year 2010
A study of time scale modification algorithms applied to support hearing impaired schoolchildren is presented. A variety of algorithms are considered, namely overlap-and-add, two variations of synchronous overlap-and-add, and the phase vocoder. Their effectiveness as well as real-time processing capabilities are examined.

Full text available for download from an external service
Speech codec enhancements utilizing time compression and perceptual coding
Publication
- M. Kulesza
- A. Czyżewski
- Year 2007
A method for encoding a wideband speech signal employing standardized narrowband speech codecs is presented, as well as experimental results concerning detection of tonal spectral components. The speech signal, sampled at a higher rate than is suitable for a narrowband coding algorithm, is compressed in order to decrease the number of samples. Next, the time-compressed representation of a signal is encoded using a narrowband...
Auditory-model based robust feature selection for speech recognition
Publication
- C. Koniaris
- M. Kuropatwinski
- W. Kleijn
- M. Kuropatwiński
- Journal of the Acoustical Society of America - Year 2010
Full text available for download from an external service
A hybrid speech codec employing parametric and perceptual coding techniques
Publication
- Year 2006
The paper presents a hybrid speech codec for VoIP communication applications that uses parametric and perceptual coding. The speech signal is divided into voiced components, which undergo perceptual coding, unvoiced components, which are coded with a parametric method, and transients, which are not coded with any lossy method. In addition, a codec architecture is presented in which the perceptually coded and transmitted...
Modeling of conducted emission of dc-dc switch-mode converter
Publication
- Year 2003
The publication presents a method for modeling and determining the conducted emission of electromagnetic disturbances in power electronic converters. Using a DC-DC converter as an example, suitable CAD tools were selected for wideband modeling and simulation of a power electronic converter. The results obtained were experimentally confirmed and compared in both the time and frequency domains...
Noise profiling for speech enhancement employing machine learning models
Publication
- K. Kąkol
- G. Korvel
- B. Kostek
- Journal of the Acoustical Society of America - Year 2022
This paper aims to propose a noise profiling method that can be performed in near real-time based on machine learning (ML). To address challenges related to noise profiling effectively, we start with a critical review of the literature background. Then, we outline the experiment performed consisting of two parts. The first part concerns the noise recognition model built upon several baseline classifiers and noise signal features...

Full text available for download in the portal
Material for Automatic Phonetic Transcription of Speech Recorded in Various Conditions
Publication
- Year 2016
Automatic speech recognition (ASR) is under constant development, especially in cases when speech is casually produced, acquired in various environmental conditions, or recorded in the presence of background noise. Phonetic transcription is an important step in the process of full speech recognition and is the main focus of the presented work. ASR is widely implemented in mobile device technology, but...

Full text available for download from an external service
Performance Evaluation of a 650V E-HEMT GaN Power Switch
Publication
- P. Czyż
- Year 2015
GaN power switches have better characteristics compared to the state-of-the-art Si power transistors. These devices offer high operating temperature and current densities, fast switching and low on-resistance. However, currently only a few producers offer technology of high voltage GaN transistors. Immaturity of this technology is the reason why experimental evaluation of GaN parameters must be performed to properly exploit their...
System Supporting Speech Perception in Special Educational Needs Schoolchildren
Publication
- A. Kupryjanow
- P. Suchomski
- P. Odya
- A. Czyżewski
- Year 2012
A system supporting speech perception during classes is presented in the paper. The system combines a portable device, which enables real-time speech stretching, with a workstation designed to perform hearing tests. The system was designed to help children suffering from Central Auditory Processing Disorders.

Full text available for download from an external service
Methods of Improving Speech Intelligibility for Listeners with Hearing Resolution Deficit
Publication
- A. Kupryjanow
- A. Czyżewski
- Diagnostic Pathology - Year 2012
Methods developed for real-time time scale modification (TSM) of speech signal are presented. They are based on the non-uniform, speech-rate-dependent SOLA algorithm (Synchronous Overlap and Add). The influence of the proposed method on the intelligibility of speech was investigated for two separate groups of listeners, i.e. hearing impaired children and elderly listeners. It was shown that for the speech with average rate equal to or...

Full text available for download in the portal
