Search results for: SPEECH BIOMETRICS


Displayed results came from alternative search method.

total: 2008

displaying 1000 best results

Examining Influence of Distance to Microphone on Accuracy of Speech Recognition
Publication
- Year 2015
The problem of controlling a machine by the distant-talking speaker without a necessity of handheld or body-worn equipment usage is considered. A laboratory setup is introduced for examination of performance of the developed automatic speech recognition system fed by direct and by distant speech acquired by microphones placed at three different distances from the speaker (0.5 m to 1.5 m). For feature extraction from the voice signal...

Full text to download in external service
New approach to localization of clicks in archive speech signals.
Publication
- M. Niedźwiecki
- A. Sobociński
- Year 2004
The problem of localizing impulsive distortions in archive speech signals is presented. It is shown that detection based on a two-range autoregressive model, combined with bidirectional processing, yields a significant improvement over existing methods of localizing distortions.
Visual Lip Contour Detection for the Purpose of Speech Recognition
Publication
- Year 2014
A method for visual detection of lip contours in frontal recordings of speakers is described and evaluated. The purpose of the method is to facilitate speech recognition with visual features extracted from a mouth region. Different Active Appearance Models are employed for finding lips in video frames and for lip shape and texture statistical description. Search initialization procedure is proposed and error measure values are...
Detecting Lombard Speech Using Deep Learning Approach
Publication
- K. Kąkol
- G. Korvel
- G. Tamulevicius
- B. Kostek
- SENSORS - Year 2023
Robust detection of Lombard speech in noise is challenging. This study proposes a strategy to detect Lombard speech using a machine learning approach for applications such as public address systems that work in near real time. The paper starts with the background concerning the Lombard effect. Then, assumptions of the work performed for Lombard speech detection are outlined. The framework proposed combines convolutional neural networks...

Full text available to download
A Method of Real-Time Non-uniform Speech Stretching
Publication
- A. Kupryjanow
- A. Czyżewski
- Year 2012
The developed method of real-time non-uniform speech stretching is presented. The proposed solution is based on the well-known SOLA algorithm (Synchronous Overlap and Add). Non-uniform time-scale modification is achieved by adjusting the time scaling factor values in accordance with the signal content. Depending on the speech unit (vowels/consonants), instantaneous rate of speech (ROS), and speech signal presence, values of the scaling factor...

Full text to download in external service
Transfer learning in imagined speech EEG-based BCIs
Publication
- J. S. Garcia Salinas
- L. Villaseñor-Pineda
- C. A. Reyes-García
- A. A. Torres-García
- Biomedical Signal Processing and Control - Year 2019
Brain–Computer Interfaces (BCI) based on electroencephalograms (EEG) are systems whose aim is to provide a communication channel between any person and a computer. Initially they were proposed to aid people with disabilities, but wider applications have since been proposed. These devices allow users to send messages or to control devices using brain signals. There are different neuro-paradigms which evoke brain signals of interest...

Full text available to download
Advanced speech archiving and restoration system for aviation applications
Publication
- A. Czyżewski
- J. Kotus
- A. Kaczmarek
- A. Rypulak
- A. Pawlik
- Year 2005
The paper presents a Speech Recording and Reconstruction System developed for aviation needs. The system enables simultaneous recording, archiving, and intelligibility improvement of speech signals coming from many different radio communication channels. The main purpose of the system is to record and reconstruct verbal messages exchanged by radio between an aircraft pilot and the flight control station; this is extremely important in...
Application of hybrid signals processors to speech and hearing aids
Publication
- P. Odya
- A. Czyżewski
- Year 2005
Thanks to progress in Digital Signal Processor (DSP) technology, it has become possible to build miniature hearing and speech prostheses. Despite their small size, these processors can execute complex algorithms. An additional advantage is the ease of changing their software, and consequently the ease of changing the field of application. The paper focuses on issues related to the design and implementation of algorithms applicable...
A Mechatronic System for Building the Map of Optimal Spindle Speeds During High Speed Milling of Flexible Details
Publication
- Year 2011
The paper presents a mechatronic system for creating a map of optimal spindle speeds in order to supervise chatter vibration during high-speed milling of flexible workpieces. The system consists of a measurement part and a computational part, in which proprietary and commercial software is used. Based on the generalized Liao-Young condition, maps of optimal spindle speeds were created,...
Developing World Bioethics

Journals

ISSN: 1471-8731 , eISSN: 1471-8847
Monash Bioethics Review

Journals

ISSN: 1321-2753 , eISSN: 1836-6716
AJOB Empirical Bioethics

Journals

ISSN: 2329-4515
Narrative inquiry in bioethics

Journals

ISSN: 2157-1732
Canadian Journal of Bioethics

Journals

eISSN: 2561-4665
AMERICAN JOURNAL OF BIOETHICS

Journals

ISSN: 1526-5161 , eISSN: 1536-0075
Asian Bioethics Review

Journals

ISSN: 1793-9453
Theoretical Medicine and Bioethics

Journals

ISSN: 1386-7415 , eISSN: 1573-1200
International Journal of Speech-Language Pathology (previously called Advances in Speech-Language Pathology)

Journals

ISSN: 1754-9507 , eISSN: 1754-9515
Puhe ja Kieli (Speech and Language)

Journals

ISSN: 1458-3410
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING

Journals

ISSN: 1063-6676
JOURNAL OF MEDICAL SPEECH-LANGUAGE PATHOLOGY

Journals

ISSN: 1065-1438
LANGUAGE SPEECH AND HEARING SERVICES IN SCHOOLS

Journals

ISSN: 0161-1461 , eISSN: 1558-9129
AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY

Journals

ISSN: 1058-0360 , eISSN: 1558-9110
American Speech: a Quarterly of Linguistic Usage

Journals

ISSN: 0003-1283 , eISSN: 1527-2133
Journal of Speech, Language, and Hearing Research

Journals

ISSN: 1092-4388 , eISSN: 1558-9102
Objectivization of phonological evaluation of speech elements by means of audio parametrization
Publication
- Year 2018
This study addresses two issues related to both machine- and subjective-based speech evaluation by investigating five phonological phenomena related to allophone production. Its aim is to use objective parametrization and phonological classification of the recorded allophones. These allophones were selected as specifically difficult for Polish speakers of English: aspiration, final obstruent devoicing, dark lateral /l/, velar nasal...
Interpretable Deep Learning Model for the Detection and Reconstruction of Dysarthric Speech
Publication
- D. Korzekwa
- R. Barra-Chicote
- B. Kostek
- T. Drugman
- M. Łajszczak
- Year 2019
We present a novel deep learning model for the detection and reconstruction of dysarthric speech. We train the model with a multi-task learning technique to jointly solve dysarthria detection and speech reconstruction tasks. The model's key feature is a low-dimensional latent space that is meant to encode the properties of dysarthric speech. It is commonly believed that neural networks are black boxes that solve problems but do not...

Full text available to download
Virtual keyboard controlled by eye gaze employing speech synthesis
Publication
- B. Kunka
- R. Rybacki
- K. Łopatka
- A. Czyżewski
- B. Kostek
- Year 2010
The article presents the speech synthesis integrated into the eye gaze tracking system. This approach can significantly improve the quality of life of physically disabled people who are unable to communicate. The virtual keyboard (QWERTY) is an interface which allows for entering the text for the speech synthesizer. First, this article describes a methodology of determining the fixation point on a computer screen. Then it presents...
Real-time speech stretching for supporting hearing impaired schoolchildren
Publication
- A. Kupryjanow
- A. Czyżewski
- Elektronika : konstrukcje, technologie, zastosowania - Year 2010
A study of time scale modification algorithms applied to support hearing impaired schoolchildren is presented. A variety of algorithms are considered, namely: overlap-and-add, two variations of synchronous overlap-and-add, and the phase vocoder. Their effectiveness as well as real-time processing capabilities are examined.

Full text to download in external service
Automatic prosodic modification in a Text-To-Speech synthesizer of Polish language
Publication
- K. Łopatka
- P. Suchomski
- A. Czyżewski
- Elektronika : konstrukcje, technologie, zastosowania - Year 2011
A Polish speech synthesis system with automatic modification of utterance prosody is presented. Methods for automatically determining the accent and intonation of an utterance are described. The application of speech signal processing algorithms to shaping prosody is presented. The influence of the applied modifications on the naturalness of the synthesized signal is discussed. The method used is based on the TD-PSOLA algorithm. The developed...
Virtual Keyboard controlled by eye gaze employing speech synthesis
Publication
- K. Łopatka
- R. Rybacki
- B. Kunka
- A. Czyżewski
- B. Kostek
- Elektronika : konstrukcje, technologie, zastosowania - Year 2011
The article presents the speech synthesis integrated into the eye gaze tracking system. This approach can significantly improve the quality of life of physically disabled people who are unable to communicate. The virtual keyboard (QWERTY) is an interface which allows for entering the text for the speech synthesizer. First, this article describes a methodology of determining the fixation point on a computer screen. Then it presents...

Full text to download in external service
Auditory-model based robust feature selection for speech recognition
Publication
- C. Koniaris
- W. Kleijn
- M. Kuropatwiński
- Journal of the Acoustical Society of America - Year 2010
Full text to download in external service
A hybrid speech codec employing parametric and perceptual coding techniques
Publication
- Year 2006
The paper presents a hybrid speech codec for VoIP communication applications that uses parametric and perceptual coding. The speech signal is divided into voiced components, which are perceptually coded, unvoiced components, which are coded parametrically, and transients, which are not coded with any lossy method. In addition, a codec architecture is presented in which the perceptually coded and transmitted...
Speech codec enhancements utilizing time compression and perceptual coding
Publication
- M. Kulesza
- A. Czyżewski
- Year 2007
A method for encoding a wideband speech signal employing standardized narrowband speech codecs is presented, as well as experimental results concerning detection of tonal spectral components. The speech signal, sampled at a higher rate than is suitable for a narrowband coding algorithm, is compressed in order to decrease the number of samples. Next, the time-compressed representation of a signal is encoded using a narrowband...
Material for Automatic Phonetic Transcription of Speech Recorded in Various Conditions
Publication
- Year 2016
Automatic speech recognition (ASR) is under constant development, especially in cases when speech is casually produced or it is acquired in various environment conditions, or in the presence of background noise. Phonetic transcription is an important step in the process of full speech recognition and is discussed in the presented work as the main focus in this process. ASR is widely implemented in mobile devices technology, but...

Full text to download in external service
Human-computer interactions in speech therapy using a blowing interface
Publication
- Year 2014
In this paper we present a new human-computer interface for the quantitative measurement of blowing activities. The interface can measure the air flow and air pressure during the blowing activity. The measured values are stored and used to control the state of the graphical objects in the graphical user interface. In speech therapy children will find it easier to play attractive therapeutic games than to perform repetitive and tedious,...

Full text to download in external service
Distortion of speech signals in the listening area: its mechanism and measurements
Publication
- H. Lasota
- R. Mazurek
- I. Kochańska
- Year 2014
The paper deals with the problem of the influence of the number and distribution of loudspeakers in speech reinforcement systems on the quality of publicly addressed voice messages, namely on speech intelligibility in the listening area. Linear superposition of time-shifted broadband waves of the same form and slightly different magnitudes that reach a listener from numerous coherent sources is accompanied by interference effects...

Full text to download in external service
Noise profiling for speech enhancement employing machine learning models
Publication
- K. Kąkol
- G. Korvel
- B. Kostek
- Journal of the Acoustical Society of America - Year 2022
This paper aims to propose a noise profiling method that can be performed in near real-time based on machine learning (ML). To address challenges related to noise profiling effectively, we start with a critical review of the literature background. Then, we outline the experiment performed consisting of two parts. The first part concerns the noise recognition model built upon several baseline classifiers and noise signal features...

Full text available to download
System Supporting Speech Perception in Special Educational Needs Schoolchildren
Publication
- A. Kupryjanow
- P. Suchomski
- P. Odya
- A. Czyżewski
- Year 2012
A system supporting speech perception during classes is presented. The system combines a portable device, which enables real-time speech stretching, with a workstation designed to perform hearing tests. The system was designed to help children suffering from Central Auditory Processing Disorders.

Full text to download in external service
Methods of Improving Speech Intelligibility for Listeners with Hearing Resolution Deficit
Publication
- A. Kupryjanow
- A. Czyżewski
- Diagnostic Pathology - Year 2012
Methods developed for real-time time scale modification (TSM) of speech signal are presented. They are based on the non-uniform, speech-rate-dependent SOLA algorithm (Synchronous Overlap and Add). Influence of the proposed method on the intelligibility of speech was investigated for two separate groups of listeners, i.e. hearing impaired children and elderly listeners. It was shown that for the speech with average rate equal to or...

Full text available to download
Analysis of human behavioral patterns
Publication
- A. Kołakowska
- Year 2022
Widespread usage of Internet and mobile devices entailed growing requirements concerning security which in turn brought about development of biometric methods. However, a specially designed biometric system may infer more about users than just verifying their identity. Proper analysis of users’ characteristics may also tell much about their skills, preferences, feelings. This chapter presents biometric methods applied in several...

Full text to download in external service
IEEE Transactions on Audio Speech and Language Processing

Journals

ISSN: 1558-7916
IEEE Transactions on Biometrics, Behavior, and Identity Science

Journals

eISSN: 2637-6407
Analysis of 2D Feature Spaces for Deep Learning-based Speech Recognition
Publication
- G. Korvel
- P. Treigys
- G. Tamulevicius
- J. Bernataviciene
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2018
convolutional neural network (CNN) which is a class of deep, feed-forward artificial neural network. We decided to analyze audio signal feature maps, namely spectrograms, linear and Mel-scale cepstrograms, and chromagrams. The choice was made upon the fact that CNN performs well in 2D data-oriented processing contexts. Feature maps were employed in the Lithuanian word recognition task. The spectral analysis led to the highest word...
An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics
Publication
- G. Korvel
- O. Kurasova
- B. Kostek
- Year 2019
The speech with the Lombard effect has been extensively studied in the context of speech recognition or speech enhancement. However, few studies have investigated the Lombard effect in the context of speech synthesis. The aim of this paper is to create a mathematical model that allows for retaining the Lombard effect. These models could be used as a basis of a formant speech synthesizer. The proposed models are based on dividing...

Full text available to download
Corrupted speech intelligibility improvement using adaptive filter based algorithm
Publication
- D. Ellwart
- A. Czyżewski
- Year 2010
A technique for improving the quality of speech signals recorded in strong noise is presented. The proposed algorithm employing adaptive filtration is described and additional possibilities of speech intelligibility improvement are discussed. Results of the tests are presented.
Speech formant frequency and pitch estimation using instantaneous complex frequency
Publication
- M. Kaniewska
- Year 2008
The paper describes an algorithm for estimating the fundamental frequency and the center frequencies and bandwidths of speech formants using instantaneous complex frequency. Results of the algorithm for Polish vowels are also presented.
A non-uniform real-time speech time-scale stretching method
Publication
- A. Kupryjanow
- A. Czyżewski
- Year 2011
An algorithm for non-uniform real-time speech stretching is presented. It combines the typical SOLA algorithm (Synchronous Overlap and Add) with vowel, consonant, and silence detectors. Based on the information about the content and the estimated value of the rate of speech (ROS), the algorithm adapts the scaling factor value. The ability of real-time speech stretching and the resultant quality of voice were...
Time-scale modification of speech signals for supporting hearing impaired schoolchildren
Publication
- A. Kupryjanow
- A. Czyżewski
- Year 2009
A study of time scale modification algorithms applied to supporting hearing impaired schoolchildren is presented. A variety of algorithms are considered, namely: overlap and add, two variations of synchronized overlap and add, and the phase vocoder. Their effectiveness as well as real-time processing capabilities are examined.
Estimation of the short-term predictor parameters of speech under noisy conditions
Publication
- W. Kleijn
- M. Kuropatwiński
- IEEE Transactions on Audio Speech and Language Processing - Year 2006
Full text to download in external service
