Search results for: AUTOMATIC SPEECH RECOGNITION, WHISPER, MEDICAL LANGUAGE RECOGNITION, SPEECH PROCESSING

Emotion Recognition
Open Research Data
open access
- M. Przyborski
- K. Bobkowska
- series: Person A
The films presented here were recorded using a Phantom Miro high-speed camera. To play a film, you need special software, which can be downloaded from https://www.phantomhighspeed.com/resourcesandsupport/phantomresources/pccsoftware. Details of the film are available in the description after opening it in the viewer...
Puhe ja Kieli (Speech and Language)

Journals

ISSN: 1458-3410
LANGUAGE SPEECH AND HEARING SERVICES IN SCHOOLS

Journals

ISSN: 0161-1461 , eISSN: 1558-9129
AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY

Journals

ISSN: 1058-0360 , eISSN: 1558-9110
Journal of Speech, Language, and Hearing Research

Journals

ISSN: 1092-4388 , eISSN: 1558-9102
Camera angle invariant shape recognition in surveillance systems
Publication
- D. Ellwart
- A. Czyżewski
- Year 2010
A method for human action recognition in surveillance systems is described. Problems within this task are discussed and a solution based on 3D object models is proposed. The idea is shown and some of its limitations are discussed. Shape description methods are introduced along with their main features. The parameterization algorithm used is presented. The classification problem, restricted to binary cases, is discussed. Support vector...
International Journal of Signal Processing, Image Processing and Pattern Recognition

Journals

ISSN: 2005-4254
Pose classification in the gesture recognition using the linear optical sensor
Publication
- Year 2017
Gesture sensors for mobile devices, which have the capability of distinguishing hand poses, require efficient and accurate classifiers in order to recognize gestures based on sequences of primitives. Two methods of pose recognition for the optical linear sensor were proposed and validated. The Gaussian distribution fitting and Artificial Neural Network based methods represent two kinds of classification approaches. Three types...

Full text to download in external service
Graph Representation Integrating Signals for Emotion Recognition and Analysis
Publication
- SENSORS - Year 2021
Data reusability is an important feature of current research in just about every field of science. Modern research in Affective Computing often relies on datasets containing experiment-derived data such as biosignals, video clips, or images. Moreover, conducting experiments with a vast number of participants to build datasets for Affective Computing research is time-consuming and expensive. Therefore, it is extremely important to...

Full text available to download
On practical application of Shannon theory to character recognition and more
Publication
- M. Jurkiewicz
- Year 2014
Let us consider an optical character recognition system, which in particular can be used for identifying objects that were assigned strings of some length. The system is not perfect, for example, it sometimes recognizes wrongly the characters "Y" and "V". What is the largest set of strings of given length for the system under consideration, which can be mutually correctly recognized, and the corresponding objects correctly identified?...
Acquisition and indexing of RGB-D recordings for facial expressions and emotion recognition
Publication
- M. Szwoch
- Studia Informatica Pomerania - Year 2015
This paper describes KinectRecorder, a comprehensive tool that provides convenient and fast acquisition, indexing and storing of RGB-D video streams from the Microsoft Kinect sensor. The application is especially useful as a supporting tool for the creation of fully indexed databases of facial expressions and emotions that can be further used for learning and testing of emotion recognition algorithms for affect-aware applications....

Full text to download in external service
Molecular Recognition in Complexes of TRF Proteins with Telomeric DNA
Publication
- M. Wieczór
- A. Tobiszewski
- P. Wityk
- B. Tomiczek
- J. Czub
- PLOS ONE - Year 2014
Telomeres are specialized nucleoprotein assemblies that protect the ends of linear chromosomes. In humans and many other species, telomeres consist of tandem TTAGGG repeats bound by a protein complex known as shelterin that remodels telomeric DNA into a protective loop structure and regulates telomere homeostasis. Shelterin recognizes telomeric repeats through its two major components known as Telomere Repeat-Binding Factors, TRF1...

Full text available to download
Accelerometer-based Human Activity Recognition and the Impact of the Sample Size
Publication
- Year 2014
The presented study focused on the recognition of eight user activities (e.g. walking, lying, climbing stairs) based on measurements from an accelerometer embedded in a mobile device. It is assumed that the device is carried in a specific location of the user’s clothing. Three types of classifiers were tested on different sizes of the samples. The influence of the time window (the duration of a single trial) on selected activities...

Full text to download in external service
Comparison of selected off-the-shelf solutions for emotion recognition based on facial expressions
Publication
- Year 2016
The paper concerns accuracy of emotion recognition from facial expressions. As there are a couple of ready off-the-shelf solutions available in the market today, this study aims at practical evaluation of selected solutions in order to provide some insight into what potential buyers might expect. Two solutions were compared: FaceReader by Noldus and Xpress Engine by QuantumLab. The performed evaluation revealed that the recognition...

Full text to download in external service
Human-Computer Interface Based on Visual Lip Movement and Gesture Recognition
Publication
- P. Dalka
- A. Czyżewski
- International Journal of Computing Science and Mathematics - Year 2010
The multimodal human-computer interface (HCI) called LipMouse is presented, allowing a user to work on a computer using movements and gestures made with his/her mouth only. Algorithms for lip movement tracking and lip gesture recognition are presented in detail. User face images are captured with a standard webcam. Face detection is based on a cascade of boosted classifiers using Haar-like features. A mouth region is located in...

Full text to download in external service
Systematic Literature Review for Emotion Recognition from EEG Signals
Publication
- P. A. Leszczełowska
- N. Dawidowska
- Year 2022
Researchers have recently become increasingly interested in recognizing emotions from electroencephalogram (EEG) signals and many studies utilizing different approaches have been conducted in this field. For the purposes of this work, we performed a systematic literature review including over 40 articles in order to identify the best set of methods for the emotion recognition problem. Our work collects information about the most...

Full text to download in external service
Improving Traffic Light Recognition Methods using Shifting Time-Windows
Publication
- A. Blokus
- H. Krawczyk
- Year 2018
We propose a novel method of improving algorithms recognizing traffic lights in video sequences. Our focus is on algorithms for applications which notify the driver of a light in sight. Many existing methods process images in the recording separately. Our method is based on the observation that real-life videos depict underlying continuous processes. We named our method FSA (Frame Sequence Analyzed). It is applicable to any underlying...

Full text to download in external service
Local Texture Pattern Selection for Efficient Face Recognition and Tracking
Publication
- M. Smiatacz
- J. Rumiński
- Advances in Intelligent Systems and Computing - Year 2015
This paper describes research aimed at finding the optimal configuration of the face recognition algorithm based on local texture descriptors (binary and ternary patterns). Since the identification module was supposed to be a part of the face tracking system developed for an interactive wearable computer, proper feature selection, allowing for real-time operation, became particularly important. Our experiments showed that it is...

Full text to download in external service
Influence of Thermal Imagery Resolution on Accuracy of Deep Learning based Face Recognition
Publication
- Year 2019
Human-system interactions frequently require retrieval of key context information about the user and the environment. Image processing techniques have been widely applied in this area, providing details about recognized objects, people and actions. Considering remote diagnostics solutions, e.g. non-contact vital signs estimation and smart home monitoring systems that utilize a person’s identity, security is a very important factor....

Full text available to download
Broadband interference in speech reinforcement systems
Publication
- H. Lasota
- R. Mazurek
- Year 2008
The article addresses the underestimated problem of how the number and distribution of loudspeakers in sound reinforcement systems affect the quality of voice transmission, that is, speech intelligibility in auditoria. The superposition of time-shifted broadband signals of the same shape and slightly different magnitudes, which reach the listener from numerous coherent sources, is accompanied by an interference phenomenon that leads to a deep modification of the received signals...
Novel approaches to wideband speech coding
Publication
- M. Kulesza
- A. Czyżewski
- Year 2008
Two methods of wideband speech coding are presented. The first method uses an algorithm for time compression and expansion of the speech signal, which allows wideband coding of the speech signal with standardized codecs. This method is intended for use in adaptive speech coding algorithms. The second proposed solution concerns a new method of estimating the spectral envelope of the speech signal...

Full text to download in external service
Integration of speech enhancement and coding techniques
Publication
- M. Kuropatwinski
- D. Leckschat
- K. Kroschel
- A. Czyzewski
- M. Kuropatwiński
- Year 1999
Full text to download in external service
A system for multitask noisy speech enhancement.
Publication
- A. Czyżewski
- A. Kaczmarek
- J. Kotus
- A. Pawlik
- A. Rypulak
- P. Żwan
- Year 2004
The article presents a general description of the developed system for speech recording and reconstruction. It describes the components of the system, which is software containing advanced tools for improving speech intelligibility. The implemented tools make it possible to search sound recordings and process them using the implemented plug-ins. The article presents the algorithms used in the system...
Multitask Noisy Speech Enhancement System
Publication
- A. Czyżewski
- J. Kotus
- G. Szwoch
- M. Dziubiński
- A. Rypulak
- A. Pawlik
- Year 2005
The paper describes the Multitask Noisy Speech Enhancement System. It is a specialized software package intended for recording the speech signal and for improving its quality and speech intelligibility using advanced digital signal processing procedures. The software package consists of three programs: Rejestrator (Recorder), Przeglądarka (Viewer) and Rekonstruktor (Reconstructor). This software can be used in cases where the intelligibility...
Contextual Knowledge to Enhance Workplace Hazard Recognition and Interpretation in a Cognitive Vision Platform
Publication
- C. De
- C. Sanin
- E. Szczerbicki
- Year 2018
The combination of vision and sensor data, together with the resulting necessity for formal representations, builds a central component of an autonomous Cyber Physical System for detection and tracking of laborers in workplace environments. This system must be adaptable and perceive the environment as automatically as possible, performing in a variety of plants and scenes without the necessity of recoding the application for each...

Full text available to download
Difference in Perceived Speech Signal Quality Assessment Among Monolingual and Bilingual Teenage Students
Publication
- P. Falkowski-Gilski
- Year 2021
The user perceived quality is a mixture of factors, including the background of an individual. The process of auditory perception is discussed in a wide variety of fields, ranging from engineering to medicine. Many studies examine the difference between musicians and non-musicians. Since musical training develops musical hearing and other various auditory capabilities, similar enhancements should be observable in case of bilingual...

Full text to download in external service
Theory of recognition in a historical perspective. Axel Honneth's Anerkennung: Eine europäische Ideengeschichte
Publication
- A. Karalus
- Archiwum Historii Filozofii i Myśli Społecznej - Year 2019
The article discusses Honneth's excursion into the realm of the history of ideas. This time Honneth decides to focus on the notion of "recognition" in three different cultural areas and three different traditions: French, English, and German. The article discusses Honneth's perspective and attempts to find the common thread that would link the three aforementioned traditions.

Full text available to download
A review of emotion recognition methods based on keystroke dynamics and mouse movements
Publication
- A. Kołakowska
- Year 2013
The paper describes the approach based on using standard input devices, such as keyboard and mouse, as sources of data for the recognition of users’ emotional states. A number of systems applying this idea have been presented focusing on three categories of research problems, i.e. collecting and labeling training data, extracting features and training classifiers of emotions. Moreover the advantages and examples of combining standard...

Full text to download in external service
Proposal of a mobile medical waste incinerator with automatic waste feeder and heat recovery system
Publication
- Year 2019
The paper presents and discusses the issue of medical waste (including hazardous waste) and its proper management. Inappropriate handling of infectious medical waste directly endangers human health and the environment. Infectious waste must therefore be properly disposed of – one of the most commonly used methods is thermal treatment in incinerators tailored for this purpose. When designing an incinerator unit,...

Full text to download in external service
JOURNAL OF MOLECULAR RECOGNITION

Journals

ISSN: 0952-3499 , eISSN: 1099-1352
Speaker Recognition Using Convolutional Neural Network with Minimal Training Data for Smart Home Solutions
Publication
- M. Wang
- T. Sirlapu
- A. Kwaśniewska
- M. Szankin
- M. Bartscherer
- R. Nicolas
- Year 2018
With the technology advancements in the smart home sector, voice control and automation are key components that can make a real difference in people's lives. The voice recognition technology market continues to evolve rapidly, as almost all smart home devices provide speaker recognition capability today. However, most of them provide cloud-based solutions or use very deep Neural Networks for the speaker recognition task, which are...

Full text to download in external service
A Review of Emotion Recognition Methods Based on Data Acquired via Smartphone Sensors
Publication
- SENSORS - Year 2020
In recent years, emotion recognition algorithms have achieved high efficiency, allowing the development of various affective and affect-aware applications. This advancement has taken place mainly in the environment of personal computers offering the appropriate hardware and sufficient power to process complex data from video, audio, and other channels. However, the increase in computing and communication capabilities of smartphones,...

Full text available to download
Journal of Pattern Recognition Research

Journals

ISSN: 1558-884X
Pattern Recognition and Image Analysis

Journals

ISSN: 1054-6618
Applicability of Emotion Recognition and Induction Methods to Study the Behavior of Programmers
Publication
- M. Wróbel
- Applied Sciences-Basel - Year 2018
Recent studies in the field of software engineering have shown that positive emotions can increase and negative emotions decrease the productivity of programmers. In the field of affective computing, many methods and tools to recognize the emotions of computer users were proposed. However, it has not been verified yet which of them can be used to monitor the emotional states of software developers. The paper describes a study carried...

Full text available to download
Quarterly Journal of Speech

Journals

ISSN: 0033-5630 , eISSN: 1479-5779
SpringerBriefs in Speech Technology

Journals

ISSN: 2191-737X , eISSN: 2191-7388
Audiology and Speech Research

Journals

ISSN: 2635-5019 , eISSN: 2635-5027
Voice and Speech Review

Journals

ISSN: 2326-8263 , eISSN: 2326-8271
Estimation of time-frequency complex phase-based speech attributes using narrow band filter banks
Publication
- K. Abratkiewicz
- K. Czarnecki
- D. Fourer
- F. Auger
- Year 2017
In this paper, we present nonlinear estimators of nonstationary and multicomponent signal attributes (parameters, properties), which are instantaneous frequency, spectral (or group) delay, and chirp-rate (also known as instantaneous frequency slope). We estimate all of these distributions in the time-frequency domain using both finite and infinite impulse response (FIR and IIR) narrow band filters for speech analysis. Then, we present...

Full text available to download
Canadian Journal of Speech-Language Pathology and Audiology

Journals

ISSN: 1913-2018
Recognition of environmentally important ions
Publication
- N. Łukasik
- E. Wagner-Wysiecka
- V. Hubscher-Bruder
- M. Bocheńska
- S. Michel
- Logistyka - Year 2013
From Linear Classifier to Convolutional Neural Network for Hand Pose Recognition
Publication
- P. Rościszewski
- Computer Science - Year 2017
Recently gathered image datasets and the new capabilities of high-performance computing systems have allowed developing new artificial neural network models and training algorithms. Using the new machine learning models, computer vision tasks can be accomplished based on the raw values of image pixels instead of specific features. The principle of operation of deep neural networks resembles more and more what we believe to be happening...

Full text available to download
Deep Learning: A Case Study for Image Recognition Using Transfer Learning
Publication
- S. Erpolat Tasabat
- O. Aydin
- Year 2021
Deep learning (DL) is a rising star of machine learning (ML) and artificial intelligence (AI) domains. Until 2006, many researchers had attempted to build deep neural networks (DNN), but most of them failed. In 2006, it was proven that deep neural networks are one of the most crucial inventions for the 21st century. Nowadays, DNN are being used as a key technology for many different domains: self-driven vehicles, smart cities,...

Full text to download in external service
Influence of modulation detection threshold on speech intelligibility
Publication
- K. Leo
- ACTA PHYSICA POLONICA A - Year 2011
Full text available to download
Transient detection algorithms for speech coding applications
Publication
- G. Szwoch
- M. Kulesza
- A. Czyzewski
- Journal of the Acoustical Society of America - Year 2006
Full text to download in external service
Comprehensive Evaluation of Statistical Speech Waveform Synthesis
Publication
- T. Merritt
- B. Putrycz
- A. Nadolski
- T. Ye
- D. Korzekwa
- W. Dolecki
- T. Drugman
- V. Klimkov
- A. Moinet
- A. Breen... and 3 others
- Year 2018
Full text to download in external service
New generation speech aid for stuttering people
Publication
- P. Odya
- A. Czyżewski
- Year 2008
Modern digital signal processors (DSPs) are small, yet they are able to execute complex algorithms. An additional advantage is the ease of replacing their software and, consequently, of changing the field of application. Using the capabilities of these processors, it has become possible to build miniature hearing and speech prostheses. The paper focuses on issues related to the design and implementation of algorithms...

Full text available to download
New generation speech aid for stuttering people
Publication
- P. Odya
- A. Czyżewski
- Archives of Acoustics - Year 2008
Modern digital signal processors (DSPs) are small, yet they are able to execute complex algorithms. An additional advantage is the ease of replacing their software and, consequently, of changing the field of application. Using the capabilities of these processors, it has become possible to build miniature hearing and speech prostheses. The paper focuses on issues related to the design and implementation of algorithms...

Full text available to download
