Search results for: VISUAL SPEECH RECOGNITION

Automatic audio-visual threat detection

Publication

- Year 2010

The concept, practical realization and application of a system for detection and classification of hazardous situations based on multimodal sound and vision analysis are presented. The device consists of new kind multichannel miniature sound intensity sensors, digital Pan Tilt Zoom and fixed cameras and a bundle of signal processing algorithms. The simultaneous analysis of multimodal signals can significantly improve the accuracy...

Dynamic Semantic Visual Information Management

Publication

J. Szymański
W. Duch

- Year 2010

Dominant Internet search engines use keywords and therefore are not suited for exploration of new domains of knowledge, when the user does not know specific vocabulary. Browsing through articles in a large encyclopedia, each presenting a small fragment of knowledge, it is hard to map the whole domain, see relevant concepts and their relations. In Wikipedia for example some highly relevant articles are not linked with each other....

Full text to download in external service

A methodology of visual modeling language evaluation

Publication

A. Bobkowska

- Year 2005

Metody oceny jakości metod modelowania są istotnym elementem inżynierii języków modelowania wizualnego. W referacie zaproponowano metodę oceny języków modelowania wizualnego na podstawie wymiarów poznawczych. Zaprezentowano metodologiczną dyskusję zastosowania nauk psychologicznych do oceny metod modelowania, metodologię CD-VML, powiązaną z nią metodę CD-VML-UC do oceny przypadków użycia oraz weryfikację metodologii.

A new approach to visual system testing

Publication

- Year 2005

Opisano budowę laboratoryjnego stanowiska prac bawczych nad perymetrią obiektywną. Przedstawiono zasadę działania algorytmu VEPDA oraz wyniki działania VEPDA na danych eksperymentalnych.

Quality Evaluation of Speech Transmission via Two-way BPL-PLC Voice Communication System in an Underground Mine

Publication

P. Falkowski-Gilski
G. Debita

- Archives of Acoustics - Year 2023

In order to design a stable and reliable voice communication system, it is essential to know how many resources are necessary for conveying quality content. These parameters may include objective quality of service (QoS) metrics, such as: available bandwidth, bit error rate (BER), delay, latency as well as subjective quality of experience (QoE) related to user expectations. QoE is expressed as clarity of speech and the ability...

Full text available to download

Examining Government-Citizen Interactions on Twitter using Visual and Sentiment Analysis

Publication

R. Hubert
E. Estevez
A. Maguitman
T. Janowski

- Year 2018

The goal of this paper is to propose a methodology comprising a range of visualization techniques to analyze the interactions between government and citizens on the issues of public concern taking place on Twitter, mainly through the official government or ministry accounts. The methodology addresses: 1) the level of government activity in different countries and sectors; 2) the topics that are addressed through such activities;...

Full text available to download

System for automatic singing voice recognition

Publication

- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2008

W artykule przedstawiono system automatycznego rozpoznawania jakości i typu głosu śpiewaczego. Przedstawiono bazę danych oraz zaimplementowane parametry. Algorytmem decyzyjnym jest algorytm sztucznych sieci neuronowych. Wytrenowany system decyzyjny osiąga skuteczność ok. 90% w obydwu kategoriach rozpoznawania. Dodatkowo wykazano przy pomocy metod statystycznych, że wyniki działania systemu automatycznej oceny jakości technicznej...

Influence of accelerometer signal pre-processing and classification method on human activity recognition

Publication

- Elektronika : konstrukcje, technologie, zastosowania - Year 2010

A study of data pre-processing influence on accelerometer-based human activity recognition algorithms is presented. The frequency band used to filter-out the accelerometer signals and the number of accelerometers involved were considered in terms of their influence on the recognition accuracy. In the test four methods of classification were used: support vector machine, decision trees, neural network, k-nearest neighbor.

Full text to download in external service

Pose classification in the gesture recognition using the linear optical sensor

Publication

- Year 2017

Gesture sensors for mobile devices, which have a capability of distinguishing hand poses, require efficient and accurate classifiers in order to recognize gestures based on the sequences of primitives. Two methods of poses recognition for the optical linear sensor were proposed and validated. The Gaussian distribution fitting and Artificial Neural Network based methods represent two kinds of classification approaches. Three types...

Full text to download in external service

Visual Features for Improving Endoscopic Bleeding Detection Using Convolutional Neural Networks

Publication

- SENSORS - Year 2023

The presented paper investigates the problem of endoscopic bleeding detection in endoscopic videos in the form of a binary image classification task. A set of definitions of high-level visual features of endoscopic bleeding is introduced, which incorporates domain knowledge from the field. The high-level features are coupled with respective feature descriptors, enabling automatic capture of the features using image processing methods....

Full text available to download

Acquisition and indexing of RGB-D recordings for facial expressions and emotion recognition

Publication

M. Szwoch

- Studia Informatica Pomerania - Year 2015

In this paper KinectRecorder comprehensive tool is described which provides for convenient and fast acquisition, indexing and storing of RGB-D video streams from Microsoft Kinect sensor. The application is especially useful as a supporting tool for creation of fully indexed databases of facial expressions and emotions that can be further used for learning and testing of emotion recognition algorithms for affect-aware applications....

Full text to download in external service

On practical application of Shannon theory to character recognition and more

Publication

M. Jurkiewicz

- Year 2014

Let us consider an optical character recognition system, which in particular can be used for identifying objects that were assigned strings of some length. The system is not perfect, for example, it sometimes recognizes wrongly the characters "Y" and "V". What is the largest set of strings of given length for the system under consideration, which can be mutually correctly recognized, and the corresponding objects correctly identified?...

Molecular Recognition in Complexes of TRF Proteins with Telomeric DNA

Publication

- PLOS ONE - Year 2014

Telomeres are specialized nucleoprotein assemblies that protect the ends of linear chromosomes. In humans and many other species, telomeres consist of tandem TTAGGG repeats bound by a protein complex known as shelterin that remodels telomeric DNA into a protective loop structure and regulates telomere homeostasis. Shelterin recognizes telomeric repeats through its two major components known as Telomere Repeat-Binding Factors, TRF1...

Full text available to download

Parameters optimization in medicine supporting image recognition algorithms

Publication

A. Brzeski

- Year 2011

In this paper, a procedure of automatic set up of image recognition algorithms' parameters is proposed, for the purpose of reducing the time needed for algorithms' development. The procedure is presented on two medicine supporting algorithms, performing bleeding detection in endoscopic images. Since the algorithms contain multiple parameters which must be specified, empirical testing is usually required to optimise the algorithm's...

Accelerometer-based Human Activity Recognition and the Impact of the Sample Size

Publication

- Year 2014

The presented study focused on the recognition of eight user activities (e.g. walking, lying, climbing stairs) basing on the measurements from an accelerometer embedded in a mobile device. It is assumed that the device is carried in a specific location of the user’s clothing. Three types of classifiers were tested on different sizes of the samples. The influence of the time window (the duration of a single trial) on selected activities...

Full text to download in external service

Comparison of selected off-the-shelf solutions for emotion recognition based on facial expressions

Publication

- Year 2016

The paper concerns accuracy of emotion recognition from facial expressions. As there are a couple of ready off-the-shelf solutions available in the market today, this study aims at practical evaluation of selected solutions in order to provide some insight into what potential buyers might expect. Two solutions were compared: FaceReader by Noldus and Xpress Engine by QuantumLab. The performed evaluation revealed that the recognition...

Full text to download in external service

Automatic Singing Voice Recognition EmployingNeural Networks and Rough Sets

Publication

- Year 2008

Celem badań jest automatyczne rozpoznawanie głosów śpiewaczych w kategorii rodzaju i jakości technicznej śpiewu. W artykule opisano stworzoną bazę danych głosów, która zawiera próbki głosu śpiewaków profesjonalnych i amatorskich. W dalszej części opisano parametry zdefiniowane w oparciu o zjawiska biomechaniczne w narządzie głosu podczas śpiewania. W oparciu o stworzone macierze parametrów wytrenowano i porównano automatyczne klasyfikatory...

Cross-Lingual Knowledge Distillation via Flow-Based Voice Conversion for Robust Polyglot Text-to-Speech

Publication

D. Piotrowski
R. Korzeniowski
A. Falai
S. Cygert
K. Pokora
G. Tinchev
Z. Zhang
K. Yanagisawa

- Year 2023

In this work, we introduce a framework for cross-lingual speech synthesis, which involves an upstream Voice Conversion (VC) model and a downstream Text-To-Speech (TTS) model. The proposed framework consists of 4 stages. In the first two stages, we use a VC model to convert utterances in the target locale to the voice of the target speaker. In the third stage, the converted data is combined with the linguistic features and durations...

Full text to download in external service

Systematic Literature Review for Emotion Recognition from EEG Signals

Publication

- Year 2022

Researchers have recently become increasingly interested in recognizing emotions from electroencephalogram (EEG) signals and many studies utilizing different approaches have been conducted in this field. For the purposes of this work, we performed a systematic literature review including over 40 articles in order to identify the best set of methods for the emotion recognition problem. Our work collects information about the most...

Full text to download in external service

Systematic Literature Review for Emotion Recognition from EEG Signals

Publication

- Year 2022

Researchers have recently become increasingly interested in recognizing emotions from electroencephalogram (EEG) signals and many studies utilizing different approaches have been conducted in this field. For the purposes of this work, we performed a systematic literature review including over 40 articles in order to identify the best set of methods for the emotion recognition problem. Our work collects information about the most...

Full text available to download

Automatic recognition of therapy progress among children with autism

Publication

A. Kołakowska
A. Landowska
A. Anzulewicz
K. Sobota

- Scientific Reports - Year 2017

The article presents a research study on recognizing therapy progress among children with autism spectrum disorder. The progress is recognized on the basis of behavioural data gathered via five specially designed tablet games. Over 180 distinct parameters are calculated on the basis of raw data delivered via the game flow and tablet sensors - i.e. touch screen, accelerometer and gyroscope. The results obtained confirm the possibility...

Full text available to download

Visual Attention Distribution Based Assessment of User's Skill in Electronic Medical Record Navigation

Publication

T. Kocejko
J. Wtorek
K. Goforth
K. Moidu

- Journal of Medical Imaging and Health Informatics - Year 2015

Currently, the most precise way of reflecting the skills level is an expert’s subjective assessment. In this paper we investigate the possibility of the use of eye tracking data for scalar quantitative and objective assessment of medical staff competency in EMR system navigation. According to the experiment conducted by Yarbus the observation process of particular features is associated with thinking. Moreover, eye tracking is...

Full text to download in external service

Local Texture Pattern Selection for Efficient Face Recognition and Tracking

Publication

- Advances in Intelligent Systems and Computing - Year 2015

This paper describes the research aimed at finding the optimal configuration of the face recognition algorithm based on local texture descriptors (binary and ternary patterns). Since the identification module was supposed to be a part of the face tracking system developed for interactive wearable computer, proper feature selection, allowing for real-time operation, became particularly important. Our experiments showed that it is...

Full text to download in external service

Feasibility Study for Food Intake Tasks Recognition Based on Smart Glasses

Publication

M. Biallas
A. Andrushevich
R. Kistler
A. Klapproth
K. Czuszyński
A. Bujnowski

- Journal of Medical Imaging and Health Informatics - Year 2015

In this exploratory study 13 adult test subjects have performed different food intake tasks while wearing a three axis accelerometer mounted at a temple of glasses. Two different algorithms for task recognition have been applied and compared. The retrospective data processing leads to better task recognition results when the frequency range of 50 Hz to 100 Hz is analysed within accelerometer signal recordings. A straightforward...

Full text to download in external service

Fuzzy rule-based dynamic gesture recognition employing camera & multimedia projector

Publication

- Year 2010

In the paper the system based on camera and multimedia projector enabling a user to control computer applications by dynamic hand gestures is presented. The main objective is to present the gesture recognition methodology which bases on representing hand movement trajectory by motion vectors analyzed using fuzzy rule-based inference. The approach was engineered in the system developed with J2SE and C++ / OpenCV technology. OpenCV...

Full text to download in external service

Integration of speech enhancement and coding techniques

Publication

M. Kuropatwinski
D. Leckschat
K. Kroschel
A. Czyzewski
M. Kuropatwiński

- Year 1999

Full text to download in external service

Novel approaches to wideband speech coding

Publication

- Year 2008

Dwie metoda kodowania szerokopasmowego mowy zostały zaprezentowane. W pierwszej metodzie wykorzystano algorytm kompresji i ekspansji czasowej sygnału mowy, pozwalający na kodowanie szerokopasmowe sygnału mowy z wykorzystaniem ustandaryzowanych kodeków. Metoda ta jest przewidziana do zastosowania w adaptacyjnych algorytmach kodowania mowy. Drugie z proponowanych rozwiazan dotyczy nowej metody estymacji obwiedni widma sygnalu mowy...

Full text to download in external service

Broadband interference in speech reinforcement systems

Publication

- Year 2008

Artykuł podejmuje niedoceniany problem wpływu liczby i rozkładu głośników w systemach nagłośnienia, na jakość przekazu głosowego, czyli na zrozumiałość mowy w audytoriach. Superpozycji przesuniętych w czasie szerokopasmowych sygnałów o tym samym kształcie i lekko różnych wielkościach, które docierają do słuchacza z licznych spójnych źródeł, towarzyszy zjawisko interferencji prowadzące do głębokiej modyfikacji odbieranych sygnałów...

A system for multitask noisy speech enhancement.

Publication

A. Czyżewski
A. Kaczmarek
J. Kotus
A. Pawlik
A. Rypulak
P. Żwan

- Year 2004

W artykule przedstawiono ogolną charakterystyke opracowanego systemu rejestracji i rekonstrukcji mowy. Artykuł zawiera opis składników systemu, ktory jest oprogramowaniem zawierającym zaawansowane narzędzia służące poprawie zrozumiałości mowy. Zaimplementowane narzędzia systemu umożliwiają wyszukiwanie nagrań dźwiękowych i ich obróbkę przy pomocy zaimplementowanych pluginów. W artykule przedstawione wykorzystane w systemie algorytmy...

Multitask Noisy Speech Enhancement System

Publication

- Year 2005

W referacie opisano Wielozadaniowy System Poprawy Jakości Sygnału Mowy. Jest to wyspecjalizowany pakiet oprogramowania przeznaczony do rejestrowania sygnału mowy i do poprawy jego jakości oraz zrozumiałości mowy, przy użyciu zaawansowanych procedur cyfrowego przetwarzania sygnału. Pakiet oprogramowania składa się z programów: Rejestrator, Przeglądarka oraz Rekonstruktor. Oprogramowanie to może być użyte w przypadkach, gdy zrozumiałość...

Visual Traffic Noise Monitoring in Urban Areas

Publication

- International Journal of Multimedia and Ubiquitous Engineering - Year 2007

The paper presents an advanced system for railway and road traffic noise monitoring in metropolitan areas. This system is a functional part of a more complex solution designed for environmental monitoring in cities utilizing analyses of sound, vision and air pollution, based on a ubiquitous computing approach. The system consists of many autonomous, universal measuring units and a multimedia server, which gathers, processes and...

Full text to download in external service

Modeling pragmatics for visual modeling language evaluation

Publication

A. Bobkowska

- Year 2005

Podczas oceny użyteczności języków modelowania wizualnego istnieje potrzeba uwzględnienia ich pragmatyki. Języki modelowania wizualnego mogą być stosowane w różnym kontekście, co powoduje różnice w wymaganiach, które są im stawiane. Jawny opis kontekstu użycia ułatwia precyzyjną ocenę. Pragmatyka składa się ze zbioru profili, które opisują konkretne konteksty użycia. W referacie podjęto próbę zastosowania modeli zadań do opisu...

Difference in Perceived Speech Signal Quality Assessment Among Monolingual and Bilingual Teenage Students

Publication

P. Falkowski-Gilski

- Year 2021

The user perceived quality is a mixture of factors, including the background of an individual. The process of auditory perception is discussed in a wide variety of fields, ranging from engineering to medicine. Many studies examine the difference between musicians and non-musicians. Since musical training develops musical hearing and other various auditory capabilities, similar enhancements should be observable in case of bilingual...

Full text to download in external service

Contextual Knowledge to Enhance Workplace Hazard Recognition and Interpretation in a Cognitive Vision Platform

Publication

C. De
C. Sanin
E. Szczerbicki

- Year 2018

The combination of vision and sensor data together with the resulting necessity for formal representations builds a central component of an autonomous Cyber Physical System for detection and tracking of laborers in workplaces environments. This system must be adaptable and perceive the environment as automatically as possible, performing in a variety of plants and scenes without the necessity of recoding the application for each...

Full text available to download

Theory of recognition in a historical perspective. Axel Honneth's Anerkennung: Eine europäische Ideengeschichte

Publication

A. Karalus

- Archiwum Historii Filozofii i Myśli Społecznej - Year 2019

The article discusses Honneth excursion into the realm of the history of ideas. This time Honneth decides to laser it on the notion of "recognition" in three different cultural areas and three different traditions: French, English, and German. The article discusses Honneth's persepctive and attempts at finding the common thread that would link three aforementioned traditions.

Full text available to download

A review of emotion recognition methods based on keystroke dynamics and mouse movements

Publication

A. Kołakowska

- Year 2013

The paper describes the approach based on using standard input devices, such as keyboard and mouse, as sources of data for the recognition of users’ emotional states. A number of systems applying this idea have been presented focusing on three categories of research problems, i.e. collecting and labeling training data, extracting features and training classifiers of emotions. Moreover the advantages and examples of combining standard...

Full text to download in external service

Visual and auditory attention stimulator for assisting pedagogical therapy . Stymulator uwagi wzrokowej i słuchowej do wspomagania terapii pedagogicznej

Publication

- Year 2015

Visual and auditory attention stimulator provides a system developed in order to improve reading skills using simultaneous presentation of text in its visual form and in transformed auditory form accompanied by related movie material. The described research employed 40 children at the age of 8 13 years having difficulties in learning of reading, who were diagnosed as having developmental dyslexia. It was shown that application...

Speaker Recognition Using Convolutional Neural Network with Minimal Training Data for Smart Home Solutions

Publication

M. Wang
T. Sirlapu
A. Kwaśniewska
M. Szankin
M. Bartscherer
R. Nicolas

- Year 2018

With the technology advancements in smart home sector, voice control and automation are key components that can make a real difference in people's lives. The voice recognition technology market continues to involve rapidly as almost all smart home devices are providing speaker recognition capability today. However, most of them provide cloud-based solutions or use very deep Neural Networks for speaker recognition task, which are...

Full text to download in external service

Graph Representation Integrating Signals for Emotion Recognition and Analysis

Publication

- SENSORS - Year 2021

Data reusability is an important feature of current research, just in every field of science. Modern research in Affective Computing, often rely on datasets containing experiments-originated data such as biosignals, video clips, or images. Moreover, conducting experiments with a vast number of participants to build datasets for Affective Computing research is time-consuming and expensive. Therefore, it is extremely important to...

Full text available to download

Examining Classifiers Applied to Static Hand Gesture Recognition in Novel Sound Mixing System

Publication

- Advances in Intelligent Systems and Computing - Year 2013

The main objective of the chapter is to present the methodology and results of examining various classifiers (Nearest Neighbor-like algorithm with non-nested generalization (NNge), Naive Bayes, C4.5 (J48), Random Tree, Random Forests, Artificial Neural Networks (Multilayer Perceptron), Support Vector Machine (SVM) used for static gesture recognition. A problem of effective gesture recognition is outlined in the context of the system...

Full text to download in external service

A Review of Emotion Recognition Methods Based on Data Acquired via Smartphone Sensors

Publication

- SENSORS - Year 2020

In recent years, emotion recognition algorithms have achieved high efficiency, allowing the development of various affective and affect-aware applications. This advancement has taken place mainly in the environment of personal computers offering the appropriate hardware and sufficient power to process complex data from video, audio, and other channels. However, the increase in computing and communication capabilities of smartphones,...

Full text available to download

New Aspects of Virtual Sound Source Localization Research—Impact of Visual Angle and 3-D Video Content on Sound Perception

Publication

- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2013

The influence of image on virtual sound source localization, called the “image proximity effect” or the “ventriloquism effect”, is a well known phenomenon. This paper focuses on other aspects related to this effect, namely the impact of the visual angle of the presented object and 3D video content on sound perception. The research conducted confirmed that the visual angle of the presented object determines the image proximity effect...

Full text available to download

Applicability of Emotion Recognition and Induction Methods to Study the Behavior of Programmers

Publication

M. Wróbel

- Applied Sciences-Basel - Year 2018

Recent studies in the field of software engineering have shown that positive emotions can increase and negative emotions decrease the productivity of programmers. In the field of affective computing, many methods and tools to recognize the emotions of computer users were proposed. However, it has not been verified yet which of them can be used to monitor the emotional states of software developers. The paper describes a study carried...

Full text available to download

Estimation of time-frequency complex phase-based speech attributes using narrow band filter banks

Publication

K. Abratkiewicz
K. Czarnecki
D. Fourer
F. Auger

- Year 2017

In this paper, we present nonlinear estimators of nonstationary and multicomponent signal attributes (parameters, properties) which are instantaneous frequency, spectral (or group) delay, and chirp-rate (also known as instantaneous frequency slope). We estimate all of these distributions in the time-frequency domain using both finite and infinite impulse response (FIR and IIR) narrow band filers for speech analysis. Then, we present...

Full text available to download

Recognition of environmentally important ions

Publication

N. Łukasik
E. Wagner-Wysiecka
V. Hubscher-Bruder
M. Bocheńska
S. Michel

- Logistyka - Year 2013

..

High frequency oscillations are associated with cognitive processing in human recognition memory

Publication

M. T. Kucewicz
J. Cymbalnik
J. Matsumoto
B. H. Brinkmann
M. R. Bower
V. Vasoli
V. Sulc
F. Meyer
W. Marsh
S. M. Stead
G. A. Worrell

- Brain: A Journal of Neurology - Year 2014

High frequency oscillations are associated with normal brain function, but also increasingly recognized as potential biomarkers of the epileptogenic brain. Their role in human cognition has been predominantly studied in classical gamma frequencies (30-100 Hz), which reflect neuronal network coordination involved in attention, learning and memory. Invasive brain recordings in animals and humans demonstrate that physiological oscillations...

Full text available to download

Influence of Thermal Imagery Resolution on Accuracy of Deep Learning based Face Recognition

Publication

- Year 2019

Human-system interactions frequently require a retrieval of the key context information about the user and the environment. Image processing techniques have been widely applied in this area, providing details about recognized objects, people and actions. Considering remote diagnostics solutions, e.g. non-contact vital signs estimation and smart home monitoring systems that utilize person’s identity, security is a very important factor....

Full text available to download

Deep Learning: A Case Study for Image Recognition Using Transfer Learning

Publication

S. Erpolat Tasabat
O. Aydin

- Year 2021

Deep learning (DL) is a rising star of machine learning (ML) and artificial intelligence (AI) domains. Until 2006, many researchers had attempted to build deep neural networks (DNN), but most of them failed. In 2006, it was proven that deep neural networks are one of the most crucial inventions for the 21st century. Nowadays, DNN are being used as a key technology for many different domains: self-driven vehicles, smart cities,...

Full text to download in external service

Non-Visual Aspects of the Space Performing the Soundscape of the City

Publication

- Year 2016

The relationship between art and architecture remains a challenging issue today, first and foremost in the domain of architecture, and particularly in art and design schools. To address this issue, the Winter School International Research and Education (WIRE) programme was run in the Gazi University Department of Architecture between 2013 and 2014, with the main theme being “Art and Architecture”, and the sub-themes determined...

Full text to download in external service

Hidden Markov Models for Visual Processing of Marketing Leaflets

Publication

J. Grobelny
R. Michalski

- Year 2021

Full text to download in external service

Search

Filters

Catalog

Category

Year

Options

Search results for: VISUAL SPEECH RECOGNITION