Search results for: audiovisual speech recognition

total: 1052


Theory of recognition in a historical perspective. Axel Honneth's Anerkennung: Eine europäische Ideengeschichte
Publication
- A. Karalus
- Archiwum Historii Filozofii i Myśli Społecznej - Year 2019
The article discusses Honneth's excursion into the realm of the history of ideas. This time Honneth focuses on the notion of "recognition" in three different cultural areas and three different traditions: French, English, and German. The article discusses Honneth's perspective and attempts to find the common thread that would link the three aforementioned traditions.

Full text available to download
A review of emotion recognition methods based on keystroke dynamics and mouse movements
Publication
- A. Kołakowska
- Year 2013
The paper describes the approach based on using standard input devices, such as keyboard and mouse, as sources of data for the recognition of users’ emotional states. A number of systems applying this idea have been presented focusing on three categories of research problems, i.e. collecting and labeling training data, extracting features and training classifiers of emotions. Moreover the advantages and examples of combining standard...

Full text to download in external service
Speaker Recognition Using Convolutional Neural Network with Minimal Training Data for Smart Home Solutions
Publication
- M. Wang
- T. Sirlapu
- A. Kwaśniewska
- M. Szankin
- M. Bartscherer
- R. Nicolas
- Year 2018
With the technology advancements in the smart home sector, voice control and automation are key components that can make a real difference in people's lives. The voice recognition technology market continues to evolve rapidly as almost all smart home devices are providing speaker recognition capability today. However, most of them provide cloud-based solutions or use very deep Neural Networks for the speaker recognition task, which are...

Full text to download in external service
Graph Representation Integrating Signals for Emotion Recognition and Analysis
Publication
- SENSORS - Year 2021
Data reusability is an important feature of current research in just about every field of science. Modern research in Affective Computing often relies on datasets containing experiments-originated data such as biosignals, video clips, or images. Moreover, conducting experiments with a vast number of participants to build datasets for Affective Computing research is time-consuming and expensive. Therefore, it is extremely important to...

Full text available to download
Examining Classifiers Applied to Static Hand Gesture Recognition in Novel Sound Mixing System
Publication
- Advances in Intelligent Systems and Computing - Year 2013
The main objective of the chapter is to present the methodology and results of examining various classifiers (Nearest Neighbor-like algorithm with non-nested generalization (NNge), Naive Bayes, C4.5 (J48), Random Tree, Random Forests, Artificial Neural Networks (Multilayer Perceptron), Support Vector Machine (SVM)) used for static gesture recognition. A problem of effective gesture recognition is outlined in the context of the system...

Full text to download in external service
Journal of Pattern Recognition Research

Journals

ISSN: 1558-884X
Pattern Recognition and Image Analysis

Journals

ISSN: 1054-6618
A Review of Emotion Recognition Methods Based on Data Acquired via Smartphone Sensors
Publication
- SENSORS - Year 2020
In recent years, emotion recognition algorithms have achieved high efficiency, allowing the development of various affective and affect-aware applications. This advancement has taken place mainly in the environment of personal computers offering the appropriate hardware and sufficient power to process complex data from video, audio, and other channels. However, the increase in computing and communication capabilities of smartphones,...

Full text available to download
Applicability of Emotion Recognition and Induction Methods to Study the Behavior of Programmers
Publication
- M. Wróbel
- Applied Sciences-Basel - Year 2018
Recent studies in the field of software engineering have shown that positive emotions can increase and negative emotions decrease the productivity of programmers. In the field of affective computing, many methods and tools to recognize the emotions of computer users were proposed. However, it has not been verified yet which of them can be used to monitor the emotional states of software developers. The paper describes a study carried...

Full text available to download
Estimation of time-frequency complex phase-based speech attributes using narrow band filter banks
Publication
- K. Abratkiewicz
- K. Czarnecki
- D. Fourer
- F. Auger
- Year 2017
In this paper, we present nonlinear estimators of nonstationary and multicomponent signal attributes (parameters, properties) which are instantaneous frequency, spectral (or group) delay, and chirp-rate (also known as instantaneous frequency slope). We estimate all of these distributions in the time-frequency domain using both finite and infinite impulse response (FIR and IIR) narrow band filters for speech analysis. Then, we present...

Full text available to download
Recognition of environmentally important ions
Publication
- N. Łukasik
- E. Wagner-Wysiecka
- V. Hubscher-Bruder
- M. Bocheńska
- S. Michel
- Logistyka - Year 2013
Influence of Thermal Imagery Resolution on Accuracy of Deep Learning based Face Recognition
Publication
- Year 2019
Human-system interactions frequently require the retrieval of key context information about the user and the environment. Image processing techniques have been widely applied in this area, providing details about recognized objects, people and actions. Considering remote diagnostics solutions, e.g. non-contact vital signs estimation and smart home monitoring systems that utilize a person’s identity, security is a very important factor....

Full text available to download
From Linear Classifier to Convolutional Neural Network for Hand Pose Recognition
Publication
- P. Rościszewski
- Computer Science - Year 2017
Recently gathered image datasets and the new capabilities of high-performance computing systems have allowed developing new artificial neural network models and training algorithms. Using the new machine learning models, computer vision tasks can be accomplished based on the raw values of image pixels instead of specific features. The principle of operation of deep neural networks resembles more and more what we believe to be happening...

Full text available to download
Deep Learning: A Case Study for Image Recognition Using Transfer Learning
Publication
- S. Erpolat Tasabat
- O. Aydin
- Year 2021
Deep learning (DL) is a rising star of machine learning (ML) and artificial intelligence (AI) domains. Until 2006, many researchers had attempted to build deep neural networks (DNN), but most of them failed. In 2006, it was proven that deep neural networks are one of the most crucial inventions for the 21st century. Nowadays, DNN are being used as a key technology for many different domains: self-driven vehicles, smart cities,...

Full text to download in external service
Automatic singing quality recognition employing artificial neural networks
Publication
- P. Żwan
- Archives of Acoustics - Year 2008
The aim of the article is to demonstrate that the technical quality of singing voices can be assessed automatically. It briefly presents the singing-voice database that was created and the parameters that were implemented. Using artificial neural networks, a decision system was designed that rates the technical quality of a voice on a five-point scale. Statistical methods were used to show that the results generated by this system...

Full text available to download
Influence of modulation detection threshold on speech intelligibility
Publication
- K. Leo
- ACTA PHYSICA POLONICA A - Year 2011
Full text available to download
New generation speech aid for stuttering people
Publication
- P. Odya
- A. Czyżewski
- Year 2008
Modern Digital Signal Processors (DSPs) are small, yet they are able to run complex algorithms. An additional advantage is the ease of replacing their software, and hence of changing their field of application. Exploiting the capabilities of these processors, it has become possible to build miniature hearing and speech aids. The paper focuses on issues related to the design and implementation of algorithms...

Full text available to download
New generation speech aid for stuttering people
Publication
- P. Odya
- A. Czyżewski
- Archives of Acoustics - Year 2008
Modern Digital Signal Processors (DSPs) are small, yet they are able to run complex algorithms. An additional advantage is the ease of replacing their software, and hence of changing their field of application. Exploiting the capabilities of these processors, it has become possible to build miniature hearing and speech aids. The paper focuses on issues related to the design and implementation of algorithms...

Full text available to download
Transient detection algorithms for speech coding applications
Publication
- G. Szwoch
- M. Kulesza
- A. Czyzewski
- Journal of the Acoustical Society of America - Year 2006
Full text to download in external service
Comprehensive Evaluation of Statistical Speech Waveform Synthesis
Publication
- T. Merritt
- B. Putrycz
- A. Nadolski
- T. Ye
- D. Korzekwa
- W. Dolecki
- T. Drugman
- V. Klimkov
- A. Moinet
- A. Breen... and 3 others
- Year 2018
Full text to download in external service
A Novel IoT-Perceptive Human Activity Recognition (HAR) Approach Using Multi-Head Convolutional Attention
Publication
- H. Zhang
- Z. Xiao
- J. Wang
- F. Li
- E. Szczerbicki
- IEEE Internet of Things Journal - Year 2019
Together with fast advancement of the Internet of Things (IoT), smart healthcare applications and systems are equipped with increasingly more wearable sensors and mobile devices. These sensors are used not only to collect data, but also, and more importantly, to assist in daily activity tracking and analyzing of their users. Various human activity recognition (HAR) approaches are used to enhance such tracking. Most of the existing...

Full text available to download
Real-time working gas recognition system based on the array of semiconductor gas sensors and portable computer Raspberry PI
Publication
- Year 2013
Gas-analyzing systems based on an array of partially selective gas sensors and pattern-recognition techniques are a potentially fast and low-cost alternative to other devices, like gas analysers. They make it possible to recognize the type and the concentration of measured volatile compounds in their working environment. In this work we present the implementation of a gas recognition system, in which the signals from an...
Improving Traffic Light Recognition Methods using Shifting Time-Windows
Publication
- A. Blokus
- H. Krawczyk
- Year 2018
We propose a novel method of improving algorithms that recognize traffic lights in video sequences. Our focus is on algorithms for applications which notify the driver of a light in sight. Many existing methods process images in the recording separately. Our method is based on the observation that real-life videos depict underlying continuous processes. We named our method FSA (Frame Sequence Analyzed). It is applicable to any underlying...

Full text to download in external service
Unraveling the Interplay between DNA and Proteins: A Computational Exploration of Sequence and Structure-Specific Recognition Mechanisms
Publication
- K. A. Hossain
- Year 2023
My PhD dissertation focused on DNA-protein interactions and the recognition of specific DNA sequences and structures. I discovered that acidic amino acid residues (Asp/Glu) play a crucial role by exhibiting a preference for cytosine. Their contribution to binding affinity depends on nearby cytosines, balancing electrostatic repulsion with specific interactions. Acidic residues act as negative selectors, discouraging non-cytosine...

Full text available to download
Adaptive system for recognition of sounds indicating threats to security of people and property employing parallel processing of audio data streams
Publication
- K. Łopatka
- Year 2015
A system for recognition of threatening acoustic events employing parallel processing on a supercomputing cluster is featured. The methods for detection, parameterization and classification of acoustic events are introduced. The recognition engine is based on threshold-based detection with adaptive threshold and Support Vector Machine classification. Spectral, temporal and mel-frequency descriptors are used as signal features. The...
Karolina Zielińska-Dąbkowska dr inż. arch.

People

Department of Urban Architecture and Waterscapes

Karolina M. Zielinska-Dabkowska, Ph.D., Eng. Arch., M. Arch., is an Assistant Professor at the Faculty of Architecture of Gdańsk University of Technology (GUT). In 2002, she completed her studies of Architecture and Urban Planning at Gdańsk University of Technology (Gdańsk Tech) and in 2004, Architectural Engineering at the University of Applied Sciences and Arts (HAWK) in Hildesheim, Germany. After graduation, she worked for several...
Lucyna Nyka prof. dr hab. inż. arch.

People

Gdańsk University of Technology, Department of Urban Architecture and Waterscapes

Lucyna Nyka (Ph.D., D.Sc., Prof.) is a Professor at the Faculty of Architecture, Gdańsk University of Technology. She was Vice-Dean for Research in 2008-2016 and has been Dean of the Faculty of Architecture since 2016. Her research interests focus on issues concerning water-related architecture and urban landscapes. She is the author and co-author of many projects focused on urban renewal. She was an author of the EU-funded (Audiovisual...
Balance recognition on the basis of EEG measurement.
Publication
- Annals of Computer Science and Information Systems - Year 2016
Although electroencephalography (EEG) is not typically used for verifying the sense of balance, it can be used for analysing cortical signals responsible for this phenomenon. Simple balance tasks can be proposed as a good indicator of whether the sense of balance is acting more or less actively. This article presents preliminary results on the potential of using EEG for balance sensing....

Full text available to download
Face Recognition: Shape versus Texture
Publication
- M. Smiatacz
- Year 2015
This paper describes experiments applying well-known texture feature extraction techniques (Local Binary Patterns and Gabor filtering) to the problem of automatic face verification. Results of the tests show that a simple image normalization strategy based on eye center detection and a regular grid of fiducial points outperforms the more complicated approach, employing active models that are able to...

Full text to download in external service
Role of cholesterol in substrate recognition by γ-secretase
Publication
- Scientific Reports - Year 2021
γ-Secretase is an enzyme known to cleave multiple substrates within their transmembrane domains, with the amyloid precursor protein of Alzheimer’s Disease among the most prominent examples. The activity of γ-secretase strictly depends on the membrane cholesterol content, yet the mechanistic role of cholesterol in the substrate binding and cleavage remains unclear. In this work, we used all-atom molecular dynamics simulations to examine...

Full text available to download
Viruses, cancer and non-self recognition
Publication
- M. Padariya
- U. Kalathiya
- S. Mikac
- K. Dziubek
- M. Tovar
- E. Sroka
- R. Fahraeus
- A. Sznarkowska
- Open Biology - Year 2021
Full text to download in external service
Detection of dialogue in movie soundtrack for speech intelligibility enhancement
Publication
- K. Łopatka
- Year 2014
A method for detecting dialogue in a 5.1 movie soundtrack based on interchannel spectral disparity is presented. The front channel signals (left, right, center) are analyzed in the frequency domain. The selected partials in the center channel signal, which yield high disparity with left and right channels, are detected as dialogue. Subsequently, the dialogue frequency components are boosted to achieve increased dialogue intelligibility....

Full text to download in external service
Advanced speech archiving and restoration system for aviation applications
Publication
- A. Czyżewski
- J. Kotus
- A. Kaczmarek
- A. Rypulak
- A. Pawlik
- Year 2005
The paper presents the Speech Recording and Reconstruction System developed for aviation purposes. The system enables simultaneous recording, archiving, and intelligibility enhancement of speech signals coming from many different radio communication channels. The main purpose of the system is to record and reconstruct spoken messages exchanged by radio between the aircraft pilot and the flight control station, which is extremely important in...
Application of hybrid signals processors to speech and hearing aids
Publication
- P. Odya
- A. Czyżewski
- Year 2005
Thanks to progress in Digital Signal Processor (DSP) technology, it has become possible to build miniature hearing and speech aids. Despite their small size, these processors are able to execute complex algorithms. An additional advantage is the ease of changing their software, and hence of changing their field of application. The work focuses on issues related to the design and implementation of algorithms applicable...
Evaluation and Irony in Text in the Light of Speech Act Theory
Publication
- K. Kukowicz-Zarska
- Forum Filologiczne Ateneum - Year 2020
Full text to download in external service
Investigations of speech signal parameters with regard to articulation influences
Publication
- A. Kaczmarek
- Zeszyty Naukowe Wydziału ETI Politechniki Gdańskiej. Technologie Informacyjne - Year 2008
The work addresses the parameterization of the speech signal in the context of biometric feature extraction. The analyzed parameters are cepstral parameters (linear cepstrum and mel-cepstrum, i.e. MFCC), linear prediction (LPC) parameters, spectral moments, and the F0 parameter. The analysis was carried out in short, fixed-length signal segments with heavy overlap, so-called "implicit segmentation". This made it possible to observe...
System of speech signal processing and visualisation for linguistic purposes
Publication
- K. Wojan
- Archives of Acoustics - Year 2005
Digital analysis of ethnic speech – extraction of information code
Publication
- K. Wojan
- Archives of Acoustics - Year 2003
On the EM algorithm for the estimation of speech AR parameters in noise
Publication
- M. Kuropatwinski
- B. Kleijn
- M. Kuropatwiński
- Year 2014
Full text to download in external service
New approach to localization of clicks in archive speech signals.
Publication
- M. Niedźwiecki
- A. Sobociński
- Year 2004
The problem of localizing impulsive disturbances in archive speech signals is presented. It is shown that detection based on a two-band autoregressive model combined with bidirectional processing yields a significant improvement over existing disturbance localization methods.
Automatic recognition of males and females among web browser users based on behavioural patterns of peripherals usage
Publication
- A. Kołakowska
- A. Landowska
- P. Jarmolkowicz
- M. Jarmolkowicz
- K. Sobota
- Internet Research - Year 2016
Purpose: The purpose of this paper is to answer the question whether it is possible to recognise the gender of a web browser user on the basis of keystroke dynamics and mouse movements. Design/methodology/approach: An experiment was organised in order to track mouse and keyboard usage using a special web browser plug-in. After collecting the data, a number of parameters describing the users’ keystrokes, mouse movements and clicks...

Full text to download in external service
The Influence of Selecting Regions from Endoscopic Video Frames on The Efficiency of Large Bowel Disease Recognition Algorithms
Publication
- Year 2012
The article presents our research in the field of the automatic diagnosis of large intestine diseases on endoscopic video. It focuses on the methods of selecting regions of interest from endoscopic video frames for further analysis by specialized disease recognition algorithms. Four methods of selecting regions of interest have been discussed: a. trivial, b. with the deletion of characteristic, endoscope specific additions to the...
International Journal of Speech Technology

Journals

ISSN: 1381-2416 , eISSN: 1572-8110
Journal of Monolingual and Bilingual Speech

Journals

ISSN: 2631-8407 , eISSN: 2631-8415
International Journal of Applied Pattern Recognition

Journals

ISSN: 2049-887X , eISSN: 2049-8888
World Research Journal of Pattern Recognition

Journals

ISSN: 2278-8557
International Journal on Document Analysis and Recognition

Journals

ISSN: 1433-2833 , eISSN: 1433-2825
Determination of toxic gases based on the responses of a single electrocatalytic sensor and pattern recognition techniques
Publication
- MEASUREMENT SCIENCE & TECHNOLOGY - Year 2014
A response from an electrocatalytic gas sensor contains fingerprint information about the type of gas and its concentration. As a result, a single gas sensor can be used for the determination of different gases. However, information about the type of gas and its concentration is hidden in the unique shape of the current–voltage response and it is quite difficult to explore. One of the ways to get precise information about the measured...

Full text to download in external service
1D convolutional context-aware architectures for acoustic sensing and recognition of passing vehicle type
Publication
- Year 2020
The paper proposes a network architecture that may be employed for sensing and recognizing the type of a vehicle on the basis of audio recordings made in the proximity of a road. The analyzed road traffic consists of both passenger cars and heavier vehicles. Excerpts from recordings that do not contain the sounds of passing vehicles are also taken into account and marked as containing silence....
Real-time speech stretching for supporting hearing impaired schoolchildren
Publication
- A. Kupryjanow
- A. Czyżewski
- Elektronika : konstrukcje, technologie, zastosowania - Year 2010
A study of time scale modification algorithms applied to support hearing impaired schoolchildren is presented. A variety of algorithms are considered, namely: overlap-and-add, two variations of synchronous overlap-and-add, and the phase vocoder. Their effectiveness as well as real-time processing capabilities are examined.

Full text to download in external service
