Search results for: MULTIMODAL EMOTION RECOGNITION

Search results for: MULTIMODAL EMOTION RECOGNITION

results on page:
embed this view on your website

Filters

total: 936

clear all filters disabled

Michał Lech dr inż.

People

Michał Lech was born in Gdynia in 1983. In 2007 he graduated from the faculty of Electronics, Telecommunications and Informatics of Gdansk University of Technology. In June 2013, he received his Ph.D. degree. The subject of the dissertation was: “A Method and Algorithms for Controlling the Sound Mixing Processes by Hand Gestures Recognized Using Computer Vision”. The main focus of the thesis was the bias of audio perception caused...
Influence of accelerometer signal pre-processing and classification method on human activity recognition
Publication
- Elektronika : konstrukcje, technologie, zastosowania - Year 2010
A study of data pre-processing influence on accelerometer-based human activity recognition algorithms is presented. The frequency band used to filter-out the accelerometer signals and the number of accelerometers involved were considered in terms of their influence on the recognition accuracy. In the test four methods of classification were used: support vector machine, decision trees, neural network, k-nearest neighbor.

Full text to download in external service
Pose classification in the gesture recognition using the linear optical sensor
Publication
- Year 2017
Gesture sensors for mobile devices, which have a capability of distinguishing hand poses, require efficient and accurate classifiers in order to recognize gestures based on the sequences of primitives. Two methods of poses recognition for the optical linear sensor were proposed and validated. The Gaussian distribution fitting and Artificial Neural Network based methods represent two kinds of classification approaches. Three types...

Full text to download in external service
Attention-Based Deep Learning System for Classification of Breast Lesions—Multimodal, Weakly Supervised Approach
Publication
- M. Bobowicz
- M. Rygusik
- J. Buler
- R. Buler
- M. Ferlin
- A. Kwasigroch
- E. Szurowska
- M. Grochowski
- Cancers - Year 2023
Breast cancer is the most frequent female cancer, with a considerable disease burden and high mortality. Early diagnosis with screening mammography might be facilitated by automated systems supported by deep learning artificial intelligence. We propose a model based on a weakly supervised Clustering-constrained Attention Multiple Instance Learning (CLAM) classifier able to train under data scarcity effectively. We used a private...

Full text available to download
On practical application of Shannon theory to character recognition and more
Publication
- M. Jurkiewicz
- Year 2014
Let us consider an optical character recognition system, which in particular can be used for identifying objects that were assigned strings of some length. The system is not perfect, for example, it sometimes recognizes wrongly the characters "Y" and "V". What is the largest set of strings of given length for the system under consideration, which can be mutually correctly recognized, and the corresponding objects correctly identified?...
Molecular Recognition in Complexes of TRF Proteins with Telomeric DNA
Publication
- M. Wieczór
- A. Tobiszewski
- P. Wityk
- B. Tomiczek
- J. Czub
- PLOS ONE - Year 2014
Telomeres are specialized nucleoprotein assemblies that protect the ends of linear chromosomes. In humans and many other species, telomeres consist of tandem TTAGGG repeats bound by a protein complex known as shelterin that remodels telomeric DNA into a protective loop structure and regulates telomere homeostasis. Shelterin recognizes telomeric repeats through its two major components known as Telomere Repeat-Binding Factors, TRF1...

Full text available to download
Analysis of 2D Feature Spaces for Deep Learning-based Speech Recognition
Publication
- G. Korvel
- P. Treigys
- G. Tamulevicus
- J. Bernataviciene
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2018
convolutional neural network (CNN) which is a class of deep, feed-forward artificial neural network. We decided to analyze audio signal feature maps, namely spectrograms, linear and Mel-scale cepstrograms, and chromagrams. The choice was made upon the fact that CNN performs well in 2D data-oriented processing contexts. Feature maps were employed in the Lithuanian word recognition task. The spectral analysis led to the highest word...
Parameters optimization in medicine supporting image recognition algorithms
Publication
- A. Brzeski
- Year 2011
In this paper, a procedure of automatic set up of image recognition algorithms' parameters is proposed, for the purpose of reducing the time needed for algorithms' development. The procedure is presented on two medicine supporting algorithms, performing bleeding detection in endoscopic images. Since the algorithms contain multiple parameters which must be specified, empirical testing is usually required to optimise the algorithm's...
Scent emitting multimodal computer interface for learning enhancement
Publication
- Year 2010
Komputerowy interfejs aromatyczny stanowi ważne uzupełnienie procesu stymulacji polisensorycznej. Stymulacja ta odgrywa kluczową rolę w terapii i kształceniu dzieci z zaburzeniami rozwoju (np. w przypadku autyzmu czy ADHD). Opracowany interfejs może stać się elementem wyposażenia tzw. sal doświadczania świata, ale może być także stosowany niezależnie stanowiąc znaczące wzbogacenie komputerowych programów edukacyjnych. Dzięki możliwości...
Accelerometer-based Human Activity Recognition and the Impact of the Sample Size
Publication
- Year 2014
The presented study focused on the recognition of eight user activities (e.g. walking, lying, climbing stairs) basing on the measurements from an accelerometer embedded in a mobile device. It is assumed that the device is carried in a specific location of the user’s clothing. Three types of classifiers were tested on different sizes of the samples. The influence of the time window (the duration of a single trial) on selected activities...

Full text to download in external service
Automatic Singing Voice Recognition EmployingNeural Networks and Rough Sets
Publication
- Year 2008
Celem badań jest automatyczne rozpoznawanie głosów śpiewaczych w kategorii rodzaju i jakości technicznej śpiewu. W artykule opisano stworzoną bazę danych głosów, która zawiera próbki głosu śpiewaków profesjonalnych i amatorskich. W dalszej części opisano parametry zdefiniowane w oparciu o zjawiska biomechaniczne w narządzie głosu podczas śpiewania. W oparciu o stworzone macierze parametrów wytrenowano i porównano automatyczne klasyfikatory...
Automatic recognition of therapy progress among children with autism
Publication
- A. Kołakowska
- A. Landowska
- A. Anzulewicz
- K. Sobota
- Scientific Reports - Year 2017
The article presents a research study on recognizing therapy progress among children with autism spectrum disorder. The progress is recognized on the basis of behavioural data gathered via five specially designed tablet games. Over 180 distinct parameters are calculated on the basis of raw data delivered via the game flow and tablet sensors - i.e. touch screen, accelerometer and gyroscope. The results obtained confirm the possibility...

Full text available to download
A comparative study of English viseme recognition methods and algorithm
Publication
- D. Jachimski
- A. Czyżewski
- MULTIMEDIA TOOLS AND APPLICATIONS - Year 2018
An elementary visual unit – the viseme is concerned in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative research on their effectiveness. In the course of the study an optimal feature vector...

Full text available to download
A comparative study of English viseme recognition methods and algorithms
Publication
- MULTIMEDIA TOOLS AND APPLICATIONS - Year 2018
An elementary visual unit – the viseme is concerned in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative research on their effectiveness. In the course of the study an optimal feature vector construction...

Full text available to download
Local Texture Pattern Selection for Efficient Face Recognition and Tracking
Publication
- M. Smiatacz
- J. Rumiński
- Advances in Intelligent Systems and Computing - Year 2015
This paper describes the research aimed at finding the optimal configuration of the face recognition algorithm based on local texture descriptors (binary and ternary patterns). Since the identification module was supposed to be a part of the face tracking system developed for interactive wearable computer, proper feature selection, allowing for real-time operation, became particularly important. Our experiments showed that it is...

Full text to download in external service
A Concept of Automatic Film Color Grading Based on Music Recognition and Evoked Emotions
Publication
- D. Weber
- B. Kostek
- Year 2019
The article presents the aspects of the final selection of the color of shots in film production based on the psychology of color. First of all, the elements of color processing, contrast, saturation or white balance in the film shots were presented and the definition of color grading was given. In the second part of the article the analysis of film music was conducted in the context of stimulating appropriate emotions while watching...
Feasibility Study for Food Intake Tasks Recognition Based on Smart Glasses
Publication
- M. Biallas
- A. Andrushevich
- R. Kistler
- A. Klapproth
- K. Czuszyński
- A. Bujnowski
- Journal of Medical Imaging and Health Informatics - Year 2015
In this exploratory study 13 adult test subjects have performed different food intake tasks while wearing a three axis accelerometer mounted at a temple of glasses. Two different algorithms for task recognition have been applied and compared. The retrospective data processing leads to better task recognition results when the frequency range of 50 Hz to 100 Hz is analysed within accelerometer signal recordings. A straightforward...

Full text to download in external service
Fuzzy rule-based dynamic gesture recognition employing camera & multimedia projector
Publication
- M. Lech
- B. Kostek
- Year 2010
In the paper the system based on camera and multimedia projector enabling a user to control computer applications by dynamic hand gestures is presented. The main objective is to present the gesture recognition methodology which bases on representing hand movement trajectory by motion vectors analyzed using fuzzy rule-based inference. The approach was engineered in the system developed with J2SE and C++ / OpenCV technology. OpenCV...

Full text to download in external service
A survey of automatic speech recognition deep models performance for Polish medical terms
Publication
- Year 2023
Among the numerous applications of speech-to-text technology is the support of documentation created by medical personnel. There are many available speech recognition systems for doctors. Their effectiveness in languages such as Polish should be verified. In connection with our project in this field, we decided to check how well the popular speech recognition systems work, employing models trained for the general Polish language....

Full text to download in external service
Hybrid of Neural Networks and Hidden Markov Models as a modern approach to speech recognition systems
Publication
- P. Sokólski
- T. A. Rutkowski
- Pomiary Automatyka Robotyka - Year 2013
The aim of this paper is to present a hybrid algorithm that combines the advantages ofartificial neural networks and hidden Markov models in speech recognition for control purpos-es. The scope of the paper includes review of currently used solutions, description and analysis of implementation of selected artificial neural network (NN) structures and hidden Markov mod-els (HMM). The main part of the paper consists of a description...

Full text available to download
Contextual Knowledge to Enhance Workplace Hazard Recognition and Interpretation in a Cognitive Vision Platform
Publication
- C. De
- C. Sanin
- E. Szczerbicki
- Year 2018
The combination of vision and sensor data together with the resulting necessity for formal representations builds a central component of an autonomous Cyber Physical System for detection and tracking of laborers in workplaces environments. This system must be adaptable and perceive the environment as automatically as possible, performing in a variety of plants and scenes without the necessity of recoding the application for each...

Full text available to download
JOURNAL OF MOLECULAR RECOGNITION

Journals

ISSN: 0952-3499 , eISSN: 1099-1352
Theory of recognition in a historical perspective. Axel Honneth's Anerkennung: Eine europäische Ideengeschichte
Publication
- A. Karalus
- Archiwum Historii Filozofii i Myśli Społecznej - Year 2019
The article discusses Honneth excursion into the realm of the history of ideas. This time Honneth decides to laser it on the notion of "recognition" in three different cultural areas and three different traditions: French, English, and German. The article discusses Honneth's persepctive and attempts at finding the common thread that would link three aforementioned traditions.

Full text available to download
Multimodal coupling matrix for an array of rectangular slots on conducting cylinder
Publication
- R. Lech
- A. Kusiek
- Year 2012
Artykuł prezentuje metodę wyznaczania sprzężeń wzajemnych pomiędzy aperturami promieniującymi położonymi na przewodzącym cylindrze. Pokazano sposób wyznaczania wielorodzajowej macierzy rozproszenia reprezentującej sprzężenia własne i wzajemne w badanej strukturze.
Testing the Accuracy of the Modified ICP Algorithm with Multimodal Weighting Factors
Publication
- Ł. Marchel
- C. Specht
- M. Specht
- ENERGIES - Year 2020
Full text to download in external service
Speaker Recognition Using Convolutional Neural Network with Minimal Training Data for Smart Home Solutions
Publication
- M. Wang
- T. Sirlapu
- A. Kwaśniewska
- M. Szankin
- M. Bartscherer
- R. Nicolas
- Year 2018
With the technology advancements in smart home sector, voice control and automation are key components that can make a real difference in people's lives. The voice recognition technology market continues to involve rapidly as almost all smart home devices are providing speaker recognition capability today. However, most of them provide cloud-based solutions or use very deep Neural Networks for speaker recognition task, which are...

Full text to download in external service
Journal of Pattern Recognition Research

Journals

ISSN: 1558-884X
Pattern Recognition and Image Analysis

Journals

ISSN: 1054-6618
Examining Classifiers Applied to Static Hand Gesture Recognition in Novel Sound Mixing System
Publication
- Advances in Intelligent Systems and Computing - Year 2013
The main objective of the chapter is to present the methodology and results of examining various classifiers (Nearest Neighbor-like algorithm with non-nested generalization (NNge), Naive Bayes, C4.5 (J48), Random Tree, Random Forests, Artificial Neural Networks (Multilayer Perceptron), Support Vector Machine (SVM) used for static gesture recognition. A problem of effective gesture recognition is outlined in the context of the system...

Full text to download in external service
Recognition of environmentally important ions
Publication
- N. Łukasik
- E. Wagner-Wysiecka
- V. Hubscher-Bruder
- M. Bocheńska
- S. Michel
- Logistyka - Year 2013
..
Comparison of Acoustic and Visual Voice Activity Detection for Noisy Speech Recognition
Publication
- Year 2016
The problem of accurate differentiating between the speaker utterance and the noise parts in a speech signal is considered. The influence of utilizing a voice activity detection in speech signals on the accuracy of the automatic speech recognition (ASR) system is presented. The examined methods of voice activity detection are based on acoustic and visual modalities. The problem of detecting the voice activity in clean and noisy...
Influence of Thermal Imagery Resolution on Accuracy of Deep Learning based Face Recognition
Publication
- Year 2019
Human-system interactions frequently require a retrieval of the key context information about the user and the environment. Image processing techniques have been widely applied in this area, providing details about recognized objects, people and actions. Considering remote diagnostics solutions, e.g. non-contact vital signs estimation and smart home monitoring systems that utilize person’s identity, security is a very important factor....

Full text available to download
EXAMINING INFLUENCE OF VIDEO FRAMERATE AND AUDIO/VIDEO SYNCHRONIZATION ON AUDIO-VISUAL SPEECH RECOGNITION ACCURACY
Publication
- Year 2014
The problem of video framerate and audio/video synchronization in audio-visual speech recognition is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The...
Deep Learning: A Case Study for Image Recognition Using Transfer Learning
Publication
- S. Erpolat Tasabat
- O. Aydin
- Year 2021
Deep learning (DL) is a rising star of machine learning (ML) and artificial intelligence (AI) domains. Until 2006, many researchers had attempted to build deep neural networks (DNN), but most of them failed. In 2006, it was proven that deep neural networks are one of the most crucial inventions for the 21st century. Nowadays, DNN are being used as a key technology for many different domains: self-driven vehicles, smart cities,...

Full text to download in external service
From Linear Classifier to Convolutional Neural Network for Hand Pose Recognition
Publication
- P. Rościszewski
- Computer Science - Year 2017
Recently gathered image datasets and the new capabilities of high-performance computing systems have allowed developing new artificial neural network models and training algorithms. Using the new machine learning models, computer vision tasks can be accomplished based on the raw values of image pixels instead of specific features. The principle of operation of deep neural networks resembles more and more what we believe to be happening...

Full text available to download
Automatic singing quality recognition employing artificial neural networks
Publication
- P. Żwan
- Archives of Acoustics - Year 2008
Celem artykułu jest udowodnienie możliwości automatycznej oceny jakości technicznej głosów śpiewaczych. Pokrótce zaprezentowano w nim stworzoną bazę danych głosów śpiewaczych oraz zaimplementowane parametry. Przy pomocy sztucznych sieci neuronowych zaprojektowano system decyzyjny, który oceniono w pięciostopniowej skali jakość techniczną głosu. Przy pomocy metod statystycznych udowodniono, że wyniki generowane przez ten system...

Full text available to download
Processing of acoustical data in a multimodal bank operating room surveillance system
Publication
- J. Kotus
- K. Łopatka
- A. Czyżewski
- G. Bogdanis
- MULTIMEDIA TOOLS AND APPLICATIONS - Year 2016
An automatic surveillance system capable of detecting, classifying and localizing acoustic events in a bank operating room is presented. Algorithms for detection and classification of abnormal acoustic events, such as screams or gunshots are introduced. Two types of detectors are employed to detect impulsive sounds and vocal activity. A Support Vector Machine (SVM) classifier is used to discern between the different classes of...

Full text available to download
Smart Pen - new multimodal computer control tool for graphomotorical therapy
Publication
- Intelligent Decision Technologies-Netherlands - Year 2010
W sytuacji, gdy około 15% populacji uczniów wykazuje cechy dyslektyczne, koniecznością staje się wyposażenie szkół w efektywne narzędzia do diagnozy i terapii tego rodzaju zaburzeń. Dzięki wykorzystaniu tabletu i specjalnie skonstruowanego długopisu wyposażonego w czujniki nacisku uzyskano możliwość monitorowania wielu parametrów, które do tej pory były dla terapeutów całkowicie niedostępne (np. pomiar nacisku na podłoże czy ścisku...

Full text available to download
Multimodal Approach For Polysensory Stimulation And Diagnosis Of Subjects With Severe Communication Disorders
Publication
- A. Czyżewski
- B. Kostek
- A. Kurowski
- P. Szczuko
- M. Lech
- P. Odya
- A. Kwiatkowska
- Year 2017
is evaluated on 9 patients, data analysis methods are described, and experiments of correlating Glasgow Coma Scale with extracted features describing subjects performance in therapeutic exercises exploiting EEG and eyetracker are presented. Performance metrics are proposed, and k-means clusters used to define concepts for mental states related to EEG and eyetracking activity. Finally, it is shown that the strongest correlations...

Full text available to download
Real-time working gas recognition system based on the array of semiconductor gas sensors and portable computer Raspberry PI
Publication
- Year 2013
The gas-analyzing systems based on the array of partially selective gas sensors and pattern-recognition techniques are potentially fast and low-cost alternative for other devices, like gas analysers. They give the possibility of recognition the type and the concentration of measured volatile compounds in their working environment. In this work we present the implementation of gas recognition system, in which the signals from an...
A Novel IoT-Perceptive Human Activity Recognition (HAR) Approach Using Multi-Head Convolutional Attention
Publication
- H. Zhang
- Z. Xiao
- J. Wang
- F. Li
- E. Szczerbicki
- IEEE Internet of Things Journal - Year 2019
Together with fast advancement of the Internet of Things (IoT), smart healthcare applications and systems are equipped with increasingly more wearable sensors and mobile devices. These sensors are used not only to collect data, but also, and more importantly, to assist in daily activity tracking and analyzing of their users. Various human activity recognition (HAR) approaches are used to enhance such tracking. Most of the existing...

Full text available to download
Unraveling the Interplay between DNA and Proteins: A Computational Exploration of Sequence and Structure-Specific Recognition Mechanisms
Publication
- K. A. Hossain
- Year 2023
My PhD dissertation focused on DNA-protein interactions and the recognition of specific DNA sequences and structures. I discovered that acidic amino acid residues (Asp/Glu) play a crucial role by exhibiting a preference for cytosine. Their contribution to binding affinity depends on nearby cytosines, balancing electrostatic repulsion with specific interactions. Acidic residues act as negative selectors, discouraging non-cytosine...

Full text available to download
Improving Traffic Light Recognition Methods using Shifting Time-Windows
Publication
- A. Blokus
- H. Krawczyk
- Year 2018
We propose a novel method of improving algorithms recognizing traffic lights in video sequences. Our focus is on algorithms for applications which notify the driver of a light in sight. Many existing methods process images in the recording separately. Our method bases on the observation that real-life videos depict underlying continuous processes. We named our method FSA (Frame Sequence Analyzed). It is applicable for any underlying...

Full text to download in external service
Borderline Personality Disorder and Emotion Dysregulation

Journals

ISSN: 2051-6673
International Journal of Work Organisation and Emotion

Journals

ISSN: 1740-8938
Optimizing Medical Personnel Speech Recognition Models Using Speech Synthesis and Reinforcement Learning
Publication
- A. Czyżewski
- Journal of the Acoustical Society of America - Year 2023
Text-to-Speech synthesis (TTS) can be used to generate training data for building Automatic Speech Recognition models (ASR). Access to medical speech data is because it is sensitive data that is difficult to obtain for privacy reasons; TTS can help expand the data set. Speech can be synthesized by mimicking different accents, dialects, and speaking styles that may occur in a medical language. Reinforcement Learning (RL), in the...

Full text available to download
Adaptive system for recognition of sounds indicating threats to security of people and property employing parallel processing of audio data streams
Publication
- K. Łopatka
- Year 2015
A system for recognition of threatening acoustic events employing parallel processing on a supercomputing cluster is featured. The methods for detection, parameterization and classication of acoustic events are introduced. The recognition engine is based onthreshold-based detection with adaptive threshold and Support Vector Machine classifcation. Spectral, temporal and mel-frequency descriptors are used as signal features. The...
Karolina Zielińska-Dąbkowska dr inż. arch.

People

Department of Urban Architecture and Waterscapes

Karolina M. Zielinska-Dabkowska, Ph.D., Eng. Arch., M. Arch., is an Assistant Professor at the Faculty of Architecture of Gdańsk University of Technology (GUT). In 2002, she completed her studies of Architecture and Urban Planning at Gdańsk University of Technology (Gdańsk Tech) and in 2004, Architectural Engineering at the University of Applied Sciences and Arts (HAWK) in Hildesheim, Germany. After graduation, she worked for several...
EXAMINING INFLUENCE OF VIDEO FRAMERATE AND AUDIO/VIDEO SYNCHRONIZATION ON AUDIO-VISUAL SPEECH RECOGNITION ACCURACY
Publication
- Year 2014
The problem of video framerate and audio/video synchronization in audio-visual speech recogni-tion is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The...
Viruses, cancer and non-self recognition
Publication
- M. Padariya
- U. Kalathiya
- S. Mikac
- K. Dziubek
- M. Tovar
- E. Sroka
- R. Fahraeus
- A. Sznarkowska
- Open Biology - Year 2021
Full text to download in external service

Search

Filters

Catalog

Search results for: MULTIMODAL EMOTION RECOGNITION

Michał Lech dr inż.

Karolina Zielińska-Dąbkowska dr inż. arch.