Search results for: MULTIMODAL EMOTION RECOGNITION

Search results for: MULTIMODAL EMOTION RECOGNITION

results on page:
embed this view on your website

Filters

total: 946

clear all filters disabled

International Conference on Pattern Recognition Applications and Methods

Conferences
IEEE Automatic Speech Recognition and Understanding Workshop

Conferences
International Conference on Artificial Intelligence and Pattern Recognition

Conferences
IEEE International Conference on Document Analysis and Recognition

Conferences
Digital fingerprinting for color images based on the quaternion encryption scheme
Publication
- PATTERN RECOGNITION LETTERS - Year 2014
In this paper we present a new quaternion-based encryption technique for color images. In the proposed encryption method, images are written as quaternions and are rotated in a three-dimensional space around another quaternion, which is an encryption key. The encryption process uses the cipher block chaining (CBC) mode. Further, this paper shows that our encryption algorithm enables digital fingerprinting as an additional feature....

Full text to download in external service
Simultaneous determination of thermodynamic and kinetic parameters of aminopolycarbonate complexes of cobalt(II) and nickel(II) based on isothermal titration calorimetry data
Publication
- A. Tesmar
- D. Wyrzykowski
- E. Muñoz
- B. Pilarski
- J. Pranczk
- D. Jacewicz
- L. Chmurzyński
- JOURNAL OF MOLECULAR RECOGNITION - Year 2017
Full text to download in external service
Zinc(II) complexation by some biologically relevant pH buffers
Publication
- D. Wyrzykowski
- A. Tesmar
- D. Jacewicz
- J. Pranczk
- L. Chmurzyński
- JOURNAL OF MOLECULAR RECOGNITION - Year 2014
Full text to download in external service
Bridging challenges of clinical decision support systems with a semantic approach. A case study on breast cancer
Publication
- E. Szczerbicki
- C. Sanin
- C. Toro
- PATTERN RECOGNITION LETTERS - Year 2013
The integration of Clinical Decision Support Systems (CDSS) in nowadays clinical environments has not been fully achieved yet. Although numerous approaches and technologies have been proposed since 1960, there are still open gaps that need to be bridged. In this work we present advances from the established state of the art, overcoming some of the most notorious reported difficulties in: (i) automating CDSS, (ii) clinical workflow...

Full text to download in external service
Engineering Candida albicans glucosamine-6-phosphate synthase for efficient enzyme purification
Publication
- J. Czarnecka
- K. Kwiatkowska
- I. Gabriel
- M. Wojciechowski
- S. Milewski
- JOURNAL OF MOLECULAR RECOGNITION - Year 2012
Rationally designed muteins of Candida albicans glucosamine-6-phosphate synthase, an enzyme known as a promising target for antifungal chemotherapy, were constructed, overexpressed in Escherichia coli and purified to near homogeneity. To facilitate and to optimize the purification of the enzyme, three recombinant versionscontaining internal oligoHis fragments were constructed: (i) by substituting residues 343 - 348...

Full text to download in external service
Smart Approach for Glioma Segmentation in Magnetic Resonance Imaging using Modified Convolutional Network Architecture (U-NET)
Publication
- N. Sohail
- S. M. Anwar
- F. Majeed
- E. Szczerbicki
- CYBERNETICS AND SYSTEMS - Year 2021
Segmentation of a brain tumor from magnetic resonance multimodal images is a challenging task in the field of medical imaging. The vast diversity in potential target regions, appearance and multifarious intensity threshold levels of various tumor types are few of the major factors that affect segmentation results. An accurate diagnosis and its treatment demand strict delineation of the tumor affected tissues. Herein, we focus on...

Full text available to download
Time-domain prosodic modifications for text-to-speech synthesizer
Publication
- J. Łopatka
- P. Suchomski
- A. Czyżewski
- Year 2010
An application of prosodic speech processing algorithms to Text-To-Speech synthesis is presented. Prosodic modifications that improve the naturalness of the synthesized signal are discussed. The applied method is based on the TD-PSOLA algorithm. The developed Text-To-Speech Synthesizer is used in applications employing multimodal computer interfaces.
ISCA Tutorial and Research Workshop Automatic Speech Recognition

Conferences
International Conference on Advances in Pattern Recognition and Digital Techniques

Conferences
IEEE International Conference on Automatic Face and Gesture Recognition

Conferences
Aktywny system RFID do lokalizacji i identyfikacji obiektów w wielomodalnej infrastrukturze bezpieczeństwa
Publication
- J. Cichowski
- A. Czyżewski
- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Year 2014
Przedstawiono prace koncepcyjne, badawcze oraz implementacyjne skoncentrowane na praktycznej realizacji systemu detekcji obiektów z wykorzystaniem kamer wizyjnych i identyfikacji radiowej. Zaproponowano rozbudowę wielomodalnego teleinformatycznego systemu bezpieczeństwa o warstwę identyfikacji radiowej obiektów. Omówiono założenia zaprojektowanego systemu oraz opracowaną warstwę sprzętową. Zaproponowano i przedyskutowano praktyczne...
Emotional distress, burnout and sense of safety during the COVID-19 pandemic in teachers after the reopening of schools
Publication
- D. Pankowski
- E. Pisula
- K. Wytrychiewicz-Pankowska
- I. Nowakowska
- A. Banasiak
- M. Markiewicz
- A. Jórczak-Kopeć
- Advances in Cognitive Psychology - Year 2023
The COVID-19 pandemic is having a significant impact on people's psychological well-being and mental health. This study aimed to identify factors linked to emotional distress, burnout and sense of safety in teachers related to the reopening of Polish schools after lockdown, remote work, and the holiday period between March and August 2020. A total of 1,286 teachers from different educational institutions participated in the...
Pracujący w czasie rzeczywistym system detekcji gazów wykorzystujący przenośny komputer Raspberry PI oraz matrycę półprzewodnikowych czujników gazu
Publication
- Elektronika : konstrukcje, technologie, zastosowania - Year 2014
The gas-analyzing systems based on the array of partially selective gas sensors and pattern-recognition techniques are potentially fast and lowcost alternative for other devices, like gas‑analysers. They give the possibility of recognition the type and the concentration of measured volatile compounds in their working environment. In this work we present the implementation of gas recognition system, in which the signals from an...

Full text to download in external service
System Weryfikacji Autentyczności Podpisu Odręcznego
Publication
- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Year 2016
W referacie przedstawiono system statycznej i dynamicznej weryfikacji autentyczności podpisu odręcznego, składanego piórem biometrycznym, wyposażonym w 2 akcelerometry, 2 żyroskopy i 3 czujniki ścisku, na rezystancyjnej powierzchni dotykowej, łączącym się bezprzewodowo z urządzeniami komputerowymi. We wstępie przedstawiono architekturę sieciową wielomodalnego systemu biometrii. Przedstawiono warstwę sprzętową systemu weryfikacji...
Interactions with recognized patients using smart glasses
Publication
- J. Rumiński
- M. Smiatacz
- A. Bujnowski
- A. Andrushevich
- M. Biallas
- R. Kistler
- Year 2015
Recently, different smart glasses solutions have been proposed on the market. The rapid development of this wearable technology has led to several research projects related to applications of smart glasses in healthcare. In this paper we propose a general architecture of the system enabling data integration for the recognized person. In the proposed system smart glasses integrates data obtained for the recognized patient from health...

Full text to download in external service
Material for Automatic Phonetic Transcription of Speech Recorded in Various Conditions
Publication
- Year 2016
Automatic speech recognition (ASR) is under constant development, especially in cases when speech is casually produced or it is acquired in various environment conditions, or in the presence of background noise. Phonetic transcription is an important step in the process of full speech recognition and is discussed in the presented work as the main focus in this process. ASR is widely implemented in mobile devices technology, but...

Full text to download in external service
Vocalic Segments Classification Assisted by Mouth Motion Capture
Publication
- Year 2018
Visual features convey important information for automatic speech recognition (ASR), especially in noisy environment. The purpose of this study is to evaluate to what extent visual data (i.e. lip reading) can enhance recognition accuracy in the multi-modal approach. For that purpose motion capture markers were placed on speakers' faces to obtain lips tracking data during speaking. Different parameterizations strategies were tested...

Full text to download in external service
Gesture-based computer control system
Publication
- Elektronika : konstrukcje, technologie, zastosowania - Year 2010
In the paper a system for controlling computer applications by hand gestures is presented. First, selected methods used for gesture recognition are described. The system hardware and a way of controlling a computer by gestures are described. The architecture of the software along with hand gesture recognition methods and algorithms used are presented. Examples of basic and complex gestures recognized by the system are given.

Full text to download in external service
Automatic Classification of Polish Sign Language Words
Publication
- T. Dziubich
- J. Szymański
- Przegląd Elektrotechniczny - Year 2014
In the article we present the approach to automatic recognition of hand gestures using eGlove device. We present the research results of the system for detection and classification of static and dynamic words of Polish language. The results indicate the usage of eGlove allows to gain good recognition quality that additionally can be improved using additional data sources such as RGB cameras.

Full text available to download
Comparative study on the effectiveness of various types of road traffic intensity detectors
Publication
- A. Czyżewski
- A. Sroczynski
- T. Smialkowski
- P. Hoffmann
- S. Cygert
- G. Szwoch
- J. Kotus
- D. Weber
- M. Szczodrak
- D. Koszewski... and 2 others
- Year 2019
Vehicle detection and speed measurements are crucial tasks in traffic monitoring systems. In this work, we focus on several types of electronic sensors, operating on different physical principles in order to compare their effectiveness in real traffic conditions. Commercial solutions are based on road tubes, microwave sensors, LiDARs, and video cameras. Distributed traffic monitoring systems require a high number of monitoring...

Full text to download in external service
Automatic music set organizatio based on mood of music / Automatyczna organizacja bazy muzycznej na podstawie nastroju muzyki
Publication
- M. Piotrowska
- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Year 2017
This work is focused on an approach based on the emotional content of music and its automatic recognition. A vector of features describing emotional content of music was proposed. Additionally, a graphical model dedicated to the subjective evaluation of mood of music was created. A series of listening tests was carried out, and results were compared with automatic mood recognition employing SOM (Self Organizing Maps) and ANN (Artificial...

Full text to download in external service
The role of time perspectives and impulsivity dimensions in coping styles
Publication
- I. Nowakowska
- PSYCHOLOGICAL REPORTS - Year 2023
Both time perspectives and impulsivity dimensions are groups of traits that are connected to self-control abilities and might be important for coping styles. However, to date, no study has systematically investigated their utility in predicting coping styles with regard to their multidimensional nature. The current study was correlational and exploratory, aiming to discover what amount of variance in each of the three coping...

Full text to download in external service
Handwritten signature verification system employing wireless biometric pen
Publication
- M. Lech
- A. Czyżewski
- Year 2017
The handwritten signature verification system being a part of the developed multimodal biometric banking stand is presented. The hardware component of the solution is described with a focus on the signature acquisition and on verification procedures. The signature is acquired employing an accelerometer and a gyroscope built-in the biometric pen plus pressure sensors for the assessment of the proper pen grip and then the signature...
Employing a biofeedback method based on hemispheric synchronization in effective learning
Publication
- Year 2012
In this paper an approach to build a brain computer-based hemispheric synchronization system is presented. The concept utilizes the wireless EEG signal registration and acquisition as well as advanced pre-processing methods. The influence of various filtration techniques of EOG artifacts on brain state recognition is examined. The emphasis is put on brain state recognition using band pass filtration for separation of individual...

Full text to download in external service
Krzysztof Goczyła prof. dr hab. inż.

People

Department of Software Engineering

Krzysztof Goczyła, full professor of Gdańsk University of Technology, computer scientist, a specialist in software engineering, knowledge engineering and databases. He graduated from the Faculty of Electronics Technical University of Gdansk in 1976 with a degree in electronic engineering, specializing in automation. Since then he has been working at Gdańsk University of Technology. In 1982 he obtained a doctorate in computer science...
Endoscopic Video Classification with the Consideration of Temporal Patterns
Publication
- Year 2012
The article describes a novel approach to automatic recognition and classification of diseases in endoscopic videos. Current directions of research in this field are discussed. Most presented methods focus on processing single frames and do not take into consideration the temporal relationship between continuous classifications. Existing approaches that consider the temporal structure of an incoming frame sequence are focused on...
Jan Daciuk dr hab. inż.

People

Faculty of Electronics, Telecommunications and Informatics, Department of Intelligent Interactive Systems

Jan Daciuk received his M.Sc. from the Faculty of Electronics of Gdansk University of Technology in 1986, and his Ph.D. from the Faculty of Electronics, Telecommunications and Informatics of Gdańsk University of Technology in 1999. He has been working at the Faculty from 1988. His research interests include finite state methods in natural language processing and computational linguistics including speech processing. Dr. Daciuk...
Wykorzystanie sztucznych sieci neuronowych do wykrywania i rozpoznawania tablic rejestracyjnych na zdjęciach pojazdów
Publication
- M. Huzarek
- T. A. Rutkowski
- Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki Politechniki Gdańskiej - Year 2015
W artykule przedstawiono koncepcję algorytmu wykrywania i rozpoznawania tablic rejestracyjnych (AWiRTR) na obrazach cyfrowych pojazdów. Detekcja i lokalizacja tablic rejestracyjnych oraz wyodrębnienie z obrazu tablicy rejestracyjnej poszczególnych znaków odbywa się z wykorzystaniem podstawowych technik przetwarzania obrazu (przekształcenia morfologiczne, wykrywanie krawędzi) jak i podstawowych danych statystycznych obiektów wykrytych...

Full text available to download
A video monitoring system using ontology-driven identification of threats
Publication
- P. Kaczmarek
- P. Zielonka
- Year 2009
In this paper, we present a video monitoring systemthat leverages image recognition and ontological reasoningabout threats. In the solution, an image processing subsystemuses video recording of a monitored area and recognizesknown concepts in scenes. Then, a reasoning subsystem uses anontological description of security conditions and informationfrom image recognition to check if a violation of a conditionhas occurred. If a threat...

Full text to download in external service
Interpretation and modeling of emotions in the management of autonomous robots using a control paradigm based on a scheduling variable
Publication
- ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE - Year 2020
The paper presents a technical introduction to psychological theories of emotions. It highlights a usable ideaimplemented in a number of recently developed computational systems of emotions, and the hypothesis thatemotion can play the role of a scheduling variable in controlling autonomous robots. In the main part ofthis study, we outline our own computational system of emotion – xEmotion – designed as a key structuralelement in...

Full text available to download
Endoscopic Videos Deinterlacing and On-Screen Text and Light Flashes Removal and Its Influence on Image Analysis Algorithms' Efficiency
Publication
- International Journal of Image Processing and Visual Communication - Year 2013
In this article, deinterlacing and removing on- screen text and light flashes methods on endoscopic video images are discussed. The research is intended to improve disease recognition algorithms' performance. In the article, four configurations of deinterlacing methods and another four configurations of text and flashes removal methods are described and examined. The efficiency of endoscopic video analysis algorithms is measured...

Full text to download in external service
An electronic nose for quantitative determination of gas concentrations
Publication
- Year 2016
The practical application of human nose for fragrance recognition is severely limited by the fact that our sense of smell is subjective and gets tired easily. Consequen tly, there is considerable need for an instrument that can be a substitution of the human sense of smell. Electronic nose devices from the mid 1980s are used in growing number of applications. They comprise an array of several electrochemical gas sensors...

Full text to download in external service
Enhanced voice user interface employing spatial filtration of signals from acoustic vector sensor
Publication
- Year 2015
Spatial filtration of sound is introduced to enhance speech recognition accuracy in noisy conditions. An acoustic vector sensor (AVS) is employed. The signals from the AVS probe are processed in order to attenuate the surrounding noise. As a result the signal to noise ratio is increased. An experiment is featured in which speech signals are disturbed by babble noise. The signals before and after spatial filtration are processed...

Full text to download in external service
Multimedia industrial and medical applications supported by machine learning
Publication
- A. Czyżewski
- Year 2023
This article outlines a keynote paper presented at the Intelligent DecisionTechnologies conference providing a part of the KES Multi-theme Conference “Smart Digital Futures” organized in Rome on June 14–16, 2023. It briefly discusses projects related to traffic control using developed intelligent traffic signs and diagnosing the health of wind turbine mechanisms and multimodal biometric authentication for banking branches to provide...

Full text to download in external service
Video Semantic Analysis Framework based on Run-time Production Rules - Towards Cognitive Vision
Publication
- E. Szczerbicki
- C. Toro
- C. Sanin
- JOURNAL OF UNIVERSAL COMPUTER SCIENCE - Year 2015
This paper proposes a service-oriented architecture for video analysis which separates object detection from event recognition. Our aim is to introduce new tools to be considered in the pathway towards Cognitive Vision as a support for classical Computer Vision techniques that have been broadly used by the scientific community. In the article, we particularly focus in solving some of the reported scalability issues found in current...

Full text available to download
KORPUS MOWY ANGIELSKIEJ DO CELÓW MULTIMODALNEGO AUTOMATYCZNEGO ROZPOZNAWANIA MOWY
Publication
- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Year 2016
W referacie zaprezentowano audiowizualny korpus mowy zawierający 31 godzin nagrań mowy w języku angielskim. Korpus dedykowany jest do celów automatycznego audiowizualnego rozpoznawania mowy. Korpus zawiera nagrania wideo pochodzące z szybkoklatkowej kamery stereowizyjnej oraz dźwięk zarejestrowany przez matrycę mikrofonową i mikrofon komputera przenośnego. Dzięki uwzględnieniu nagrań zarejestrowanych w warunkach szumowych korpus...
MODALITY corpus - SPEAKER 35 - COMMANDS C1
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 21 - SEQUENCE S6
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 21 - COMMANDS C5
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 21 - SEQUENCE S4
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 10 - SEQUENCE S1
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 01 - SEQUENCE S2
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 39 - COMMANDS C1
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 01 - SEQUENCE S3
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 01 - COMMANDS C3
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 21 - SEQUENCE S2
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...

Search

Filters

Catalog

Search results for: MULTIMODAL EMOTION RECOGNITION

Krzysztof Goczyła prof. dr hab. inż.

Jan Daciuk dr hab. inż.