Wyniki wyszukiwania dla: EMOTION RECOGNITION, DATASET, VIDEO ANNOTATION

Wyniki wyszukiwania dla: EMOTION RECOGNITION, DATASET, VIDEO ANNOTATION

wyników na stronę:
osadź ten widok na swojej stronie

Filtry

wszystkich: 338

wyczyść wszystkie filtry niedostępne

Towards New Mappings between Emotion Representation Models
Publikacja
- A. Landowska
- Applied Sciences-Basel - Rok 2018
There are several models for representing emotions in affect-aware applications, and available emotion recognition solutions provide results using diverse emotion models. As multimodal fusion is beneficial in terms of both accuracy and reliability of emotion recognition, one of the challenges is mapping between the models of affect representation. This paper addresses this issue by: proposing a procedure to elaborate new mappings,...

Pełny tekst do pobrania w portalu
Using Different Information Channels for Affect-Aware Video Games - A Case Study
Publikacja
- M. Szwoch
- W. Szwoch
- Rok 2018
This paper presents the problem of creating affect-aware video games that use different information channels, such as image, video, physiological signals, input devices, and player’s behaviour, for emotion recognition. Presented case studies of three affect-aware games show certain conditions and limitations for using specific signals to recognize emotions and lead to interesting conclusions.

Pełny tekst do pobrania w serwisie zewnętrznym
Emotion monitoring system for drivers
Publikacja
- IFAC-PapersOnLine - Rok 2019
This article describes a new approach to the issue of building a driver monitoring system. Actual systems focus, for example, on tracking eyelid and eyebrow movements that result from fatigue. We propose a different approach based on monitoring the state of emotions. Such a system assumes that by using the emotion model based on our own concept, referred to as the reverse Plutchik’s paraboloid of emotions, the recognition of emotions...

Pełny tekst do pobrania w portalu
Material for Automatic Phonetic Transcription of Speech Recorded in Various Conditions
Publikacja
- Rok 2016
Automatic speech recognition (ASR) is under constant development, especially in cases when speech is casually produced or it is acquired in various environment conditions, or in the presence of background noise. Phonetic transcription is an important step in the process of full speech recognition and is discussed in the presented work as the main focus in this process. ASR is widely implemented in mobile devices technology, but...

Pełny tekst do pobrania w serwisie zewnętrznym
Bimodal deep learning model for subjectively enhanced emotion classification in films
Publikacja
- D. Weber
- B. Kostek
- INFORMATION SCIENCES - Rok 2024
This research delves into the concept of color grading in film, focusing on how color influences the emotional response of the audience. The study commenced by recalling state-of-the-art works that process audio-video signals and associated emotions by machine learning. Then, assumptions of subjective tests for refining and validating an emotion model for assigning specific emotional labels to selected film excerpts were presented....

Pełny tekst do pobrania w serwisie zewnętrznym
OntoValidate: OntoNotes 5.0 NER validation dataset
Dane Badawcze
wersja 1.2 open access
- S. Olewniczak
OntoValidate dataset consists of 603 randomly chosen raw textsfrom the original OntoNote 5.0 dataset (3637 raw texts in total).
Subjective tests for gathering konwledge for applaying color grading to video clips automatically
Publikacja
- D. Weber
- B. Kostek
- Rok 2019
The analysis of film music concerning caused emotions may allow for a more accurate adaptation of the color of the film in the context of color grading. Therefore, this paper aims to gather knowledge on the correlation between the applied color palette to a video clip, music associated with a particular shot,and emotions evoked. For that purpose, subjective tests are prepared in which several video clips are presented with...

Pełny tekst do pobrania w serwisie zewnętrznym
Subjective tests for gathering knowledge for applying color grading to video clips automatically
Publikacja
- D. Weber
- B. Kostek
- Rok 2019
The analysis of film music concerning caused emotions may allow for a more accurate adaptation of the color of the film in the context of color grading. Therefore, this paper aims to gather knowledge on the correlation between the applied color palette to a video clip, music associated with a particular shot, and emotions evoked. For that purpose, subjective tests are prepared in which several video clips are presented with or...

Pełny tekst do pobrania w portalu
Recognition of Emotions in Speech Using Convolutional Neural Networks on Different Datasets
Publikacja
- Electronics - Rok 2022
Artificial Neural Network (ANN) models, specifically Convolutional Neural Networks (CNN), were applied to extract emotions based on spectrograms and mel-spectrograms. This study uses spectrograms and mel-spectrograms to investigate which feature extraction method better represents emotions and how big the differences in efficiency are in this context. The conducted studies demonstrated that mel-spectrograms are a better-suited...

Pełny tekst do pobrania w portalu
Investigation of educational processes with affective computing methods
Publikacja
- A. Landowska
- G. Brodny
- e-mentor - Rok 2017
This paper concerns the monitoring of educational processes with the use of new technologies for the recognition of human emotions. This paper summarizes results from three experiments, aimed at the validation of applying emotion recognition to e-learning. An analysis of the experiments’ executions provides an evaluation of the emotion elicitation methods used to monitor learners. The comparison of affect recognition algorithms...

Pełny tekst do pobrania w portalu
Towards Emotion Acquisition in IT Usability Evaluation Context
Publikacja
- A. Landowska
- Rok 2015
The paper concerns extension of IT usability studies with automatic analysis of the emotional state of a user. Affect recognition methods and emotion representation models are reviewed and evaluated for applicability in usability testing procedures. Accuracy of emotion recognition, susceptibility to disturbances, independence on human will and interference with usability testing procedures are...

Pełny tekst do pobrania w serwisie zewnętrznym
Methodology of Affective Intervention Design for Intelligent Systems
Publikacja
- INTERACTING WITH COMPUTERS - Rok 2016
This paper concerns how intelligent systems should be designed to make adequate, valuable and natural affective interventions. The article proposes a process for choosing an affective intervention model for an intelligent system. The process consists of 10 activities that allow for step-by-step design of an affective feedback loop and takes into account the following factors: expected and desired emotional states, characteristics...

Pełny tekst do pobrania w serwisie zewnętrznym
Introduction to the special issue on machine learning in acoustics
Publikacja
- Z. Michalopoulou
- P. Gerstoft
- B. Kostek
- M. A. Roch
- Journal of the Acoustical Society of America - Rok 2021
When we started our Call for Papers for a Special Issue on “Machine Learning in Acoustics” in the Journal of the Acoustical Society of America, our ambition was to invite papers in which machine learning was applied to all acoustics areas. They were listed, but not limited to, as follows: • Music and synthesis analysis • Music sentiment analysis • Music perception • Intelligent music recognition • Musical source separation • Singing...

Pełny tekst do pobrania w portalu
Quantifying inconsistencies in the Hamburg Sign Language Notation System
Publikacja
- M. Ferlin
- S. Majchrowska
- M. A. Plantykow
- A. Kwaśniewska
- A. Mikołajczyk-Bareła
- M. Olech
- J. Nalepa
- EXPERT SYSTEMS WITH APPLICATIONS - Rok 2024
The advent of machine learning (ML) has significantly advanced the recognition and translation of sign languages, bridging communication gaps for hearing-impaired communities. At the heart of these technologies is data labeling, crucial for training ML algorithms on a huge amount of consistently labeled data to achieve models that generalize well. The adoption of language-agnostic annotations is essential to connect different sign...

Pełny tekst do pobrania w serwisie zewnętrznym
Identification of Emotions Based on Human Facial Expressions Using a Color-Space Approach
Publikacja
- Z. Kowalczuk
- P. Chudziak
- Rok 2018
HCI technology improves human-computer interaction. Such communication can be carried out with the use of emotions that are visible on the human face since birth. In this paper the Emotion system for detecting and recognizing facial expressions, developed in the MSc work, is presented. The system recognizes emotion from webcam video in real time. It is based on color segmentation and morphological operations. The system uses a...

Pełny tekst do pobrania w portalu
A video monitoring system using ontology-driven identification of threats
Publikacja
- P. Kaczmarek
- P. Zielonka
- Rok 2009
In this paper, we present a video monitoring systemthat leverages image recognition and ontological reasoningabout threats. In the solution, an image processing subsystemuses video recording of a monitored area and recognizesknown concepts in scenes. Then, a reasoning subsystem uses anontological description of security conditions and informationfrom image recognition to check if a violation of a conditionhas occurred. If a threat...

Pełny tekst do pobrania w serwisie zewnętrznym
Endoscopic Videos Deinterlacing and On-Screen Text and Light Flashes Removal and Its Influence on Image Analysis Algorithms' Efficiency
Publikacja
- International Journal of Image Processing and Visual Communication - Rok 2013
In this article, deinterlacing and removing on- screen text and light flashes methods on endoscopic video images are discussed. The research is intended to improve disease recognition algorithms' performance. In the article, four configurations of deinterlacing methods and another four configurations of text and flashes removal methods are described and examined. The efficiency of endoscopic video analysis algorithms is measured...

Pełny tekst do pobrania w serwisie zewnętrznym
Affective Learning Manifesto – 10 Years Later
Publikacja
- A. Landowska
- Rok 2014
In 2004 a group of affective computing researchers proclaimed a manifesto of affective learning that outlined the prospects and white spots of research at that time. Ten years passed by and affective computing developed many methods and tools for tracking human emotional states as well as models for affective systems construction. There are multiple examples of affective methods applications in Intelligent Tutoring Systems (ITS)....
Rust QA: question answering dataset for "The Rust Programming Language" in SQuAD 2.0 format
Dane Badawcze
open access
- S. Olewniczak
- M. Maciszka
- K. Paluszewski
- G. Pozorski
- W. Rosenthal
- Ł. Zaleski
Rust QA is a dataset for training and evaluating QA systems. The dataset consists of 1068 questions to "The Rust Programming Language" book (https://doc.rust-lang.org/stable/book/) with the answers provided as text spans from the book. The dataset is released in SQuAD 2.0 format.
Robot Eye Perspective in Perceiving Facial Expressions in Interaction with Children with Autism
Publikacja
- A. Landowska
- B. Robins
- Advances in Intelligent Systems and Computing - Rok 2020
The paper concerns automatic facial expression analysis applied in a study of natural “in the wild” interaction between children with autism and a social robot. The paper reports a study that analyzed the recordings captured via a camera located in the eye of a robot. Children with autism exhibit a diverse level of deficits, including ones in social interaction and emotional expression. The aim of the study was to explore the possibility...

Pełny tekst do pobrania w serwisie zewnętrznym
Endoscopic Video Classification with the Consideration of Temporal Patterns
Publikacja
- Rok 2012
The article describes a novel approach to automatic recognition and classification of diseases in endoscopic videos. Current directions of research in this field are discussed. Most presented methods focus on processing single frames and do not take into consideration the temporal relationship between continuous classifications. Existing approaches that consider the temporal structure of an incoming frame sequence are focused on...
Selection of an artificial pre-training neural network for the classification of inland vessels based on their images
Publikacja
- K. Bobkowska
- I. Bodus-olkowska Izabela
- Zeszyty Naukowe Akademii Morskiej w Szczecinie - Rok 2021
Artificial neural networks (ANN) are the most commonly used algorithms for image classification problems. An image classifier takes an image or video as input and classifies it into one of the possible categories that it was trained to identify. They are applied in various areas such as security, defense, healthcare, biology, forensics, communication, etc. There is no need to create one’s own ANN because there are several pre-trained...

Pełny tekst do pobrania w portalu
Superresolution algorithm to video surveillance system
Publikacja
- T. Merta
- A. Czyżewski
- Rok 2010
An application of a multiframe SR (superresolution) algorithm applied to video monitoring is described. The video signal generated by various types of video cameras with different parameters and signal distortions which may be very problematic for superresolution algorithms. The paper focuses on disadvantages in video signal which occur in video surveillance systems. Especially motion estimation and its influence on superresolution...
Semantic URL Analytics to Support Efficient Annotation of Large Scale Web Archives
Publikacja
- T. Souza
- E. Demidova
- T. Risse
- H. Holzmann
- G. Gossen
- J. Szymański
- Rok 2015
Long-term Web archives comprise Web documents gathered over longer time periods and can easily reach hundreds of terabytes in size. Semantic annotations such as named entities can facilitate intelligent access to the Web archive data. However, the annotation of the entire archive content on this scale is often infeasible. The most efficient way to access the documents within Web archives is provided through their URLs, which are...

Pełny tekst do pobrania w serwisie zewnętrznym
Multi-task Video Enhancement for Dental Interventions
Publikacja
- E. Katsaros
- P. Kopa Ostrowski
- K. P. Włódarczak
- E. Lewandowska
- J. Rumiński
- D. Siupka-Mróz
- Ł. Lassmann
- A. Jezierska
- D. Węsierski
- Rok 2022
A microcamera firmly attached to a dental handpiece allows dentists to continuously monitor the progress of conservative dental procedures. Video enhancement in video-assisted dental interventions alleviates low-light, noise, blur, and camera handshakes that collectively degrade visual comfort. To this end, we introduce a novel deep network for multi-task video enhancement that enables macro-visualization of dental scenes. In particular,...

Pełny tekst do pobrania w serwisie zewnętrznym
EMBOA - affective loop in Socially Assistive Robotics as an intervention tool for children with autism
Kursy Online
- M. Wróbel
- A. Landowska
The aim of the training course "Intensive programmes for higher education learner" within the EMBOA project is to familiarise participants with the use of social robots as an intervention tool for children with autism, emotion recognition and the combination of both methods. Students will be informed about the guidelines and results of the project.
Semantic Integration of Heterogeneous Recognition Systems
Publikacja
- P. Kaczmarek
- P. Raszkowski
- LECTURE NOTES IN COMPUTER SCIENCE - Rok 2011
Computer perception of real-life situations is performed using a variety of recognition techniques, including video-based computer vision, biometric systems, RFID devices and others. The proliferation of recognition modules enables development of complex systems by integration of existing components, analogously to the Service Oriented Architecture technology. In the paper, we propose a method that enables integration of information...
The American Sign Language alphabet
Dane Badawcze
open access
- S. Olewniczak
- K. Witczak
- I. Czartowski
- H. Wołek
The American Sign Language dataset contains all static letters of the American alphabet, meaning those that do not require movement to perform (the entire alphabet except for the letters 'J' and 'Z', which are dynamic and require hand movement).
The Innovative Faculty for Innovative Technologies
Publikacja
- Rok 2013
A leaflet describing Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology. Multimedia Systems Department described laboratories and prototypes of: Auditory-visual attention stimulator, Automatic video event detection, Object re-identification application for multi-camera surveillance systems, Object Tracking and Automatic Master-Slave PTZ Camera Positioning System, Passive Acoustic Radar,...

Pełny tekst do pobrania w serwisie zewnętrznym
Improving Traffic Light Recognition Methods using Shifting Time-Windows
Publikacja
- A. Blokus
- H. Krawczyk
- Rok 2018
We propose a novel method of improving algorithms recognizing traffic lights in video sequences. Our focus is on algorithms for applications which notify the driver of a light in sight. Many existing methods process images in the recording separately. Our method bases on the observation that real-life videos depict underlying continuous processes. We named our method FSA (Frame Sequence Analyzed). It is applicable for any underlying...

Pełny tekst do pobrania w serwisie zewnętrznym
WYKORZYSTANIE SIECI NEURONOWYCH DO SYNTEZY MOWY WYRAŻAJĄCEJ EMOCJE
Publikacja
- S. Zaporowski
- B. Kostek
- Rok 2018
W niniejszym artykule przedstawiono analizę rozwiązań do rozpoznawania emocji opartych na mowie i możliwości ich wykorzystania w syntezie mowy z emocjami, wykorzystując do tego celu sieci neuronowe. Przedstawiono aktualne rozwiązania dotyczące rozpoznawania emocji w mowie i metod syntezy mowy za pomocą sieci neuronowych. Obecnie obserwuje się znaczny wzrost zainteresowania i wykorzystania uczenia głębokiego w aplikacjach związanych...
Affective reactions to playing digital games
Publikacja
- A. Landowska
- M. Wróbel
- Rok 2015
The paper presents a study of emotional states during a gameplay. An experiment of two-player Tetris game is reported, followed by the analysis of the results - self-reported emotional states as well as physiological signals measurements interpretation. The study reveals the diversity of emotional reactions and concludes, that a representative player's emotional model is hard to define. Instead, an adaptive approach to emotion...

Pełny tekst do pobrania w serwisie zewnętrznym
Rough Sets Applied to Mood of Music Recognition
Publikacja
- B. Kostek
- M. Piotrowska
- Rok 2016
With the growth of accessible digital music libraries over the past decade, there is a need for research into automated systems for searching, organizing and recommending music. Mood of music is considered as one of the most intuitive criteria for listeners, thus this work is focused on the emotional content of music and its automatic recognition. The research study presented in this work contains an attempt to music emotion recognition...
Non-Contact Temperature Measurements Dataset
Publikacja
- A. Mroziński
- Rok 2022
The dataset titled The influence of the distance of the pyrometer from the surface of the radiating object on the accuracy of measurements contains temperature measurements using a selection of four commercially available pyrometers (CHY 314P, TM-F03B, TFA 31.1125 and AB-8855) as a function of the measuring distance. The dataset allows a comparison of the accuracy and measuring precision of the devices, which are very important...

Pełny tekst do pobrania w portalu
IMAGE CORRELATION AS A TOLL FOR TRACKING FACIAL CHANGES CAUSING BY EXTERNAL STIMULI
Publikacja
- K. Bobkowska
- A. Janowski
- M. Przyborski
- Rok 2015
Expressions of the human face bring a lot of information, which are a valuable source in the areas of computer vision, remote sensing and affective computing. For years, by analyzing the movement of the skin and facial muscles scientists are trying to create the perfect tool, based on image analysis, allowing the recognition of emotional states of human beings. To create a reliable algorithm, it is necessary to explore and examine...

Pełny tekst do pobrania w serwisie zewnętrznym
Eye Blink Based Detection of Liveness in Biometric Authentication Systems Using Conditional Random Fields
Publikacja
- M. Szwoch
- P. Pieniążek
- Rok 2012
The goal of this paper was to verify whether the conditional random fields are suitable and enough efficient for eye blink detection in user authentication systems based on face recognition with a standard web camera. To evaluate this approach several experiments were carried on using a specially developed test application and video database.
Ranking Speech Features for Their Usage in Singing Emotion Classification
Publikacja
- S. Zaporowski
- B. Kostek
- Rok 2020
This paper aims to retrieve speech descriptors that may be useful for the classification of emotions in singing. For this purpose, Mel Frequency Cepstral Coefficients (MFCC) and selected Low-Level MPEG 7 descriptors were calculated based on the RAVDESS dataset. The database contains recordings of emotional speech and singing of professional actors presenting six different emotions. Employing the algorithm of Feature Selection based...

Pełny tekst do pobrania w portalu
MACHINE LEARNING APPLICATIONS IN RECOGNIZING HUMAN EMOTIONS BASED ON THE EEG
Publikacja
- A. Kastrau
- M. Koronowski
- M. Liksza
- P. Jasik
- Rok 2021
This study examined the machine learning-based approach allowing the recognition of human emotional states with the use of EEG signals. After a short introduction to the fundamentals of electroencephalography and neural oscillations, the two-dimensional valence-arousal Russell’s model of emotion was described. Next, we present the assumptions of the performed EEG experiment. Detail aspects of the data sanitization including preprocessing,...
Evaluating the Use of Edge Device Towards Fall Detection in Smart City Environment
Publikacja
- T. Ludwisiak
- M. Mazur-Milecka
- T. Kocejko
- J. Rumiński
- J. Kang-Hyun
- Rok 2024
This paper presents the development and preliminary testing of a fall detection algorithm that leverages OpenPose for real-time human pose estimation from video feeds. The system is designed to function optimally within a range of up to 7 meters from ground-level cameras, focusing exclusively on detected human silhouettes to enhance processing efficiency. The performance of the proposed approach was evaluated using accuracy values...

Pełny tekst do pobrania w serwisie zewnętrznym
Video content analysis in the urban area telemonitoring system
Publikacja
- Rok 2010
The task of constant monitoring of video streams from a large number of cameras and reviewing the recordings in order to find a specified event requires a considerable amount of time and effort from the system operators and it is prone to errors. A solution to this problem is an automatic system for constant analysis of camera images being able to raise an alarm if a predefined event is detected. The chapter presents various aspects...

Pełny tekst do pobrania w serwisie zewnętrznym
Potential and Use of the Googlenet Ann for the Purposes of Inland Water Ships Classification
Publikacja
- K. Bobkowska
- I. Bodus-olkowska Izabela
- Polish Maritime Research - Rok 2020
This article presents an analysis of the possibilities of using the pre-degraded GoogLeNet artificial neural network to classify inland vessels. Inland water authorities monitor the intensity of the vessels via CCTV. Such classification seems to be an improvement in their statutory tasks. The automatic classification of the inland vessels from video recording is a one of the main objectives of the Automatic Ship Recognition and...

Pełny tekst do pobrania w portalu
KORPUS MOWY ANGIELSKIEJ DO CELÓW MULTIMODALNEGO AUTOMATYCZNEGO ROZPOZNAWANIA MOWY
Publikacja
- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Rok 2016
W referacie zaprezentowano audiowizualny korpus mowy zawierający 31 godzin nagrań mowy w języku angielskim. Korpus dedykowany jest do celów automatycznego audiowizualnego rozpoznawania mowy. Korpus zawiera nagrania wideo pochodzące z szybkoklatkowej kamery stereowizyjnej oraz dźwięk zarejestrowany przez matrycę mikrofonową i mikrofon komputera przenośnego. Dzięki uwzględnieniu nagrań zarejestrowanych w warunkach szumowych korpus...
On Facial Expressions and Emotions RGB-D Database
Publikacja
- M. Szwoch
- Rok 2014
The goal of this paper is to present the idea of creating reference database of RGB-D video recordings for recognition of facial expressions and emotions. Two different formats of the recordings used for creation of two versions of the database are described and compared using different criteria. Examples of first applications using databases are also presented to evaluate their usefulness.

Pełny tekst do pobrania w serwisie zewnętrznym
Interactions with recognized patients using smart glasses
Publikacja
- J. Rumiński
- M. Smiatacz
- A. Bujnowski
- A. Andrushevich
- M. Biallas
- R. Kistler
- Rok 2015
Recently, different smart glasses solutions have been proposed on the market. The rapid development of this wearable technology has led to several research projects related to applications of smart glasses in healthcare. In this paper we propose a general architecture of the system enabling data integration for the recognized person. In the proposed system smart glasses integrates data obtained for the recognized patient from health...

Pełny tekst do pobrania w serwisie zewnętrznym
Multimodal English corpus for automatic speech recognition
Publikacja
- Rok 2013
A multimodal corpus developed for research of speech recognition based on audio-visual data is presented. Besides usual video and sound excerpts, the prepared database contains also thermovision images and depth maps. All streams were recorded simultaneously, therefore the corpus enables to examine the importance of the information provided by different modalities. Based on the recordings, it is also possible to develop a speech...
Recognizing emotions on the basis of keystroke dynamics
Publikacja
- A. Kołakowska
- Rok 2015
The article describes a research on recognizing emotional states on the basis of keystroke dynamics. An overview of various studies and applications of emotion recognition based on data coming from keyboard is presented. Then, the idea of an experiment is presented, i.e. the way of collecting and labeling training data, extracting features and finally training classifiers. Different classification approaches are proposed to be...

Pełny tekst do pobrania w serwisie zewnętrznym
Focus on Misinformation: Improving Medical Experts’ Efficiency of Misinformation Detection
Publikacja
- A. Nabożny
- B. Balcerzak
- M. Morzy
- A. Wierzbicki
- Rok 2021
Fighting medical disinformation in the era of the global pandemic is an increasingly important problem. As of today, automatic systems for assessing the credibility of medical information do not offer sufficient precision to be used without human supervision, and the involvement of medical expert annotators is required. Thus, our work aims to optimize the utilization of medical experts’ time. We use the dataset of sentences taken...

Pełny tekst do pobrania w serwisie zewnętrznym
Analysis of human behavioral patterns
Publikacja
- A. Kołakowska
- Rok 2022
Widespread usage of Internet and mobile devices entailed growing requirements concerning security which in turn brought about development of biometric methods. However, a specially designed biometric system may infer more about users than just verifying their identity. Proper analysis of users’ characteristics may also tell much about their skills, preferences, feelings. This chapter presents biometric methods applied in several...

Pełny tekst do pobrania w serwisie zewnętrznym
Agnieszka Landowska dr hab. inż.

Osoby

Katedra Inżynierii Oprogramowania

Ukończyła studia na dwóch kierunkach: Finanse i bankowość na Uniwersytecie Gdańskim oraz Informatyka na WETI Politechniki Gdańskiej. Od 2000 roku jest związana z Politechniką Gdańską. W 2006 roku uzyskała stopień doktora w dziedzinie nauk technicznych, a w roku 2019 stopień doktora habilitowanego. Aktualnie jej praca naukowa dotyczy zagadnień interakcji człowiek-komputer oraz informatyki afektywnej (ang. affective computing), która...
Visual Lip Contour Detection for the Purpose of Speech Recognition
Publikacja
- Rok 2014
A method for visual detection of lip contours in frontal recordings of speakers is described and evaluated. The purpose of the method is to facilitate speech recognition with visual features extracted from a mouth region. Different Active Appearance Models are employed for finding lips in video frames and for lip shape and texture statistical description. Search initialization procedure is proposed and error measure values are...

Wyszukiwarka

Filtry

Katalog

Wyniki wyszukiwania dla: EMOTION RECOGNITION, DATASET, VIDEO ANNOTATION

Agnieszka Landowska dr hab. inż.