Search results for: emotion recognition, dataset, video annotation

Investigating Feature Spaces for Isolated Word Recognition

Publication

G. Korvel
G. Tamulevicus
P. Treigys
J. Bernataviciene
B. Kostek

- Year 2018

Much attention has been given by researchers to the speech processing task in automatic speech recognition (ASR) over the past decades. The study investigates the appropriateness of a two-dimensional representation of speech feature spaces for speech recognition tasks based on deep learning techniques. The approach combines Convolutional Neural Networks (CNNs) and time-frequency signal representation...

Detection of Face Position and Orientation Using Depth Data

Publication

M. Szwoch
P. Pieniążek

- Advances in Intelligent Systems and Computing - Year 2015

In this paper an original approach is presented for real-time detection of user's face position and orientation based only on depth channel from a Microsoft Kinect sensor which can be used in facial analysis on scenes with poor lighting conditions where traditional algorithms based on optical channel may have failed. Thus the proposed approach can support, or even replace, algorithms based on optical channel or based on skeleton...

Full text available for download from an external service

Multi-Stage Video Analysis Framework

Publication

- Year 2011

The chapter is organized as follows. Section 2 presents the general structure of the proposed framework and a method of data exchange between system elements. Section 3 describes the low-level analysis modules for detection and tracking of moving objects. In Section 4 we present the object classification module. Sections 5 and 6 describe specialized modules for detection and recognition of faces and license plates, respectively....

Full text available for download from an external service

Parallelization of video stream algorithms in kaskada platform

Publication

A. Brzeski

- Year 2011

The purpose of this work is to present different techniques of video stream algorithms parallelization provided by the Kaskada platform - a novel system working in a supercomputer environment designated for multimedia streams processing. Considered parallelization methods include frame-level concurrency, multithreading and pipeline processing. Execution performance was measured on four time-consuming image recognition algorithms,...

An audio-visual corpus for multimodal automatic speech recognition

Publication

- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Year 2017

A review of available audio-visual speech corpora and a description of a new multimodal corpus of English speech recordings is provided. The new corpus containing 31 hours of recordings was created specifically to assist audio-visual speech recognition systems (AVSR) development. The database related to the corpus includes high-resolution, high-framerate stereoscopic video streams from RGB cameras, depth imaging stream utilizing Time-of-Flight...

Full text available for download in the portal

Neural Network Subgraphs Correlation with Trained Model Accuracy

Publication

I. Wrosz

- Year 2020

Neural Architecture Search (NAS) is a computationally demanding process of finding optimal neural network architecture for a given task. Conceptually, NAS comprises applying a search strategy on a predefined search space accompanied by a performance evaluation method. The design of search space alone is expected to substantially impact NAS efficiency. We consider neural networks as graphs and find a correlation between the presence...

Full text available for download from an external service

Automatic recognition of males and females among web browser users based on behavioural patterns of peripherals usage

Publication

A. Kołakowska
A. Landowska
P. Jarmolkowicz
M. Jarmolkowicz
K. Sobota

- Internet Research - Year 2016

Purpose: The purpose of this paper is to answer the question whether it is possible to recognise the gender of a web browser user on the basis of keystroke dynamics and mouse movements. Design/methodology/approach: An experiment was organised in order to track mouse and keyboard usage using a special web browser plug-in. After collecting the data, a number of parameters describing the users’ keystrokes, mouse movements and clicks...

Full text available for download from an external service

Gesture Recognition With the Linear Optical Sensor and Recurrent Neural Networks

Publication

- IEEE SENSORS JOURNAL - Year 2018

In this paper, the optical linear sensor, a representative of low-resolution sensors, was investigated in the multiclass recognition of near-field hand gestures. The recurrent neural network (RNN) with a gated recurrent unit (GRU) memory cell was utilized as a gestures classifier. A set of 27 gestures was collected from a group of volunteers. The 27 000 sequences obtained were divided into training, validation, and test subsets....

Full text available for download in the portal

Application of autoencoder to traffic noise analysis

Publication

- Journal of the Acoustical Society of America - Year 2019

The aim of an autoencoder neural network is to transform the input data into a lower-dimensional code and then to reconstruct the output from this code representation. Applications of autoencoders to classifying sound events in the road traffic have not been found in the literature. The presented research aims to determine whether such an unsupervised learning method may be used for deploying classification algorithms applied to...

Full text available for download in the portal

Noise profiling for speech enhancement employing machine learning models

Publication

K. Kąkol
G. Korvel
B. Kostek

- Journal of the Acoustical Society of America - Year 2022

This paper aims to propose a noise profiling method that can be performed in near real-time based on machine learning (ML). To address challenges related to noise profiling effectively, we start with a critical review of the literature background. Then, we outline the experiment performed consisting of two parts. The first part concerns the noise recognition model built upon several baseline classifiers and noise signal features...

Full text available for download in the portal

Driver’s Condition Detection System Using Multimodal Imaging and Machine Learning Algorithms

Publication

- Year 2023

To this day, driver fatigue remains one of the most significant causes of road accidents. In this paper, a novel way of detecting and monitoring a driver’s physical state has been proposed. The goal of the system was to make use of multimodal imaging from RGB and thermal cameras working simultaneously to monitor the driver’s current condition. A custom dataset was created consisting of thermal and RGB video samples. Acquired data...

Full text available for download from an external service

Improving methods for detecting people in video recordings using shifting time-windows

Publication

A. Blokus
H. Krawczyk

- Year 2018

We propose a novel method for improving algorithms which detect the presence of people in video sequences. Our focus is on algorithms for applications which require reporting and analyzing all scenes with detected people in long recordings. Therefore one of the target qualities of the classification result is its stability, understood as a low number of invalid scene boundaries. Many existing methods process images in the recording...

Full text available for download from an external service

Thermal Image Processing for Respiratory Estimation from Cubical Data with Expandable Depth

Publication

M. Szankin
A. Kwaśniewska
J. Rumiński

- Journal of Imaging - Year 2023

As healthcare costs continue to rise, finding affordable and non-invasive ways to monitor vital signs is increasingly important. One of the key metrics for assessing overall health and identifying potential issues early on is respiratory rate (RR). Most of the existing methods require multiple steps that consist of image and signal processing. This might be difficult to deploy on edge devices that often do not have specialized...

Full text available for download in the portal

A comparative study of English viseme recognition methods and algorithms

Publication

- MULTIMEDIA TOOLS AND APPLICATIONS - Year 2018

An elementary visual unit – the viseme is concerned in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative research on their effectiveness. In the course of the study an optimal feature vector construction...

Full text available for download in the portal

A comparative study of English viseme recognition methods and algorithm

Publication

- MULTIMEDIA TOOLS AND APPLICATIONS - Year 2018

An elementary visual unit – the viseme is concerned in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative research on their effectiveness. In the course of the study an optimal feature vector...

Full text available for download in the portal

Controlling computer by lip gestures employing neural network

Publication

- Year 2010

Results of experiments regarding lip gesture recognition with an artificial neural network are discussed. The neural network module forms the core element of a multimodal human-computer interface called LipMouse. This solution allows a user to work on a computer using lip movements and gestures. A user face is detected in a video stream from a standard web camera using a cascade of boosted classifiers working with Haar-like features....

Full text available for download from an external service

Deep CNN based decision support system for detection and assessing the stage of diabetic retinopathy

Publication

A. Kwasigroch
B. Jarzembinski
M. Grochowski

- Year 2018

The diabetic retinopathy is a disease caused by long-standing diabetes. Lack of effective treatment can lead to vision impairment and even irreversible blindness. The disease can be diagnosed by examining digital color fundus photographs of retina. In this paper we propose deep learning approach to automated diabetic retinopathy screening. Deep convolutional neural networks (CNN) - the most popular kind of deep learning algorithms...

Full text available for download from an external service

Convolutional Neural Networks for C. Elegans Muscle Age Classification Using Only Self-Learned Features

Publication

- Journal of Telecommunications and Information Technology - Year 2022

Nematodes Caenorhabditis elegans (C. elegans) have been used as model organisms in a wide variety of biological studies, especially those intended to obtain a better understanding of aging and age-associated diseases. This paper focuses on automating the analysis of C. elegans imagery to classify the muscle age of nematodes based on the known and well established IICBU dataset. Unlike many modern classification methods, the proposed...

Full text available for download in the portal

The Hough transform in the classification process of inland ships

Publication

K. Bobkowska
N. Wawrzyniak

- Zeszyty Naukowe Akademii Morskiej w Szczecinie - Year 2019

This article presents an analysis of the possibilities of using image processing methods for feature extraction that allows kNN classification based on a ship’s image delivered from an on-water video surveillance system. The subject of the analysis is the Hough transform which enables the detection of straight lines in an image. The recognized straight lines and the information about them serve as features in the classification...

Full text available for download in the portal

Real-Time Gastrointestinal Tract Video Analysis on a Cluster Supercomputer

Publication

- Year 2012

The article presents a novel approach to medical video data analysis and recognition. Emphasis has been put on adapting existing algorithms detecting lesions and bleedings for real time usage in a medical doctor's office during an endoscopic examination. A system for diagnosis recommendation and disease detection has been designed taking into account the limited mobility of the endoscope and the doctor's requirements. The...

Real-Time Bleeding Detection in Gastrointestinal Tract Endoscopic Examinations Video

Publication

- International Journal of Distributed and Parallel Systems - Year 2013

The article presents a novel approach to medical video data analysis and recognition of bleedings. Emphasis has been put on adapting pre-existing algorithms dedicated to the detection of bleedings for real-time usage in a medical doctor’s office during an endoscopic examination. A real-time system for analyzing endoscopic videos has been designed according to the most significant requirements of medical doctors. The main goal of...

Full text available for download in the portal

Concurrent Video Denoising and Deblurring for Dynamic Scenes

Publication

- IEEE Access - Year 2021

Dynamic scene video deblurring is a challenging task due to the spatially variant blur inflicted by independently moving objects and camera shakes. Recent deep learning works bypass the ill-posedness of explicitly deriving the blur kernel by learning pixel-to-pixel mappings, which is commonly enhanced by larger region awareness. This is a difficult yet simplified scenario because noise is neglected when it is omnipresent in a wide...

Full text available for download in the portal

Toward Robust Pedestrian Detection With Data Augmentation

Publication

- IEEE Access - Year 2020

In this article, the problem of creating a safe pedestrian detection model that can operate in the real world is tackled. While recent advances have led to significantly improved detection accuracy on various benchmarks, existing deep learning models are vulnerable to changes in the input image that are invisible to the human eye, which raises concerns about their safety. A popular and simple technique for improving robustness is using data...

Full text available for download in the portal

Identification of Emotional States Using Phantom Miro M310 Camera

Publication

M. Przyborski

- Internal Security - Year 2013

The purpose of this paper is to present the possibilities associated with the use of remote sensing methods in identifying human emotional states, and to present the results of the research conducted by the authors in this field. The studies presented involved the use of advanced image analysis to identify areas on the human face that change their activity along with emotional expression. Most of the research carried out in laboratories...

Audio content analysis in the urban area telemonitoring system

Publication

- Year 2010

The article presents the possibilities of extending urban monitoring with automatic sound analysis. Sound parametrization methods that can be applied in such a system are presented, and the technical aspects of the implementation are discussed. The next part presents the tree-based decision system used in the system. This system recognizes dangerous sounds (gunshot, broken glass, scream) among the recorded sounds...

Full text available for download from an external service

Speech Analytics Based on Machine Learning

Publication

- Year 2019

In this chapter, the process of speech data preparation for machine learning is discussed in detail. Examples of speech analytics methods applied to phonemes and allophones are shown. Further, an approach to automatic phoneme recognition involving optimized parametrization and a classifier belonging to machine learning algorithms is discussed. Feature vectors are built on the basis of descriptors coming from the music information...

Full text available for download from an external service

Enabling Deeper Linguistic-based Text Analytics – Construct Development for the Criticality of Negative Service Experience

Publication

- IEEE Access - Year 2019

Significant progress has been made in linguistic-based text analytics particularly with the increasing availability of data and deep learning computational models for more accurate opinion analysis and domain-specific entity recognition. In understanding customer service experience from texts, analysis of sentiments associated with different stages of the service lifecycle is a useful starting point. However, when richer insights...

Full text available for download in the portal

Minimizing Distribution and Data Loading Overheads in Parallel Training of DNN Acoustic Models with Frequent Parameter Averaging

Publication

- Year 2017

In the paper we investigate the performance of parallel deep neural network training with parameter averaging for acoustic modeling in Kaldi, a popular automatic speech recognition toolkit. We describe experiments based on training a recurrent neural network with 4 layers of 800 LSTM hidden states on a 100-hour corpus of annotated Polish speech data. We propose an MPI-based modification of the training program which minimizes the...

Full text available for download from an external service

Selected Technical Issues of Deep Neural Networks for Image Classification Purposes

Publication

- Bulletin of the Polish Academy of Sciences-Technical Sciences - Year 2019

In recent years, deep learning and especially Deep Neural Networks (DNN) have obtained amazing performance on a variety of problems, in particular in classification or pattern recognition. Among many kinds of DNNs, the Convolutional Neural Networks (CNN) are most commonly used. However, due to their complexity, there are many problems related but not limited to optimizing network parameters, avoiding overfitting and ensuring good...

Full text available for download in the portal

CNN-CLFFA: Support Mobile Edge Computing in Transportation Cyber Physical System

Publication

A. Bhansali
R. Kumar Patra
P. Bidare Divakarachari
P. Falkowski-Gilski
G. Shivakanth
S. N. Patil

- IEEE Access - Year 2024

In the present scenario, the transportation Cyber Physical System (CPS) improves the reliability and efficiency of the transportation systems by enhancing the interactions between the physical and cyber systems. With the provision of better storage ability and enhanced computing, cloud computing extends transportation CPS in Mobile Edge Computing (MEC). By inspecting the existing literatures, the cloud computing cannot fulfill...

Full text available for download in the portal

Genetic programming extension to APF-based monocular human body pose estimation

Publication

P. Szczuko

- MULTIMEDIA TOOLS AND APPLICATIONS - Year 2012

New method of the human body pose estimation based on a single camera 2D observation is presented, aimed at smart surveillance related video analysis and action recognition. It employs 3D model of the human body, and genetic algorithm combined with annealed particle filter for searching the global optimum of model state, best matching the object's 2D observation. Additionally, new motion cost metric is employed, considering current...

Full text available for download in the portal

How do responsible universities perceive their social engagement? In search of signs of Creating Shared Value by the University

Publication

E. Karwowska

- Journal of Modern Science - Year 2023

Objectives: University social responsibility still lacks legitimisation and is perceived as a burden that hinders academics from doing research and teaching. Creating Shared Value by the University may serve as a tool to motivate universities to engage in initiatives for society, as this is beneficial for both parties. Yet, some researchers perceive the creation of economic value as inappropriate for academia. Thus, it was interesting...

Full text available for download in the portal

Detecting Apples in the Wild: Potential for Harvest Quantity Estimation

Publication

A. Janowski
R. Kaźmierczak
C. Kowalczyk
J. Szulwic

- Sustainability - Year 2021

Knowing the exact number of fruits and trees helps farmers to make better decisions in their orchard production management. The current practice of crop estimation often involves manual counting of fruits (before harvesting), which is an extremely time-consuming and costly process. Additionally, this is not practicable for large orchards. Thanks to the changes that have taken place in recent years in the field of image...

Full text available for download in the portal
