Search results for: VIDEO EVENT RECOGNITION

Testbed analysis of video and VoIP transsmission performance in IEEE 802.11 b/g/n networks

Publication

- TELECOMMUNICATION SYSTEMS - Year 2011

The aim of the work is to analyze capabilities and limitations of different implementations of IEEE 802.11 technologies (IEEE 802.11 b/g/n), utilized for both video streaming and VoIP calls directed to mobile devices. Our preliminary research showed that results obtained with currently popular simulation tools can be drastically different than these possible in real-world environment, so, in order to correctly evaluate performance...

Full text available to download

Systematic Literature Review for Emotion Recognition from EEG Signals

Publication

- Year 2022

Researchers have recently become increasingly interested in recognizing emotions from electroencephalogram (EEG) signals and many studies utilizing different approaches have been conducted in this field. For the purposes of this work, we performed a systematic literature review including over 40 articles in order to identify the best set of methods for the emotion recognition problem. Our work collects information about the most...

Full text available to download

Systematic Literature Review for Emotion Recognition from EEG Signals

Publication

- Year 2022

Researchers have recently become increasingly interested in recognizing emotions from electroencephalogram (EEG) signals and many studies utilizing different approaches have been conducted in this field. For the purposes of this work, we performed a systematic literature review including over 40 articles in order to identify the best set of methods for the emotion recognition problem. Our work collects information about the most...

Full text to download in external service

Automatic recognition of therapy progress among children with autism

Publication

A. Kołakowska
A. Landowska
A. Anzulewicz
K. Sobota

- Scientific Reports - Year 2017

The article presents a research study on recognizing therapy progress among children with autism spectrum disorder. The progress is recognized on the basis of behavioural data gathered via five specially designed tablet games. Over 180 distinct parameters are calculated on the basis of raw data delivered via the game flow and tablet sensors - i.e. touch screen, accelerometer and gyroscope. The results obtained confirm the possibility...

Full text available to download

Local Texture Pattern Selection for Efficient Face Recognition and Tracking

Publication

- Advances in Intelligent Systems and Computing - Year 2015

This paper describes the research aimed at finding the optimal configuration of the face recognition algorithm based on local texture descriptors (binary and ternary patterns). Since the identification module was supposed to be a part of the face tracking system developed for interactive wearable computer, proper feature selection, allowing for real-time operation, became particularly important. Our experiments showed that it is...

Full text to download in external service

Feasibility Study for Food Intake Tasks Recognition Based on Smart Glasses

Publication

M. Biallas
A. Andrushevich
R. Kistler
A. Klapproth
K. Czuszyński
A. Bujnowski

- Journal of Medical Imaging and Health Informatics - Year 2015

In this exploratory study 13 adult test subjects have performed different food intake tasks while wearing a three axis accelerometer mounted at a temple of glasses. Two different algorithms for task recognition have been applied and compared. The retrospective data processing leads to better task recognition results when the frequency range of 50 Hz to 100 Hz is analysed within accelerometer signal recordings. A straightforward...

Full text to download in external service

User experience evaluation study on the quality of 1K, 2K, and 4K H.265/HEVC video content

Publication

P. Falkowski-Gilski
T. Uhl
C. Hoppe

- Zeszyty Naukowe Akademii Morskiej w Szczecinie - Year 2024

Nowadays, most content creators focus on distributing rich media at the highest possible resolution. Currently, the majority of sold consoles, media players, computer hardware, as well as displays and TVs are advertised as 4K-compatible. The same trend is observed in the case of popular online streaming services and terrestrial TV broadcasts. Generally speaking, it is assumed that higher bitrates provide higher subjective judgements....

Full text available to download

Fuzzy rule-based dynamic gesture recognition employing camera & multimedia projector

Publication

- Year 2010

In the paper the system based on camera and multimedia projector enabling a user to control computer applications by dynamic hand gestures is presented. The main objective is to present the gesture recognition methodology which bases on representing hand movement trajectory by motion vectors analyzed using fuzzy rule-based inference. The approach was engineered in the system developed with J2SE and C++ / OpenCV technology. OpenCV...

Full text to download in external service

RECSYS CHALLENGE 2015: a BUY EVENT PREDICTION IN THE E-COMMERCE DOMAIN

Publication

A. Karpus

- Year 2016

In this paper we present our approach to RecSys Challenge 2015. Given a set of e-commerce events, the task is to predict whether a user will buy something in the current session and, if yes, which of the item will be bought. We show that the data preparation and enrichment are very important in finding the solution for the challenge and that simple ideas and intuitions could lead to satisfactory results. We also show that simple...

A survey of automatic speech recognition deep models performance for Polish medical terms

Publication

- Year 2023

Among the numerous applications of speech-to-text technology is the support of documentation created by medical personnel. There are many available speech recognition systems for doctors. Their effectiveness in languages such as Polish should be verified. In connection with our project in this field, we decided to check how well the popular speech recognition systems work, employing models trained for the general Polish language....

Full text to download in external service

Multimodal human-computer interfaces based on advanced video and audio analysis

Publication

- Year 2013

Multimodal interfaces development history is reviewed briefly in the introduction. Examples of applications of multimodal interfaces to education software and for the disabled people are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with mouth gestures and the audio interface for speech stretching for hearing impaired and stuttering people. The Smart...

Full text to download in external service

Hybrid of Neural Networks and Hidden Markov Models as a modern approach to speech recognition systems

Publication

- Pomiary Automatyka Robotyka - Year 2013

The aim of this paper is to present a hybrid algorithm that combines the advantages ofartificial neural networks and hidden Markov models in speech recognition for control purpos-es. The scope of the paper includes review of currently used solutions, description and analysis of implementation of selected artificial neural network (NN) structures and hidden Markov mod-els (HMM). The main part of the paper consists of a description...

Full text available to download

Multi-Stage Video Analysis Framework

Publication

A. Czyewski
G. Szwoch
P. Dalka
P. Szczuko
A. Ciarkowski
D. Ellwart
T. Merta
K. opatka
u. Kulasek
J. Wolski

- Year 2011

Full text to download in external service

Towards Cognitive and Perceptive Video Systems

Publication

T. Akgun
C. Attwood
A. Cavallaro
C. Fabre
F. Poiesi
P. Szczuko

- Year 2014

In this chapter we cover research and development issues related to smart cameras. We discuss challenges, new technologies and algorithms, applications and the evaluation of today’s technologies. We will cover problems related to software, hardware, communication, embedded and distributed systems, multi-modal sensors, privacy and security. We also discuss future trends and market expectations from the customer’s point of view.

Full text to download in external service

Improving methods for detecting people in video recordings using shifting time-windows

Publication

A. Blokus
H. Krawczyk

- Year 2018

We propose a novel method for improving algorithms which detect the presence of people in video sequences. Our focus is on algorithms for applications which require reporting and analyzing all scenes with detected people in long recordings. Therefore one of the target qualities of the classification result is its stability, understood as a low number of invalid scene boundaries. Many existing methods process images in the recording...

Full text to download in external service

Automated Classifier Development Process for Recognizing Book Pages from Video Frames

Publication

- Communications in Computer and Information Science - Year 2020

One of the latest developments made by publishing companies is introducing mixed and augmented reality to their printed media (e.g. to produce augmented books). An important computer vision problem that they are facing is classification of book pages from video frames. The problem is non-trivial, especially considering that typical training data is limited to only one digital original per book page, while the trained classifier...

Full text to download in external service

Theory of recognition in a historical perspective. Axel Honneth's Anerkennung: Eine europäische Ideengeschichte

Publication

A. Karalus

- Archiwum Historii Filozofii i Myśli Społecznej - Year 2019

The article discusses Honneth excursion into the realm of the history of ideas. This time Honneth decides to laser it on the notion of "recognition" in three different cultural areas and three different traditions: French, English, and German. The article discusses Honneth's persepctive and attempts at finding the common thread that would link three aforementioned traditions.

Full text available to download

A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces

Publication

G. Tamulevicius
G. Korvel
A. B. Yayak
P. Treigys
J. Bernataviciene
B. Kostek

- Electronics - Year 2020

In this research, a study of cross-linguistic speech emotion recognition is performed. For this purpose, emotional data of different languages (English, Lithuanian, German, Spanish, Serbian, and Polish) are collected, resulting in a cross-linguistic speech emotion dataset with the size of more than 10.000 emotional utterances. Despite the bi-modal character of the databases gathered, our focus is on the acoustic representation...

Full text available to download

A review of emotion recognition methods based on keystroke dynamics and mouse movements

Publication

A. Kołakowska

- Year 2013

The paper describes the approach based on using standard input devices, such as keyboard and mouse, as sources of data for the recognition of users’ emotional states. A number of systems applying this idea have been presented focusing on three categories of research problems, i.e. collecting and labeling training data, extracting features and training classifiers of emotions. Moreover the advantages and examples of combining standard...

Full text to download in external service

Speaker Recognition Using Convolutional Neural Network with Minimal Training Data for Smart Home Solutions

Publication

M. Wang
T. Sirlapu
A. Kwaśniewska
M. Szankin
M. Bartscherer
R. Nicolas

- Year 2018

With the technology advancements in smart home sector, voice control and automation are key components that can make a real difference in people's lives. The voice recognition technology market continues to involve rapidly as almost all smart home devices are providing speaker recognition capability today. However, most of them provide cloud-based solutions or use very deep Neural Networks for speaker recognition task, which are...

Full text to download in external service

Examining Classifiers Applied to Static Hand Gesture Recognition in Novel Sound Mixing System

Publication

- Advances in Intelligent Systems and Computing - Year 2013

The main objective of the chapter is to present the methodology and results of examining various classifiers (Nearest Neighbor-like algorithm with non-nested generalization (NNge), Naive Bayes, C4.5 (J48), Random Tree, Random Forests, Artificial Neural Networks (Multilayer Perceptron), Support Vector Machine (SVM) used for static gesture recognition. A problem of effective gesture recognition is outlined in the context of the system...

Full text to download in external service

Spatial Calibration of a Dual PTZ-Fixed Camera System for Tracking Moving Objects in Video

Publication

- JOURNAL OF IMAGING SCIENCE AND TECHNOLOGY - Year 2013

A dual camera setup is proposed, consisting of a fixed (stationary) camera and a pan-tilt-zoom (PTZ) camera, employed in an automatic video surveillance system. The PTZ camera is zoomed in on a selected point in the fixed camera view and it may automatically track a moving object. For this purpose, two camera spatial calibration procedures are proposed. The PTZ camera is calibrated in relation to the fixed camera image, using interpolated...

Full text to download in external service

Predicting emotion from color present in images and video excerpts by machine learning

Publication

- IEEE Access - Year 2023

This work aims at predicting emotion based on the colors present in images and video excerpts using a machine-learning approach. The purpose of this paper is threefold: (a) to develop a machine-learning algorithm that classifies emotions based on the color present in an image, (b) to select the best-performing algorithm from the first phase and apply it to film excerpt emotion analysis based on colors, (c) to design an online survey...

Full text available to download

Applicability of Emotion Recognition and Induction Methods to Study the Behavior of Programmers

Publication

M. Wróbel

- Applied Sciences-Basel - Year 2018

Recent studies in the field of software engineering have shown that positive emotions can increase and negative emotions decrease the productivity of programmers. In the field of affective computing, many methods and tools to recognize the emotions of computer users were proposed. However, it has not been verified yet which of them can be used to monitor the emotional states of software developers. The paper describes a study carried...

Full text available to download

Parallel implementation of background subtraction algorithms for real-time video processing on a supercomputer platform

Publication

- Journal of Real-Time Image Processing - Year 2016

Results of evaluation of the background subtraction algorithms implemented on a supercomputer platform in a parallel manner are presented in the paper. The aim of the work is to chose an algorithm, a number of threads and a task scheduling method, that together provide satisfactory accuracy and efficiency of a real-time processing of high resolution camera images, maintaining the cost of resources usage at a reasonable level. Two...

Full text available to download

Transport of dangerous goods by rail, and threats to the subsoil of the railway surface in the event of a disaster

Publication

- AIP Conference Proceedings - Year 2023

In Poland, in 2020, the mass of dangerous goods (loads) transported by rail was 26 151.06 thousand tone. This translated into the performance of 8 899 691.89 thousand tone - km of transport performance. In 2020, these figures accounted for 11.72% of the weight of goods transported by rail. The situation is similar in other countries around the world. With such a large volume of transport of dangerous...

Full text available to download

A general approach to study molecular fragmentation and energy redistribution after an ionizing event

Publication

E. Erdmann
N. Aguirre
S. Indrajith
J. Chiarinelli
A. Domaracka
P. Rousseau
B. A. Huber
P. Bolognesi
R. Richter
L. Avaldi... and 3 others

- PHYSICAL CHEMISTRY CHEMICAL PHYSICS - Year 2021

We propose to combine quantum chemical calculations, statistical mechanical methods, and photoionization and particle collision experiments to unravel the redistribution of internal energy of the furan cation and its dissociation pathways. This approach successfully reproduces the relative intensity of the different fragments as a function of the internal energy of the system in photoelectron–photoion coincidence experiments and...

Full text available to download

Recognition of environmentally important ions

Publication

N. Łukasik
E. Wagner-Wysiecka
V. Hubscher-Bruder
M. Bocheńska
S. Michel

- Logistyka - Year 2013

..

Comparison of Acoustic and Visual Voice Activity Detection for Noisy Speech Recognition

Publication

- Year 2016

The problem of accurate differentiating between the speaker utterance and the noise parts in a speech signal is considered. The influence of utilizing a voice activity detection in speech signals on the accuracy of the automatic speech recognition (ASR) system is presented. The examined methods of voice activity detection are based on acoustic and visual modalities. The problem of detecting the voice activity in clean and noisy...

Video Cloud Services for Hospitals: Designing an End-to-End Cloud Service Platform for Medical Video Storage and Secure Access

Publication

P. Pawałowski
C. Mazurek
M. Leszczuk
J. Moureaux
A. Chaabouni

- JMIR Biomedical Engineering - Year 2020

Full text to download in external service

Influence of Thermal Imagery Resolution on Accuracy of Deep Learning based Face Recognition

Publication

- Year 2019

Human-system interactions frequently require a retrieval of the key context information about the user and the environment. Image processing techniques have been widely applied in this area, providing details about recognized objects, people and actions. Considering remote diagnostics solutions, e.g. non-contact vital signs estimation and smart home monitoring systems that utilize person’s identity, security is a very important factor....

Full text available to download

From Linear Classifier to Convolutional Neural Network for Hand Pose Recognition

Publication

P. Rościszewski

- Computer Science - Year 2017

Recently gathered image datasets and the new capabilities of high-performance computing systems have allowed developing new artificial neural network models and training algorithms. Using the new machine learning models, computer vision tasks can be accomplished based on the raw values of image pixels instead of specific features. The principle of operation of deep neural networks resembles more and more what we believe to be happening...

Full text available to download

Deep Learning: A Case Study for Image Recognition Using Transfer Learning

Publication

S. Erpolat Tasabat
O. Aydin

- Year 2021

Deep learning (DL) is a rising star of machine learning (ML) and artificial intelligence (AI) domains. Until 2006, many researchers had attempted to build deep neural networks (DNN), but most of them failed. In 2006, it was proven that deep neural networks are one of the most crucial inventions for the 21st century. Nowadays, DNN are being used as a key technology for many different domains: self-driven vehicles, smart cities,...

Full text to download in external service

Automatic singing quality recognition employing artificial neural networks

Publication

P. Żwan

- Archives of Acoustics - Year 2008

Celem artykułu jest udowodnienie możliwości automatycznej oceny jakości technicznej głosów śpiewaczych. Pokrótce zaprezentowano w nim stworzoną bazę danych głosów śpiewaczych oraz zaimplementowane parametry. Przy pomocy sztucznych sieci neuronowych zaprojektowano system decyzyjny, który oceniono w pięciostopniowej skali jakość techniczną głosu. Przy pomocy metod statystycznych udowodniono, że wyniki generowane przez ten system...

Full text available to download

Application of crowdfunding to video game projects financing

Publication

K. Szopik-Depczyńska
A. Kędzierska-Szczepaniak
K. Szczepaniak

- Procedia Computer Science - Year 2020

Full text to download in external service

Quaternion Encryption Method for Image and Video Transmission

Publication

- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Year 2013

Quaternions are hyper-complex numbers of rank 4. They are often applied to mechanics in 3D space and are considered to be one of the best ways of representing rotations. In this paper a quaternion encryption method, based on algorithm by Nagase et al. (2004) has been proposed. According to a computer-based simulation the results of the performed research yield a high level of security, which is additionally strengthened by the...

Recognition of Emotions in Speech Using Convolutional Neural Networks on Different Datasets

Publication

- Electronics - Year 2022

Artificial Neural Network (ANN) models, specifically Convolutional Neural Networks (CNN), were applied to extract emotions based on spectrograms and mel-spectrograms. This study uses spectrograms and mel-spectrograms to investigate which feature extraction method better represents emotions and how big the differences in efficiency are in this context. The conducted studies demonstrated that mel-spectrograms are a better-suited...

Full text available to download

New Aspects of Virtual Sound Source Localization Research—Impact of Visual Angle and 3-D Video Content on Sound Perception

Publication

- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2013

The influence of image on virtual sound source localization, called the “image proximity effect” or the “ventriloquism effect”, is a well known phenomenon. This paper focuses on other aspects related to this effect, namely the impact of the visual angle of the presented object and 3D video content on sound perception. The research conducted confirmed that the visual angle of the presented object determines the image proximity effect...

Full text available to download

Real-time working gas recognition system based on the array of semiconductor gas sensors and portable computer Raspberry PI

Publication

- Year 2013

The gas-analyzing systems based on the array of partially selective gas sensors and pattern-recognition techniques are potentially fast and low-cost alternative for other devices, like gas analysers. They give the possibility of recognition the type and the concentration of measured volatile compounds in their working environment. In this work we present the implementation of gas recognition system, in which the signals from an...

A Novel IoT-Perceptive Human Activity Recognition (HAR) Approach Using Multi-Head Convolutional Attention

Publication

H. Zhang
Z. Xiao
J. Wang
F. Li
E. Szczerbicki

- IEEE Internet of Things Journal - Year 2019

Together with fast advancement of the Internet of Things (IoT), smart healthcare applications and systems are equipped with increasingly more wearable sensors and mobile devices. These sensors are used not only to collect data, but also, and more importantly, to assist in daily activity tracking and analyzing of their users. Various human activity recognition (HAR) approaches are used to enhance such tracking. Most of the existing...

Full text available to download

Unraveling the Interplay between DNA and Proteins: A Computational Exploration of Sequence and Structure-Specific Recognition Mechanisms

Publication

K. A. Hossain

- Year 2023

My PhD dissertation focused on DNA-protein interactions and the recognition of specific DNA sequences and structures. I discovered that acidic amino acid residues (Asp/Glu) play a crucial role by exhibiting a preference for cytosine. Their contribution to binding affinity depends on nearby cytosines, balancing electrostatic repulsion with specific interactions. Acidic residues act as negative selectors, discouraging non-cytosine...

Full text available to download

Optimizing Medical Personnel Speech Recognition Models Using Speech Synthesis and Reinforcement Learning

Publication

A. Czyżewski

- Journal of the Acoustical Society of America - Year 2023

Text-to-Speech synthesis (TTS) can be used to generate training data for building Automatic Speech Recognition models (ASR). Access to medical speech data is because it is sensitive data that is difficult to obtain for privacy reasons; TTS can help expand the data set. Speech can be synthesized by mimicking different accents, dialects, and speaking styles that may occur in a medical language. Reinforcement Learning (RL), in the...

Full text available to download

Hazards of a flooding event in the city of Gdansk and possible forms of preventing the phenomenon – case study

Publication

- Urban Water Journal - Year 2024

The main objective is to examine the urban flood hazard in the city of Gdansk and to determine the possibilities of preventing this phenomenon. Hydrological and hydraulic modeling was used for the case study analysis of urban flood in Strzyża basin, applying the HEC-HMS and HEC-RAS systems. The result of modeling with the assumption of torrential rainfall with a duration of t = 1 h (from 35 to 58 mm) is the probability of pluvial...

Full text to download in external service

Instrument detection and pose estimation with rigid part mixtures model in video-assisted surgeries

Publication

- MEDICAL IMAGE ANALYSIS - Year 2018

Localizing instrument parts in video-assisted surgeries is an attractive and open computer vision problem. A working algorithm would immediately find applications in computer-aided interventions in the operating theater. Knowing the location of tool parts could help virtually augment visual faculty of surgeons, assess skills of novice surgeons, and increase autonomy of surgical robots. A surgical tool varies in appearance due to...

Full text to download in external service

Video Classification Technology in a Knowledge-Vision-Integration Platform for Personal Protective Equipment Detection: An Evaluation

Publication

C. De
C. Sanin
E. Szczerbicki

- Year 2018

This work is part of an effort for the development of a Knowledge-Vision Integration Platform for Hazard Control (KVIP-HC) in industrial workplaces, adaptable to a wide range of industrial environments. This paper focuses on hazards resulted from the non-use of personal protective equipment (PPE), and examines a few supervised learning techniques to compose the proposed system for the purpose of recognition of three protective...

Full text to download in external service

Viruses, cancer and non-self recognition

Publication

M. Padariya
U. Kalathiya
S. Mikac
K. Dziubek
M. Tovar
E. Sroka
R. Fahraeus
A. Sznarkowska

- Open Biology - Year 2021

Full text to download in external service

Face Recognition: Shape versus Texture

Publication

M. Smiatacz

- Year 2015

This paper describes experiments related to the application of well-known techniques of the texture feature extraction (Local Binary Patterns and Gabor filtering) to the problem of automatic face verification. Results of the tests show that simple image normalization strategy based on the eye center detection and a regular grid of fiducial points outperforms the more complicated approach, employing active models that are able to...

Full text to download in external service

Balance recognition on the basis of EEG measurement.

Publication

- Annals of Computer Science and Information Systems - Year 2016

Although electroencephalography (EEG) is not typically used for verifying the sense of balance, it can be used for analysing cortical signals responsible for this phenomenon. Simple balance tasks can be proposed as a good indicator of whether the sense of balance is acting more or less actively. This article presents preliminary results for the potential of using EEG to balance sensing....

Full text available to download

Role of cholesterol in substrate recognition by -secretase

Publication

- Scientific Reports - Year 2021

-Secretase is an enzyme known to cleave multiple substrates within their transmembrane domains, with the amyloid precursor protein of Alzheimer’s Disease among the most prominent examples. The activity of -secretase strictly depends on the membrane cholesterol content, yet the mechanistic role of cholesterol in the substrate binding and cleavage remains unclear. In this work, we used all-atom molecular dynamics simulations to examine...

Full text available to download

Automatic system for audio-video material reconstruction and archiving

Publication

- Year 2008

Referat przedstawia propozycję modelu systemu automatycznej archiwizacji i rekonstrukcji nagrań audio-wideo. Założeniem tego rozwiązania jest uczynienie procesu rekonstrukcji nagrań bardziej niezależnym od człowieka. Ma to na celu redukcję kosztów rekonstrukcji przetwarzanych nagrań. Z powodu dużej liczby archiwalnych nagrań audio-wideo istnieje potrzeba stworzenia systemu który umożliwi automatyczną indeksację ich treści. Pomoże...

Search

Filters

Catalog

Category

Year

Options

Search results for: VIDEO EVENT RECOGNITION