Search results for: video data
-
Towards Knowledge Formalization and Sharing in a Cognitive Vision Platform for Hazard Control (CVP-HC)
PublicationHazards are present in all workplaces and can result in serious injuries, short and long-term illnesses, or death. In this context, management of safety is essential to ensure the occupational health of workers. Aiming to assist the safety manage-ment process, especially in industrial environments, a Cognitive Vision Platform for Hazard Control (CVP-HC) has been proposed. The CVP-HC is a scalable yet adaptable system capable of...
-
Influence of Toll Collection Method on Motorways on Traffic Safety and Efficiency
PublicationThe paper presents the results of the effectiveness analysis for various forms of toll collection on motorway gates and their impact on safety and efficiency of traffic. The results from field tests that were performed on the A1 motorway in Rusocin were used for the analysis. The research included the analysis of the video image of driver behaviour on the way to the toll booths and at toll booths. The PTV VISSIM tool was used to...
-
Controlling computer by lip gestures employing neural network
PublicationResults of experiments regarding lip gesture recognition with an artificial neural network are discussed. The neural network module forms the core element of a multimodal human-computer interface called LipMouse. This solution allows a user to work on a computer using lip movements and gestures. A user face is detected in a video stream from a standard web camera using a cascade of boosted classifiers working with Haar-like features....
-
Multimedia interface using head movements tracking
PublicationThe presented solution supports innovative ways of manipulating computer multimedia content, such as: static images, videos and music clips and others that can be browsed subsequently. The system requires a standard web camera that captures images of the user face. The core of the system is formed by a head movement analyzing algorithm that finds a user face and tracks head movements in real time. Head movements are tracked with...
-
Obtaining a Well-Trained Artificial Intelligence Algorithm from Cross-Validation in Endoscopy
PublicationThe article shortly discusses endoscopic video analysis problems and artificial intelligence algorithms supporting it. The most common method of efficiency testing of these algorithms is to perform intensive cross-validation. This allows for accurately evaluate their performance of generalization. One of the main problems of this procedure is that there is no simple and universal way of obtaining a specific instance of a well-trained...
-
Cross-domain applications of multimodal human-computer interfaces
PublicationDeveloped multimodal interfaces for education applications and for disabled people are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with mouth gestures and audio interface for speech stretching for hearing impaired and stuttering people and intelligent pen allowing for diagnosing and ameliorating developmental dyslexia. The eye-gaze tracking system named...
-
A comparative study of English viseme recognition methods and algorithms
PublicationAn elementary visual unit – the viseme is concerned in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative research on their effectiveness. In the course of the study an optimal feature vector construction...
-
A comparative study of English viseme recognition methods and algorithm
PublicationAn elementary visual unit – the viseme is concerned in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative research on their effectiveness. In the course of the study an optimal feature vector...
-
Impact of Shifting Time-Window Post-Processing on the Quality of Face Detection Algorithms
PublicationWe consider binary classification algorithms, which operate on single frames from video sequences. Such a class of algorithms is named OFA (One Frame Analyzed). Two such algorithms for facial detection are compared in terms of their susceptibility to the FSA (Frame Sequence Analysis) method. It introduces a shifting time-window improvement, which includes the temporal context of frames in a post-processing step that improves the...
-
IMAGE CORRELATION AS A TOLL FOR TRACKING FACIAL CHANGES CAUSING BY EXTERNAL STIMULI
PublicationExpressions of the human face bring a lot of information, which are a valuable source in the areas of computer vision, remote sensing and affective computing. For years, by analyzing the movement of the skin and facial muscles scientists are trying to create the perfect tool, based on image analysis, allowing the recognition of emotional states of human beings. To create a reliable algorithm, it is necessary to explore and examine...
-
Virtual Whiteboard: A gesture-controlled pen-free tool emulating school whiteboard
PublicationIn the paper the so-called Virtual Whiteboard is presented which may be an alternative solution for modern electronic whiteboards based on electronic pens and sensors. The presented tool enables the user to write, draw and handle whiteboard contents using his/her hands only. An additional equipment such as infrared diodes, infrared cameras or cyber gloves is not needed. The user's interaction with the Virtual Whiteboard computer...
-
Multimodal Surveillance Based Personal Protection System
PublicationA novel, multimodal approach for automatic detection of abduction of a protected individual, employing dedicated personal protection device and a city monitoring system is proposed and overviewed. The solution is based on combining four modalities (signals coming from: Bluetooth, fixed and PTZ cameras, thermal camera, acoustic sensors). The Bluetooth signal is used continuously to monitor the protected person presence, and in case...
-
Cooperation of mono- and bi-articular muscles: human lower limb
PublicationObjectives: The aim of this study was to create and analyze a Pareto-optimal problem that would describe cooperation between mono- and bi-articulate lower limb muscles in sagittal plane. Methods: Equations describing the problem were derived and analyzed, additional constrains were introduced and experimental verification based on gait video analysis was performed. Results: Uncertainty of Pareto-optimal solution is shown for the...
-
Improving Traffic Light Recognition Methods using Shifting Time-Windows
PublicationWe propose a novel method of improving algorithms recognizing traffic lights in video sequences. Our focus is on algorithms for applications which notify the driver of a light in sight. Many existing methods process images in the recording separately. Our method bases on the observation that real-life videos depict underlying continuous processes. We named our method FSA (Frame Sequence Analyzed). It is applicable for any underlying...
-
Mobile virtual presence in civil engineering education and training
PublicationEducation technologies are on the cusp, emerging technologies connected with each other might reveal new added value in this field. In this paper, the author showed a framework for conducting civil engineering classes with a virtual presence on the construction site. The presented idea might be realized by use of the unnamed aerial vehicle equipped with 360 video camera. On-site activities are transmitted at a distance by GSM and...
-
Deep Features Class Activation Map for Thermal Face Detection and Tracking
PublicationRecently, capabilities of many computer vision tasks have significantly improved due to advances in Convolutional Neural Networks. In our research, we demonstrate that it can be also used for face detection from low resolution thermal images, acquired with a portable camera. The physical size of the camera used in our research allows for embedding it in a wearable device or indoor remote monitoring solution for elderly and disabled...
-
The Hough transform in the classification process of inland ships
PublicationThis article presents an analysis of the possibilities of using image processing methods for feature extraction that allows kNN classification based on a ship’s image delivered from an on-water video surveillance system. The subject of the analysis is the Hough transform which enables the detection of straight lines in an image. The recognized straight lines and the information about them serve as features in the classification...
-
Pomorskie drogi ku Niepodległej
EventsPolitechnika Otwarta zaprasza na premierowy pokaz filmu Jana Butowskiego pt. „Pomorskie drogi ku Niepodległej”. Poznamy sylwetki bohaterów, miejsca i wydarzenia, które odegrały znaczącą rolę na drodze ku niepodległości.
-
Study on CPU and RAM Resource Consumption of Mobile Devices using Streaming Services
PublicationStreaming multimedia services have become very popular in recent years, due to the development of wireless networks. With the growing number of mobile devices worldwide, service providers offer dedicated applications that allow to deliver on-demand audio and video content anytime and everywhere. The aim of this study was to compare different streaming services and investigate their impact on the CPU and RAM resources, with respect...
-
QR CODE JAKO NARZĘDZIE KOMUNIKACJI Z KLIENTAMI
PublicationKażdą nową technologię użytkownicy muszą zacząć postrzegać jako użyteczną, aby mogła się ona upowszechnić. W ten sposób działania z nią związane stają się codziennością. Jednak oprócz użyteczności dla konsumentów istotna jest też prostota użytkowania technologii. Obie cechy dotyczą technologii mobilnych i związanych z nimi działań określanych mianem marketingu mobilnego . Zalicza się do nich: wysyłanie SMS i MMS, włączanie bluetooth...
-
An new method of audio-visual correlation analysis
PublicationThis paper presents a new methodology of conducting the audio-visual correlation analysis employing the gaze tracking system. Interaction between two perceptual modalities, seeing and hearing, their interaction and mutual reinforcement in a complex relationship was a subject of many research studies. Earlier stage of the carried out experiments at the Multimedia Systems Department (MSD) showed that there exists a relationship between...
-
English Language Learning Employing Developments in Multimedia IS
PublicationIn the realm of the development of information systems related to education, integrating multimedia technologies offers novel ways to enhance foreign language learning. This study investigates audio-video processing methods that leverage real-time speech rate adjustment and dynamic captioning to support English language acquisition. Through a mixed-methods analysis involving participants from a language school, we explore the impact...
-
Bimodal Emotion Recognition Based on Vocal and Facial Features
PublicationEmotion recognition is a crucial aspect of human communication, with applications in fields such as psychology, education, and healthcare. Identifying emotions accurately is challenging, as people use a variety of signals to express and perceive emotions. In this study, we address the problem of multimodal emotion recognition using both audio and video signals, to develop a robust and reliable system that can recognize emotions...
-
Modelling and Analysis of the Positioning Accuracy in the Loading Systems of Mobile Cranes
PublicationIn this work, the authors analyse the influence of the order and range of sequential movements of a crane's working members on the accuracy of the final cargo positioning. The analysis was conducted on the basis of a specially developed method in which the authors proposed the introduction of a geometrical indicator of positioning the load in the intermediate positions (after completing each movement sequence) and in the target...
-
Cognition and Decisional Experience to Support Safety Management in Workplaces
PublicationHazards are present in all workplaces and can result in serious injuries, short and long-term illnesses, or death. In this context, management of safety is essential to ensure the occupational health of workers. Aiming to assist the safety management process, especially in industrial environments, a Cognitive Vision Platform for Hazard Control (CVP-HC) is proposed. This platform is a Cyber Physical system, capable of identifying...
-
Long-term comparative evaluation of an acoustic climate in selected schools before and after the acoustic treatment
PublicationThe results of long-term continuous noise measurements in two selected schools are presented in the paper. Noise characteristics were measured continuously there for approximately 16 months. Measurements started eight months prior to the acoustic treatment of the school corridors of both schools. An evaluation of the acoustic climates in both schools, before and after the acoustic treatment, was performed based on comparison of...
-
Evaluating the Use of Edge Device Towards Fall Detection in Smart City Environment
PublicationThis paper presents the development and preliminary testing of a fall detection algorithm that leverages OpenPose for real-time human pose estimation from video feeds. The system is designed to function optimally within a range of up to 7 meters from ground-level cameras, focusing exclusively on detected human silhouettes to enhance processing efficiency. The performance of the proposed approach was evaluated using accuracy values...
-
Low-Power Receivers for Wireless Capacitive Coupling Transmission in 3-D-Integrated Massively Parallel CMOS Imager
PublicationThe paper presents pixel receivers for massively parallel transmission of video signal between capacitive coupled integrated circuits (ICs). The receivers meet the key requirements for massively parallel transmission, namely low-power consumption below a single μW, small area of less than 205 μm2, high sensitivity better than 160 mV, and good immunity to crosstalk. The receivers were implemented and measured in a 3-D IC (two face-to-face...
-
On the Consumption of Multimedia Content Using Mobile Devices: a Year to Year User Case Study
PublicationIn the early days, consumption of multimedia content related with audio signals was only possible in a stationary manner. The music player was located at home, with a necessary physical drive. An alternative way for an individual was to attend a live performance at a concert hall or host a private concert at home. To sum up, audio-visual effects were only reserved for a narrow group of recipients. Today, thanks to portable players,...
-
Broadening the scope of measurement and analysis of vibrations of an organ pipe employing intensity probe, simulations, and highspeed camera
PublicationThis paper shows an integrated approach to measure, analyze, and model phenomena occurring in an organ pipe driven by pressurized air. The aim of this paper is two-fold, i.e., to measure the pressure signal and the intensity field around the mouth by means of an intensity probe and to visualize and observe the motion of the air jet, which represents the excitation mechanism of the system. This is realized through two techniques,...
-
DevEmo—Software Developers’ Facial Expression Dataset
PublicationThe COVID-19 pandemic has increased the relevance of remote activities and digital tools for education, work, and other aspects of daily life. This reality has highlighted the need for emotion recognition technology to better understand the emotions of computer users and provide support in remote environments. Emotion recognition can play a critical role in improving the remote experience and ensuring that individuals are able...
-
Estimation of DC motor parameters using a simple CMOS camera
PublicationDifferent components of control systems for mobile robots are based on dynamic models. In low-cost solutions such a robot is wheeled and equipped with DC motors, which have to be included in the model of the robot. The model is fairly simple but determination of its parameters needs not to be easy. For instance, DC motor parameters are typically identified indirectly using suitable measurements, concerning engine voltage, current,...
-
An integrated e-learning services management system providing HD videoconferencing and CAA services
PublicationIn this paper we present a novel e-learning services management system, designed to provide highly modifiable platform for various e-learning tools, able to fulfill its function in any network connectivity conditions (including no connectivity scenario). The system can scale from very simple setup (adequate for servicing a single exercise) to a large, distributed solution fit to support an enterprise. Strictly modular architecture...
-
Diagnosing wind turbine condition employing a neural network to the analysis of vibroacoustic signals
PublicationIt is important from the economic point of view to detect damage early in the wind turbines before failures occur. For this purpose, a monitoring device was built that analyzes both acoustic signals acquired from the built-in non-contact acoustic intensity probe, as well as from the accelerometers, mounted on the internal devices in the nacelle. The signals collected in this way are used for long-term training of the autoencoder...
-
Novel Vision Monitoring Method Based on Multi Light Points for Space-Time Analysis of Overhead Contact Line Displacements
PublicationThe article presents an innovative vision monitoring method of overhead contact line (OCL) displacement, which utilizes a set of LED light points installed along it. A light point is an, LED fed from a battery. Displacements of the LED points, recorded by a camera, are interpreted as a change of OCL shape in time and space. The vision system comprises a camera, properly situated with respect to the OCL, which is capable of capturing...
-
Long Distance Vital Signs Monitoring with Person Identification for Smart Home Solutions
PublicationAbstract— Imaging photoplethysmography has already been proved to be successful in short distance (below 1m). However, most of the real-life use cases of measuring vital signs require the system to work at longer distances, to be both more reliable and convenient for the user. The possible scenarios that system designers must have in mind include monitoring of the vital signs of residents in nursing homes, disabled people, who...
-
Personal Brand Value and Social Media, the Top Football Players’ Case
PublicationPersonal branding valuation and social media usage are new and empirically unexplored areas of research. The aim of the presented study is to determine how social media performance and sentiment are related to the value of a personal brand. Based on an example of 100 most valuable football players, in reference to transfermarkt.com and the sentione.com (sentiment analysis), the author points out the strongest...
-
Hardware realization of shadow detection algorithm in FPGA
PublicationW referacie opisano problem detekcji cieni w sekwencjach wideo. Na podstawie metod znanych z literatury opracowano algorytm detekcji cieni, działający w czasie rzeczywistym i przeznaczony do realizacji sprzętowej w układzie FPGA. Algorytmy zostały przetestowane i porównane w środowisku MATLAB. Za pomocą języka VHDL zrealizowano system detekcji cieni wykorzystujący opracowany algorytm i zaimplementowano go w układzie Virtex-4. Został...
-
An audio-visual corpus for multimodal automatic speech recognition
Publicationreview of available audio-visual speech corpora and a description of a new multimodal corpus of English speech recordings is provided. The new corpus containing 31 hours of recordings was created specifically to assist audio-visual speech recognition systems (AVSR) development. The database related to the corpus includes high-resolution, high-framerate stereoscopic video streams from RGB cameras, depth imaging stream utilizing Time-of-Flight...
-
Acoustic radar employing particle velocity sensors
PublicationA concept, practical realization and applications of a passive acoustic radar to automatic localization, tracking of sound sources were presented in the paper. The device consist of the new kind of multichannel miniature sound intensity sensors and a group of digital signal processing algorithms. Contrary to active radars, it does not emit the scanning beam but after receiving surroundings sounds it provide information about the...
-
Łączony fingerprinting i kryptograficzne zabezpieczenie danych z wykorzystaniem szyfru Hilla
PublicationW referacie przedstawiono rozwinięcie metody Hillcast znakowania obrazów, która należy do grupy metod wprowadzających fingerprinting przy deszyfracji zabezpieczonego kryptograficznie obrazu (ang. JFD (Joint Fingerprinting and Decryption)), której głównym przeznaczeniem są usługi wideo na żądanie (ang. VoD (Video on Demand)). Metoda zapewnia kryptograficzne bezpieczeństwo oraz znakowanie treści multimedialnych, przy jednoczesnym...
-
Genetic programming extension to APF-based monocular human body pose estimation
PublicationNew method of the human body pose estimation based on a single camera 2D observation is presented, aimed at smart surveillance related video analysis and action recognition. It employs 3D model of the human body, and genetic algorithm combined with annealed particle filter for searching the global optimum of model state, best matching the object's 2D observation. Additionally, new motion cost metric is employed, considering current...
-
Image Representation for Cognitive Systems Using SOEKS and DDNA: A Case Study for PPE Compliance
PublicationCognitive Vision Systems have gained significant interest from academia and industry during the past few decade, and one of the main reasons behind this is the potential of such technologies to revolutionize human life as they intend to work under complex visual scenes, adapting to a comprehensive range of unforeseen changes, and exhibiting prospective behavior. The combination of these properties aims to mimic the human capabilities...
-
MACHINE LEARNING–BASED ANALYSIS OF ENGLISH LATERAL ALLOPHONES
PublicationAutomatic classification methods, such as artificial neural networks (ANNs), the k-nearest neighbor (kNN) and selforganizing maps (SOMs), are applied to allophone analysis based on recorded speech. A list of 650 words was created for that purpose, containing positionally and/or contextually conditioned allophones. For each word, a group of 16 native and non-native speakers were audio-video recorded, from which seven native speakers’...
-
Soft Real-Time Communication with WebSocket and WebRTC Protocols Performance Analysis for Web-based Control Loops
PublicationThe web browser has become an access window for content and services. The browser is available on almost any device connected to the network, regardless of its intended use: desktop, mobile device, computing server, e-book reader etc. Browsers are used by people to read news, contact the world, to check a bank account, register a visit at the doctor, watching video content, electronic purchases, using web versions of the office...
-
Measurement of Seafloor Acoustic Backscatter Angular Dependence at 150 kHz Using a Multibeam Echosounder
PublicationAcoustic seafloor measurements with multibeam echosounders (MBESs) are currently often used for submarine habitat mapping, but the MBESs are usually not acoustically calibrated for backscattering strength (BBS) and cannot be used to infer absolute seafloor angular dependence. We present a study outlining the calibration and showing absolute backscattering strength values measured at a frequency of 150 kHz at around 10–20 m water...
-
Publicly available lecture webcasts - e-learning or promotion tool? case study
PublicationThis paper aims to show how universities interact with Internet users by webcasting selected courses. Paper has exploratory case-study character, presenting example of Berkeley Webcast initiative of University of California, Berkeley, webcasting undergraduate courses and on-campus events. On the base of short introduction to webcasting usage as an e-learning and promotional tool, the analysis of 3 purposely chosen different courses...
-
The effect of groyne field on trapping macroplastic. Preliminary results from laboratory experiments
PublicationMacroplastic, a precursor of microplastic pollution, has become a new scope of research interest. However, the physical processes of macroplastic transport and deposition in rivers are poorly understood, which makes the decisions of where to locate macroplastic trapping infrastructure difficult. In this research, we conducted a series of experiments in a laboratory channel, exploring the impact of groynes and flexible artificial...
-
Improving automatic surveillance by sound analysis
PublicationAn automatic surveillance system, based on event detection in the video image can be improved by implementing algorithms for audio analysis. Dangerous or illegal actions are often connected with distinctive sound events like screams or sudden bursts of energy. A method for detection and classification of alarming sound events is presented. Detection is based on the observation of sudden changes in sound level in distinctive sub-bands...
-
Visual Content Representation for Cognitive Systems: Towards Augmented Intelligence
PublicationCognitive Vision Systems have gained significant attention from academia and industry during the past few decades. One of the main reasons behind this interest is the potential of such technologies to revolutionize human life since they intend to work robustly under complex visual scenes (which environmental conditions may vary), adapting to a comprehensive range of unforeseen changes, and exhibiting prospective behavior. The combination...