Search results for: video data

Total results: 656

MODALITY corpus - SPEAKER 32 - COMMANDS C5
Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 35 - COMMANDS C5
Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 33 - COMMANDS C4
Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 27 - SEQUENCE S3
Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 27 - COMMANDS C3
Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 33 - SEQUENCE S5
Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 27 - SEQUENCE S2
Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
The Physiological Effects of ASMR on Anxiety
Publication
- S. Seifzadeh
- V. Asayesh
- M. Torabi
- M. Dehghani
- E. Rabbani
- F. Asgharianasl
- Frontiers in Biomedical Technologies - Year 2023
Purpose: Autonomous Sensory Meridian Response (ASMR) is a novel phenomenon that is currently very popular on YouTube and Reddit due to its anti-anxiety effects. As the name suggests, ASMR is a relaxing, warm sensation that begins on the scalp and spreads throughout the body. This technique is also known as "brain massage," and it relies on soothing sights and sounds, like whispers and slow movements. Investigating these videos is primarily motivated...

Full text available for download in the portal
Tensile modulus of human orbital wall bones cut in sagittal and coronal planes
Publication
- K. Żerdzicki
- P. Lemski
- P. Kłosowski
- A. Skorek
- M. A. Zmuda Trzebiatowski
- M. Koberda
- PLOS ONE - Year 2021
In the current research, 68 specimens of orbital superior and/or medial walls taken from 33 human cadavers (12 females, 21 males) were subjected to uniaxial tension until fracture. The samples were cut in the coronal (38 specimens) and sagittal (30 specimens) planes of the orbital wall. Apparent density (ρapp), tensile Young’s modulus (E-modulus) and ultimate tensile strength (UTS) were identified. Innovative test protocols were...

Full text available for download in the portal
Video of LEGO bricks on conveyor belt - minifigures, animals, plants and accessories
Research Data
open access
- T. Boiński
- series: Video of LEGO bricks on conveyor belt
The set contains videos of LEGO bricks (minifigures, animals, plants and accessories) moving on a white conveyor belt. The images were prepared for training a neural network to recognize LEGO bricks. The bricks were separated as much as possible and in most cases they should not overlap. The images were taken from different sides by a stationary camera...
Piotr Dalka, MSc Eng.

People
MODALITY corpus - SPEAKER 17 - SEQUENCE S1
Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 17 - SEQUENCE S4
Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 17 - SEQUENCE S2
Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 17 - SEQUENCE S5
Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 17 - SEQUENCE S3
Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 17 - SEQUENCE S6
Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
Vident-lab: a dataset for multi-task video processing of phantom dental scenes
Research Data
open access
We introduce a new, asymmetrically annotated dataset of natural teeth in phantom scenes for multi-task video processing: restoration, teeth segmentation, and inter-frame homography estimation. Pairs of frames were acquired with a beam splitter. The dataset comprises a low-quality frame, its high-quality counterpart, a teeth segmentation mask, and...
Hardware-Software Implementation of a Sensor Network for City Traffic Monitoring Using the FPGA- and ASIC-Based Sensor Nodes
Publication
- Journal of Signal Processing Systems for Signal Image and Video Technology - Year 2013
The article describes a prototype sensor network for monitoring vehicle traffic in a city. The sensor network nodes, each equipped with a low-resolution camera, observe the streets and detect moving objects. Object detection is based on a custom image segmentation algorithm that uses double background subtraction together with edge and shadow detection, and runs on a dedicated microelectronic system of the ''System...

Full text available for download in the portal
Joanna Raczek, PhD Eng.

People

Department of Algorithms and System Modelling

Education: 1997 -- 2001 Engineering studies, Faculty of Applied Physics and Mathematics, Gdańsk University of Technology. Field of study: Mathematics, specialization: Applied Mathematics. 2001 -- 2003 Master's studies, Faculty of Applied Physics and Mathematics, Gdańsk University of Technology. Field of study: Mathematics, specialization: Applied Mathematics. 2000 -- 2004 Engineering studies, Faculty of Electronics, Informatics and Telecommunications,...
International Symposium on Audio, Video, Image Processing and Intelligent Applications

Conferences
IEEE International Conference on Advanced Video and Signal Based Surveillance

Conferences
Post-production of a video recording with surround sound
Publication
- Year 2009
One of the aims of this paper is to present issues related to audio-video correlation. This is presented on the basis of a short film production employing surround microphone techniques. First, some related works in the domain of sound and vision correlation are presented. Then assumptions concerning scene creation related to both audio and video are briefly described. Another objective is to discuss results of subjective tests...
Bimodal deep learning model for subjectively enhanced emotion classification in films
Publication
- D. Weber
- B. Kostek
- INFORMATION SCIENCES - Year 2024
This research delves into the concept of color grading in film, focusing on how color influences the emotional response of the audience. The study commenced by recalling state-of-the-art works that process audio-video signals and associated emotions by machine learning. Then, assumptions of subjective tests for refining and validating an emotion model for assigning specific emotional labels to selected film excerpts were presented....

Full text available for download from an external service
Distributed Framework for Visual Event Detection in Parking Lot Area
Publication
- Communications in Computer and Information Science - Year 2011
The paper presents a framework for automatic detection of various events occurring in a parking lot, based on multi-camera video analysis. The framework is massively distributed, both in the logical and physical sense. It consists of several entities called node stations that use the XMPP protocol for internal communication and the SRTP protocol with the Jingle extension for video streaming. Recognized events include detecting parking...

Full text available for download from an external service
Information Systems & Technologies / SPIE Conference on Digital Video Compression Algorithms & Techniques

Conferences
SkinDepth - synthetic 3D skin lesion database
Research Data
version 1.0 open access
- A. Jezierska
- M. Woźniak
SkinDepth is the first synthetic 3D skin lesion database. The release of SkinDepth dataset intends to contribute to the development of algorithms for:
Distributed storage of backup data copies
Publication
- J. Kuchta
- Year 2012
A method is presented for using a distributed processing system to protect an institution against the effects of a hacker attack combined with the destruction of that institution's database. The method consists of interleaving data packets into audio-video material downloaded by internet users of Video-on-Demand film services, and storing the data dispersed across hundreds or even thousands of computers.

Full text available for download from an external service
Model of emotions for game players
Publication
- W. Szwoch
- Year 2015
Affect-aware video games can respond to a game player's emotions. Such games seem to be more attractive for users. For this kind of game, it is therefore necessary to create a model of the player's emotions in order to know which emotions the application should react to. The paper describes different models of emotions. A questionnaire and an experiment for video game players are presented. Some results of the tests are shown. Then the model...

Full text available for download from an external service
Material for Automatic Phonetic Transcription of Speech Recorded in Various Conditions
Publication
- Year 2016
Automatic speech recognition (ASR) is under constant development, especially for cases in which speech is casually produced, acquired in various environmental conditions, or recorded in the presence of background noise. Phonetic transcription is an important step in the process of full speech recognition and is the main focus of the presented work. ASR is widely implemented in mobile device technology, but...

Full text available for download from an external service
New Tool for Examining QoS in the VToIP Service
Publication
- T. Uhl
- K. Nowicki
- S. Paulsen
- Journal of Telecommunications and Information Technology - Year 2014
This paper is dedicated to the subject of measuring QoS in the Video Telephony over IP (VToIP) service. QoS measurement models in general and then models designed specifically for measuring QoS in the VToIP service are presented. A new numerical tool for examining the quality of VToIP video streams is described. The tool’s functionality is then put to the test in a number of analysis scenarios. The results and insights gained...

Full text available for download in the portal
Endoscopic Videos Deinterlacing and On-Screen Text and Light Flashes Removal and Its Influence on Image Analysis Algorithms' Efficiency
Publication
- International Journal of Image Processing and Visual Communication - Year 2013
In this article, methods for deinterlacing endoscopic video images and for removing on-screen text and light flashes from them are discussed. The research is intended to improve the performance of disease recognition algorithms. In the article, four configurations of deinterlacing methods and another four configurations of text and flashes removal methods are described and examined. The efficiency of endoscopic video analysis algorithms is measured...

Full text available for download from an external service
EXPERIMENTAL ANALYSIS OF CONNECTION BETWEEN OBJECT-ORIENTED METRICS AND SOFTWARE CHANGEABILITY
Publication
- T. Sanner
- A. Czyżewski
- Year 2013
In this work, the ISO/IEC 9126 standard was used to assess the quality of video surveillance software, with a particular focus on the maintainability of the software system. The paper presents a study on the connection between software metrics derived from the static analysis of the source code and the changeability of the video surveillance software system. It is shown that meeting requirements of software quality metrics may result...
Performance Evaluation of Selected Parallel Object Detection and Tracking Algorithms on an Embedded GPU Platform
Publication
- G. Szwoch
- M. Szczodrak
- Year 2017
Performance evaluation of selected complex video processing algorithms, implemented on a parallel, embedded GPU platform Tegra X1, is presented. Three algorithms were chosen for evaluation: a GMM-based object detection algorithm, a particle filter tracking algorithm and an optical-flow-based algorithm devoted to people counting in a crowd flow. The choice of these algorithms was based on their computational complexity and parallel...

Full text available for download from an external service
Moving object detection and tracking for the purpose of multimodal surveillance system in urban areas
Publication
- A. Czyżewski
- P. Dalka
- Year 2008
A background subtraction method based on a mixture of Gaussians was employed to detect all regions in a video frame that denote moving objects. Kalman filters were used for establishing relations between the regions and real moving objects in a scene and for tracking them continuously. The objects were represented by rectangles. The coupling of objects with the corresponding regions, including many-to-many relations, was studied experimentally...
Building Knowledge for the Purpose of Lip Speech Identification
Publication
- Advances in Intelligent Systems and Computing - Year 2017
Consecutive stages of building knowledge for automatic lip speech identification are shown in this study. The main objective is to prepare audio-visual material for phonetic analysis and transcription. First, approximately 260 sentences of natural English were prepared taking into account the frequencies of occurrence of all English phonemes. Five native speakers from different countries read the selected sentences in front of...

Full text available for download from an external service
Eulerian motion magnification applied to structural health monitoring of wind turbines
Publication
- S. Cygert
- A. Czyżewski
- Journal of the Acoustical Society of America - Year 2018
Several types of defects may occur in wind turbines, such as physical damage to blades or gearbox malfunction. A wind farm monitoring and damage prediction system is built to observe abnormal vibrations of wind turbine elements: blades, nacelle, and tower. Contactless methods are developed which do not require stopping the turbine. In this work, structural health monitoring of a wind turbine is evaluated using a conversion from the captured...

Full text available for download from an external service
Architecture Design of a Networked Music Performance Platform for a Chamber Choir
Publication
- J. Cychnerski
- B. Mróz
- Communications in Computer and Information Science - Year 2022
This paper describes an architecture design process for a Networked Music Performance (NMP) platform for medium-sized conducted music ensembles, based on remote rehearsals of the Academic Choir of Gdańsk University of Technology. The issues of real-time remote communication, in-person music performance, and NMP are described. Three iterative steps defining and extending the architecture of the NMP platform with additional features to...

Full text available for download in the portal
Commercial systems for automatic detection of events. Features, limitations and potential solutions
Publication
- G. Szwoch
- Year 2009
Video Content Analysis (VCA). Motion detection. Example of complex VCA system. Commercial VCA systems and their applications. Limitations and problems. Possible solutions.
Simple gait parameterization and 3D animation for anonymous visual monitoring based on augmented reality
Publication
- P. Szczuko
- MULTIMEDIA TOOLS AND APPLICATIONS - Year 2016
The article presents a method for video anonymization and replacing real human silhouettes with virtual 3D figures rendered on a screen. The video stream is processed to detect and track objects, whereas the anonymization stage animates avatars according to the behavior of detected persons. Location, movement speed, direction, and person height are taken into account during animation and rendering phases. This approach requires...

Full text available for download in the portal
The concept of aida applied to online interactive advertisement: an youtube case study
Publication
- B. Schivinski
- Year 2012
This paper presents an approach to applying the AIDA framework to interactive advertisements presented on social media channels. The first section introduces the definitions of social media and its categorization. An overview of the online video service YouTube.com is given. The second section describes social media marketing. The third section presents a theoretical introduction to traditional and interactive advertisement....
Network and Operating System Support for Digital Audio and Video (Network and OS Support for Digital A/V)

Conferences
Performance evaluation of the parallel object tracking algorithm employing the particle filter
Publication
- G. Szwoch
- Year 2016
An algorithm based on particle filters is employed to track moving objects in video streams from fixed and non-fixed cameras. Particle weighting is based on color histograms computed in the iHLS color space. Particle computations are parallelized with the CUDA framework. The algorithm was tested on various GPU devices: a desktop GPU card, a mobile chipset and two embedded GPU platforms. The processing speed depending on the number...
The Innovative Faculty for Innovative Technologies
Publication
- Year 2013
A leaflet describing the Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology. The Multimedia Systems Department describes its laboratories and prototypes of: Auditory-visual attention stimulator, Automatic video event detection, Object re-identification application for multi-camera surveillance systems, Object Tracking and Automatic Master-Slave PTZ Camera Positioning System, Passive Acoustic Radar,...

Full text available for download from an external service
Augmented Reality for Privacy-Sensitive Visual Monitoring
Publication
- P. Szczuko
- Year 2014
The paper presents a method for video anonymization and replacing real human silhouettes with virtual 3D figures rendered on the screen. The video stream is processed to detect and track objects, whereas the anonymization stage employs a fast blurring method. Substitute 3D figures are animated according to the behavior of detected persons. Their location, movement speed, direction, and person height are taken into account during the animation...

Full text available for download from an external service
Examining Quality of Hand Segmentation Based on Gaussian Mixture Models
Publication
- Year 2014
Results of an examination of various implementations of Gaussian mixture models are presented in the paper. Two of the implementations belong to Intel’s OpenCV 2.4.3 library and use the BackgroundSubtractorMOG and BackgroundSubtractorMOG2 classes. The third implementation presented in the paper was created by the authors and extends BackgroundSubtractorMOG2 with the possibility of operating on the scaled version of...

Full text available for download from an external service
Objects classification based on their physical sizes for detection of events in camera images
Publication
- Year 2008
The paper presents a method for estimating the physical sizes of objects tracked in a video surveillance system, and a simple module for object classification based on the estimated physical sizes. The results of object classification are then used for automatic detection of various types of events in the camera image.
Camera Orientation-Independent Parking Events Detection
Publication
- Year 2011
The paper describes a method for detecting the precise position and time of vehicles parking in a parking lot. This task is trivial with a favorable camera orientation but gets much more complex when the angle between the camera viewing axis and the ground is small. The method utilizes background subtraction and object tracking algorithms for detecting moving objects in a video stream. Objects are classified into vehicles and...

Full text available for download from an external service
Semantic Integration of Heterogeneous Recognition Systems
Publication
- P. Kaczmarek
- P. Raszkowski
- LECTURE NOTES IN COMPUTER SCIENCE - Year 2011
Computer perception of real-life situations is performed using a variety of recognition techniques, including video-based computer vision, biometric systems, RFID devices and others. The proliferation of recognition modules enables development of complex systems by integration of existing components, analogously to the Service Oriented Architecture technology. In the paper, we propose a method that enables integration of information...
Eye Blink Based Detection of Liveness in Biometric Authentication Systems Using Conditional Random Fields
Publication
- M. Szwoch
- P. Pieniążek
- Year 2012
The goal of this paper was to verify whether conditional random fields are suitable and efficient enough for eye blink detection in user authentication systems based on face recognition with a standard web camera. To evaluate this approach, several experiments were carried out using a specially developed test application and video database.
