Search results for: DEPTH IMAGE
-
CNN Architectures for Human Pose Estimation from a Very Low Resolution Depth Image
PublicationThe paper is dedicated to proposing and evaluating a number of convolutional neural network architectures for calculating a multiple regression on 3D coordinates of human body joints tracked in a single low resolution depth image. The main challenge was to obtain a high precision in case of a noisy and coarse scan of the body, as observed by a depth sensor from a large distance. The regression network was expected to reason about...
-
Deep neural networks for human pose estimation from a very low resolution depth image
PublicationThe work presented in the paper is dedicated to determining and evaluating the most efficient neural network architecture applied as a multiple regression network localizing human body joints in 3D space based on a single low resolution depth image. The main challenge was to deal with a noisy and coarse representation of the human body, as observed by a depth sensor from a large distance, and to achieve high localization precision....
-
Depth Images Filtering In Distributed Streaming
PublicationIn this paper, we propose a distributed system for point cloud processing and transferring them via computer network regarding to effectiveness-related requirements. We discuss the comparison of point cloud filters focusing on their usage for streaming optimization. For the filtering step of the stream pipeline processing we evaluate four filters: Voxel Grid, Radial Outliner Remover, Statistical Outlier Removal and Pass Through....
-
DEPTH IMAGES FILTERING IN DISTRIBUTED STREAMING
PublicationIn this paper we discuss the comparison of point cloud filters focusing on their applicability for streaming optimization. For the filtering stage within a stream pipeline processing we evaluate three filters: Voxel Grid, Pass Through and Statistical Outlier Removal. For the filters we perform series of the tests aiming at evaluation of changes of point cloud size and transmitting frequency (various fps ratio). We propose a distributed...
-
ANN for human pose estimation in low resolution depth images
PublicationThe paper presents an approach to localize human body joints in 3D coordinates based on a single low resolution depth image. First a framework to generate a database of 80k realistic depth images from a 3D body model is described. Then data preprocessing and normalization procedure, and DNN and MLP artificial neural networks architectures and training are presented. The robustness against camera distance and image noise is analysed....
-
Scene Segmentation Basing on Color and Depth Images for Kinect Sensor
PublicationIn this paper we propose a method for segmenting single images from Kinect sensor by considering both color and depth information. The algorithm is based on a series of edge detection procedures designed for particular features of the scene objects. RGB and HSV color planes are separately analyzed in the first step with Canny edge detector, resulting in overall color edges mask. In depth images both clear boundaries and smooth...
-
Thermal Image Processing for Respiratory Estimation from Cubical Data with Expandable Depth
PublicationAs healthcare costs continue to rise, finding affordable and non-invasive ways to monitor vital signs is increasingly important. One of the key metrics for assessing overall health and identifying potential issues early on is respiratory rate (RR). Most of the existing methods require multiple steps that consist of image and signal processing. This might be difficult to deploy on edge devices that often do not have specialized...
-
Mariusz Kaczmarek dr hab. inż.
PeopleReceived M.Sc., Eng. in Electronics in 1995 from Gdansk University of Technology, Ph.D. in Medical Electronics in 2003 and habilitation in Biocybernetics and Biomedical Engineering in 2017. He was an investigator in about 13 projects receiving a number of awards, including four best papers, practical innovations (7 medals and awards) and also the Andronicos G. Kantsios Award and Siemens Award. Main research activities: the issues...
-
Stereo image visualization for a VISROBOT system
PublicationThe article describes a novel approach to robotic vision in mobile robot systems. The system implements a Visrobot system which implements a generic idea of using mobile robots for exploring an indoor environment. The task of such a robot is to visualize a stereo image properly for an operator. The system uses different stereo baseline values. Variable baseline can result in increasing depth resolution for distant objects. We assume...
-
Super-resolved Thermal Imagery for High-accuracy Facial Areas Detection and Analysis
PublicationIn this study, we evaluate various Convolutional Neural Networks based Super-Resolution (SR) models to improve facial areas detection in thermal images. In particular, we analyze the influence of selected spatiotemporal properties of thermal image sequences on detection accuracy. For this purpose, a thermal face database was acquired for 40 volunteers. Contrary to most of existing thermal databases of faces, we publish our dataset...
-
Instance segmentation of stack composed of unknown objects
PublicationThe article reviews neural network architectures designed for the segmentation task. It focuses mainly on instance segmentation of stacked objects. The main assumption is that segmentation is based on a color image with an additional depth layer. The paper also introduces the Stacked Bricks Dataset based on three cameras: RealSense L515, ZED2, and a synthetic one. Selected architectures: DeepLab, Mask RCNN, DEtection TRansformer,...
-
Computer-Aided Detection of Hypertensive Retinopathy Using Depth-Wise Separable CNN
PublicationHypertensive retinopathy (HR) is a retinal disorder, linked to high blood pressure. The incidence of HR-eye illness is directly related to the severity and duration of hypertension. It is critical to identify and analyze HR at an early stage to avoid blindness. There are presently only a few computer-aided systems (CADx) designed to recognize HR. Instead, those systems concentrated on collecting features from many retinopathy-related...
-
Analysis of Methods for Determining Shallow Waterbody Depths Based on Images Taken by Unmanned Aerial Vehicles
PublicationHydrographic surveys enable the acquisition and processing of bathymetric data, which after being plotted onto nautical charts, can help to ensure safety of navigation, monitor changes in the coastal zone, and assess hydro-engineering structure conditions. This study involves the measurement of waterbody depth, identification of the seabed shape and geomorphology, the coastline course, and the location of underwater obstacles....
-
Face Profile View Retrieval Using Time of Flight Camera Image Analysis
PublicationMethod for profile view retrieving of the human face is presented. The depth data from the 3D camera is taken as an input. The preprocessing is, besides of standard filtration, extended by the process of filling of the holes which are present in depth data. The keypoints, defined as the nose tip and the chin are detected in user’s face and tracked. The Kalman filtering is applied to smooth the coordinates of those points which...
-
Efficient signal processing in spectroscopic optical coherence tomography
PublicationSpectroscopic optical coherence tomography (SOCT) is an extension of a standard OCT technique, which allows to obtain depth-resolved, spectroscopic information on the examined sample. It can be used as a source of additional contrast in OCT images e.g. by encoding certain features of the light spectrum into the hue of the image pixels. However, SOCT require computation of time-frequency distributions of each OCT A-scan, what is...
-
Intradermal nevus - Male, 38 - Tissue image [7070729594566521]
Open Research DataThis is the histopathological image of SKIN tissue sample obtained in Medical University Gdańsk and deposited in ZMDL-GUMED. The sample image was taken using: Pannoramic 250 3DHistech slide scanner (20x magnification) and saved to DICOM format.
-
Squamous cell carcinoma, NOS - Female, 69 - Tissue image [10010729534327011]
Open Research DataThis is the histopathological image of CERVIX UTERI tissue sample obtained in Medical University Gdańsk and deposited in ZMDL-GUMED. The sample image was taken using: Pannoramic 250 3DHistech slide scanner (20x magnification) and saved to DICOM format.
-
Squamous cell carcinoma, NOS - Female, 69 - Tissue image [10010729534325871]
Open Research DataThis is the histopathological image of CERVIX UTERI tissue sample obtained in Medical University Gdańsk and deposited in ZMDL-GUMED. The sample image was taken using: Pannoramic 250 3DHistech slide scanner (20x magnification) and saved to DICOM format.
-
An analytical four-layer horizontal electric current dipole model for analysing underwater electric potential in shallow seawater
PublicationThe paper presents a new analytical four‑layer (air–water–bottom–non‑conductive layer) horizontal electric dipole model which allows an accurate approximation of ship’s Underwater Electric Potential (UEP) from a sufficient depth in shallow coastal marine waters. The numerical methods, usually Finite Element Method (FEM) or Boundary Elements Method (BEM), are typically used to estimate the electric field and the distribution of...
-
SkinDepth - synthetic 3D skin lesion database
Open Research DataSkinDepth is the first synthetic 3D skin lesion database. The release of SkinDepth dataset intends to contribute to the development of algorithms for:
-
Magnetic signature reproduction of ferromagnetic ships at arbitrary geographical position, direction and depth using a multi-dipole model – source and verification dataset with description
Open Research DataThe dataset include source synthetic magnetic data concerning the corvette-type ship numeric model. The data are for 6 locations around the World with different V1 ÷ V6 Earth magnetic field values. The attached data is in Matlab. MAT format, but the data can also be used in Octave software.
-
Badanie stanu nawierzchni drogowej z wykorzystaniem uczenia maszynowego
PublicationW artykule opisano budowę systemu informowania o stanie nawierzchni drogowej z wykorzystaniem metod cyfrowego przetwarzania obrazów oraz uczenia maszynowego. Efektem wykonanych prac badawczych jest eksperymentalna platforma, pozwalająca na rejestrację uszkodzeń na drogach, system do analizy, przetwarzania i klasyfikacji danych oraz webowa aplikacja użytkownika do przeglądu stanu nawierzchni w wybranej lokalizacji.
-
Improving Accuracy of Contactless Respiratory Rate Estimation by Enhancing Thermal Sequences with Deep Neural Networks
PublicationEstimation of vital signs using image processing techniques have already been proved to have a potential for supporting remote medical diagnostics and replacing traditional measurements that usually require special hardware and electrodes placed on a body. In this paper, we further extend studies on contactless Respiratory Rate (RR) estimation from extremely low resolution thermal imagery by enhancing acquired sequences using Deep...
-
Economical methods for measuring road surface roughness
PublicationTwo low-cost methods of estimating the road surface condition are presented in the paper, the first one based on the use of accelerometers and the other on the analysis of images acquired from cameras installed in a vehicle. In the first method, miniature positioning and accelerometer sensors are used for evaluation of the road surface roughness. The device designed for installation in vehicles is composed of a GPS receiver and...
-
Buried Object Characterization by Data-Driven Surrogates and Regression-Enabled Hyperbolic Signature Extraction
PublicationThis work addresses artificial-intelligence-based buried object characterization using FDTD-based electromagnetic simulation toolbox of a Ground Penetrating Radar (GPR) to generate B-scan data. In data collection, FDTD-based simulation tool, gprMax is used. The task is to estimate geophysical parameters of a cylindrical shape object of various radii, buried at different positions in the dry soil medium simultaneously and independently...
-
Ensembling noisy segmentation masks of blurred sperm images
PublicationBackground: Sperm tail morphology and motility have been demonstrated to be important factors in determining sperm quality for in vitro fertilization. However, many existing computer-aided sperm analysis systems leave the sperm tail out of the analysis, as detecting a few tail pixels is challenging. Moreover, some publicly available datasets for classifying morphological defects contain images limited only to the sperm head. This...
-
CMGNet: Context-aware middle-layer guidance network for salient object detection
PublicationSalient object detection (SOD) is a critical task in computer vision that involves accurately identifying and segmenting visually significant objects in an image. To address the challenges of gridding issues and feature...
-
Processing of Satellite Data in the Cloud
PublicationThe dynamic development of digital technologies, especially those dedicated to devices generating large data streams, such as all kinds of measurement equipment (temperature and humidity sensors, cameras, radio-telescopes and satellites – Internet of Things) enables more in-depth analysis of the surrounding reality, including better understanding of various natural phenomenon, starting from atomic level reactions, through macroscopic...
-
Near-Infrared III Nanophosphorswith Cr3+Ni2+ Energy Transfer for Bioimaging
Open Research DataIn the biomedical field, the use of fluorescence imaging in the second near-infrared (NIR-II) region is growing rapidly because it imparts the advantages of reduced autofluorescence and low photon scattering. The advantage of reduced scattering is that it increases penetration depth in vivo and improves imaging clarity. Herein, this work uses mesoporous...
-
Mode shapes of a beam and plate with defects, obtained by experimental modal analysis
Open Research DataThe DataSet contains the experimental results of the first mode shape for a beam and a plate.
-
Water currents in Głębinka Passage in late spring of 1975
Open Research DataData set contains the results of the field measurements of horizontal water currents carried out in the Głębinka Passage in Puck Bay (Southern Baltic, Poland) in 1975 by Department of Physical Oceanography (Institute Oceanography, University of Gdańsk). Głębinka Passage is a narrow strait playing crucial role in water exchange between shallow and deep...
-
Biomass of macrophytobentos in the Puck Bay in 2010-2018
Open Research DataThe database contains data on qualitative composition and biomass of macrophytobenthos (flower plants and macroalgae) in samples collected in the Puck Bay area (Gulf of Gdańsk, southern Baltic Sea) at 20 stations between 2010-2018. The database contains information on sampling sites (region, geographical coordinates, depth), sample characteristics (date,...
-
Vident-synth: a synthetic intra-oral video dataset for optical flow estimation
Open Research DataWe introduce Vident-synth, a large dataset of synthetic dental videos with corresponding ground truth forward and backward optical flows and occlusion masks. It can be used for:
-
The effect of interview location on the perception of Ecosystem Services provided by trees. A Polish case study.
Open Research DataSeveral survey research methods are available to study attitudes towards the environment, including: CAWI (computer-assisted Internet interview), CATI (computer-assisted telephone interview), CAPI (computer-assisted personal interview), and PAPI (paper-pencil interview). An increasingly popular CAWI approach is the geo-questionnaire – an internet survey...
-
Ocean mixed layer dynamics: high-resolution simulations of wind, wave and convective effects
Open Research DataThis dataset contains results of high-resolution numerical simulations of the ocean mixed layer (OML) forced by wind, waves and cooling from the atmosphere, i.e., under strongly turbulent, convective conditions. The goal is to provide detailed, three-dimensional information about OML circulation, turbulent kinetic energy, and temperature and salinity...
-
MODALITY corpus - SPEAKER 35 - COMMANDS C1
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 21 - SEQUENCE S6
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 21 - COMMANDS C5
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 21 - SEQUENCE S4
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 10 - SEQUENCE S1
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 01 - SEQUENCE S2
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 39 - COMMANDS C1
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 01 - SEQUENCE S3
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 01 - COMMANDS C3
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 21 - SEQUENCE S2
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 33 - SEQUENCE S1
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 01 - COMMANDS C2
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 21 - COMMANDS C3
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 01 - SEQUENCE S4
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 01 - SEQUENCE S6
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...