Abstrakt
The work presented in the paper is dedicated to determining and evaluating the most efficient neural network architecture applied as a multiple regression network localizing human body joints in 3D space based on a single low resolution depth image. The main challenge was to deal with a noisy and coarse representation of the human body, as observed by a depth sensor from a large distance, and to achieve high localization precision. The regression network was expected to reason about relations of body parts based on depth image, and to extract locations of joints, and provide coordinates defining the body pose. The method involved creation of a dataset with 200,000 realistic depth images of a 3D body model, then training and testing numerous architectures including feedforward multilayer perceptron network and deep convolutional neural networks. The results of training and evaluation are included and discussed. The most accurate DNN network was further trained and evaluated on an augmented depth images dataset. The achieved accuracy was similar to a reference Kinect algorithm results, with a great benefit of fast processing speed and significantly lower requirements on sensor resolution, as it used 100 times less pixels than Kinect depth sensor. The method was robust against sensor noise, allowing imprecision of depth measurements. Finally, our results were compared with VGG, MobileNet, and ResNet architectures.
Cytowania
-
1 8
CrossRef
-
0
Web of Science
-
2 0
Scopus
Autor (1)
Cytuj jako
Pełna treść
- Wersja publikacji
- Accepted albo Published Version
- Licencja
- otwiera się w nowej karcie
Słowa kluczowe
Informacje szczegółowe
- Kategoria:
- Publikacja w czasopiśmie
- Typ:
- artykuł w czasopiśmie wyróżnionym w JCR
- Opublikowano w:
-
MULTIMEDIA TOOLS AND APPLICATIONS
nr 78,
strony 29357 - 29377,
ISSN: 1380-7501 - Język:
- angielski
- Rok wydania:
- 2019
- Opis bibliograficzny:
- Szczuko P.: Deep neural networks for human pose estimation from a very low resolution depth image// MULTIMEDIA TOOLS AND APPLICATIONS. -Vol. 78, (2019), s.29357-29377
- DOI:
- Cyfrowy identyfikator dokumentu elektronicznego (otwiera się w nowej karcie) 10.1007/s11042-019-7433-7
- Weryfikacja:
- Politechnika Gdańska
Powiązane datasety
- dane badawcze Very low resolution depth images of 200,000 poses
wyświetlono 246 razy