3D Tracking of Human Motion Using Visual Skeletonization and Stereoscopic Vision

The design of markerless systems to reconstruct human motion in a timely, unobtrusive and externally valid manner is still an open challenge. Artificial intelligence algorithms based on automatic landmarks identification on video images opened to a new approach, potentially e-viable with low-cost ha...

Full description

Bibliographic Details
Main Authors: Matteo Zago, Matteo Luzzago, Tommaso Marangoni, Mariolino De Cecco, Marco Tarabini, Manuela Galli
Format: Article
Language:English
Published: Frontiers Media S.A. 2020-03-01
Series:Frontiers in Bioengineering and Biotechnology
Subjects:
Online Access:https://www.frontiersin.org/article/10.3389/fbioe.2020.00181/full
id doaj-e4d39f15b4cb48d5ba65ad98b3905ee7
record_format Article
spelling doaj-e4d39f15b4cb48d5ba65ad98b3905ee72020-11-25T00:29:07ZengFrontiers Media S.A.Frontiers in Bioengineering and Biotechnology2296-41852020-03-01810.3389/fbioe.2020.001815089973D Tracking of Human Motion Using Visual Skeletonization and Stereoscopic VisionMatteo Zago0Matteo Luzzago1Tommaso Marangoni2Mariolino De Cecco3Marco Tarabini4Manuela Galli5Department of Electronics, Information and Bioengineering, Polytechnic of Milan, Milan, ItalyDepartment of Mechanical Engineering, Polytechnic of Milan, Milan, ItalyDepartment of Mechanical Engineering, Polytechnic of Milan, Milan, ItalyDepartment of Industrial Engineering, University of Trento, Trento, ItalyDepartment of Mechanical Engineering, Polytechnic of Milan, Milan, ItalyDepartment of Electronics, Information and Bioengineering, Polytechnic of Milan, Milan, ItalyThe design of markerless systems to reconstruct human motion in a timely, unobtrusive and externally valid manner is still an open challenge. Artificial intelligence algorithms based on automatic landmarks identification on video images opened to a new approach, potentially e-viable with low-cost hardware. OpenPose is a library that t using a two-branch convolutional neural network allows for the recognition of skeletons in the scene. Although OpenPose-based solutions are spreading, their metrological performances relative to video setup are still largely unexplored. This paper aimed at validating a two-cameras OpenPose-based markerless system for gait analysis, considering its accuracy relative to three factors: cameras' relative distance, gait direction and video resolution. Two volunteers performed a walking test within a gait analysis laboratory. A marker-based optical motion capture system was taken as a reference. Procedures involved: calibration of the stereoscopic system; acquisition of video recordings, simultaneously with the reference marker-based system; video processing within OpenPose to extract the subject's skeleton; videos synchronization; triangulation of the skeletons in the two videos to obtain the 3D coordinates of the joints. Two set of parameters were considered for the accuracy assessment: errors in trajectory reconstruction and error in selected gait space-temporal parameters (step length, swing and stance time). The lowest error in trajectories (~20 mm) was obtained with cameras 1.8 m apart, highest resolution and straight gait, and the highest (~60 mm) with the 1.0 m, low resolution and diagonal gait configuration. The OpenPose-based system tended to underestimate step length of about 1.5 cm, while no systematic biases were found for swing/stance time. Step length significantly changed according to gait direction (p = 0.008), camera distance (p = 0.020), and resolution (p < 0.001). Among stance and swing times, the lowest errors (0.02 and 0.05 s for stance and swing, respectively) were obtained with the 1 m, highest resolution and straight gait configuration. These findings confirm the feasibility of tracking kinematics and gait parameters of a single subject in a 3D space using two low-cost webcams and the OpenPose engine. In particular, the maximization of cameras distance and video resolution enabled to achieve the highest metrological performances.https://www.frontiersin.org/article/10.3389/fbioe.2020.00181/fullmovement measurementgait analysiscomputer visionartificial intelligencemarkerless motion capture
collection DOAJ
language English
format Article
sources DOAJ
author Matteo Zago
Matteo Luzzago
Tommaso Marangoni
Mariolino De Cecco
Marco Tarabini
Manuela Galli
spellingShingle Matteo Zago
Matteo Luzzago
Tommaso Marangoni
Mariolino De Cecco
Marco Tarabini
Manuela Galli
3D Tracking of Human Motion Using Visual Skeletonization and Stereoscopic Vision
Frontiers in Bioengineering and Biotechnology
movement measurement
gait analysis
computer vision
artificial intelligence
markerless motion capture
author_facet Matteo Zago
Matteo Luzzago
Tommaso Marangoni
Mariolino De Cecco
Marco Tarabini
Manuela Galli
author_sort Matteo Zago
title 3D Tracking of Human Motion Using Visual Skeletonization and Stereoscopic Vision
title_short 3D Tracking of Human Motion Using Visual Skeletonization and Stereoscopic Vision
title_full 3D Tracking of Human Motion Using Visual Skeletonization and Stereoscopic Vision
title_fullStr 3D Tracking of Human Motion Using Visual Skeletonization and Stereoscopic Vision
title_full_unstemmed 3D Tracking of Human Motion Using Visual Skeletonization and Stereoscopic Vision
title_sort 3d tracking of human motion using visual skeletonization and stereoscopic vision
publisher Frontiers Media S.A.
series Frontiers in Bioengineering and Biotechnology
issn 2296-4185
publishDate 2020-03-01
description The design of markerless systems to reconstruct human motion in a timely, unobtrusive and externally valid manner is still an open challenge. Artificial intelligence algorithms based on automatic landmarks identification on video images opened to a new approach, potentially e-viable with low-cost hardware. OpenPose is a library that t using a two-branch convolutional neural network allows for the recognition of skeletons in the scene. Although OpenPose-based solutions are spreading, their metrological performances relative to video setup are still largely unexplored. This paper aimed at validating a two-cameras OpenPose-based markerless system for gait analysis, considering its accuracy relative to three factors: cameras' relative distance, gait direction and video resolution. Two volunteers performed a walking test within a gait analysis laboratory. A marker-based optical motion capture system was taken as a reference. Procedures involved: calibration of the stereoscopic system; acquisition of video recordings, simultaneously with the reference marker-based system; video processing within OpenPose to extract the subject's skeleton; videos synchronization; triangulation of the skeletons in the two videos to obtain the 3D coordinates of the joints. Two set of parameters were considered for the accuracy assessment: errors in trajectory reconstruction and error in selected gait space-temporal parameters (step length, swing and stance time). The lowest error in trajectories (~20 mm) was obtained with cameras 1.8 m apart, highest resolution and straight gait, and the highest (~60 mm) with the 1.0 m, low resolution and diagonal gait configuration. The OpenPose-based system tended to underestimate step length of about 1.5 cm, while no systematic biases were found for swing/stance time. Step length significantly changed according to gait direction (p = 0.008), camera distance (p = 0.020), and resolution (p < 0.001). Among stance and swing times, the lowest errors (0.02 and 0.05 s for stance and swing, respectively) were obtained with the 1 m, highest resolution and straight gait configuration. These findings confirm the feasibility of tracking kinematics and gait parameters of a single subject in a 3D space using two low-cost webcams and the OpenPose engine. In particular, the maximization of cameras distance and video resolution enabled to achieve the highest metrological performances.
topic movement measurement
gait analysis
computer vision
artificial intelligence
markerless motion capture
url https://www.frontiersin.org/article/10.3389/fbioe.2020.00181/full
work_keys_str_mv AT matteozago 3dtrackingofhumanmotionusingvisualskeletonizationandstereoscopicvision
AT matteoluzzago 3dtrackingofhumanmotionusingvisualskeletonizationandstereoscopicvision
AT tommasomarangoni 3dtrackingofhumanmotionusingvisualskeletonizationandstereoscopicvision
AT mariolinodececco 3dtrackingofhumanmotionusingvisualskeletonizationandstereoscopicvision
AT marcotarabini 3dtrackingofhumanmotionusingvisualskeletonizationandstereoscopicvision
AT manuelagalli 3dtrackingofhumanmotionusingvisualskeletonizationandstereoscopicvision
_version_ 1725333234476122112