LI Chaonan,LIU Sheng,YAO Lu,ZOU Siyu.Video-based body geometric aware network for 3D human pose estimation[J].Optoelectronics Letters,2022,(5):313-320
Video-based body geometric aware network for 3D human pose estimation
Author NameAffiliation
LI Chaonan College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China 
LIU Sheng College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China 
YAO Lu College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China 
ZOU Siyu College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China 
Abstract:
      Three-dimensional human pose estimation (3D HPE) has broad application prospects in the fields of trajectory prediction, posture tracking and action analysis. However, the frequent self-occlusions and the substantial depth ambiguity in two-dimensional (2D) representations hinder the further improvement of accuracy. In this paper, we propose a novel video-based human body geometric aware network to mitigate the above problems. Our network can implicitly be aware of the geometric constraints of the human body by capturing spatial and temporal context information from 2D skeleton data. Specifically, a novel skeleton attention (SA) mechanism is proposed to model geometric context dependencies among different body joints, thereby improving the spatial feature representation ability of the network. To enhance the temporal consistency, a novel multilayer perceptron (MLP)-Mixer based structure is exploited to comprehensively learn temporal context information from input sequences. We conduct experiments on publicly available challenging datasets to evaluate the proposed approach. The results outperform the previous best approach by 0.5 mm in the Human3.6m dataset. It also demonstrates significant improvements in HumanEva-I dataset.
Hits: 354
Download times: 1
View Full Text    Download reader