华体会官方网页版-华体会(中国)

官方微信
友情链接

华体会官方网页版-华体会(中国):PIFU-RGBD: Single-view RGB-D Pixel-aligned Implicit Function for 3D Human Reconstruction

2024-05-14


Wang, Yingli; Zhang, Liping; Li, Weijun; Dong, Xiaoli; Li, Li; Qin, Hong Source: 2023 International Conference on High Performance Big Data and Intelligent Systems, HDIS 2023, p 93-99, 2023, 2023 International Conference on High Performance Big Data and Intelligent Systems, HDIS 2023;

Abstract:

Recent advances in IMAGE-BASED parsing of human bodies have been driven by the significant improvement in successful deep learning methods for 2D image processing. Although current methods have demonstrated outstanding global reconstruction capability, they still fail to process inherent depth ambiguity in 2D image images. In this paper, we propose PIFU-RGBD, a new pixel-aligned function representation method to reconstruct the complete and detailed 3D human from a single RGB-D image. The PIFU-RGBD method is mainly structured into two stages. The initial stage involves transforming a single RGB-D image into a single-view human point cloud, and then the single-view mesh is modeled based on the point cloud data, and the binocular view is rendered. Moving on to the second stage, the depth information and voxel alignment features of binocular view are obtained through the stereoscopic vision network and input into the implicit function estimation network. By using the Marching Cubes algorithm, a complete three-dimensional reconstruction of the human body model is obtained. It is worth noting that the RGBD images obtained by any camera can be converted into the input of unified camera parameters after processing in the first stage, which makes the depth information and voxel alignment features extracted in the second stage are camera-independent. The trained network performs depth-aware reconstruction under unified parameter settings. Compared with previous works, our proposed method can effectively improve the pose ambiguity problem of the reconstruction of human model with single view input, and significantly improve the reconstruction accuracy. Compared with the current SOTA method, which uses single-view RGB-D input to reconstruct the complete human body, the scheme proposed in this paper can reconstruct the human body model with accurate posture on the data captured by cameras with different parameters, and has the advantage of stronger generalization capability.

?2023 IEEE. (20 refs.)




关于我们
下载视频观看
联系方式
通信地址

北京市海淀区清华东路甲35号(林大北路中段) 北京912信箱 (100083)

电话

010-82304210/010-82305052(传真)

E-mail

semi@semi.ac.cn

交通地图
友情链接
中华人民共和国科学技术部
中国科华体会官方网页版-华体会(中国)
中国工程院
国家自然科学基金委员会
中国科华体会官方网页版-华体会(中国)大学
中国科学技术大学
中国科华体会官方网页版-华体会(中国)科技产业网
版权所有 华体会官方网页版-华体会(中国)

备案号:京ICP备05085259-1号 京公网安备110402500052 中国科华体会官方网页版-华体会(中国)半导体所声明

华体会官方网页版-华体会(中国):

华体会官方网页版-华体会(中国)