HIDE: Hierarchical iterative decoding enhancement for multi-view 3D human parameter regression

التفاصيل البيبلوغرافية
العنوان: HIDE: Hierarchical iterative decoding enhancement for multi-view 3D human parameter regression
المؤلفون: Lin WT(林伟涛), Zhang JG(张吉光), Meng WL(孟维亮), Liu XL(刘湘龙), Zhang XP(张晓鹏)
بيانات النشر: WILEY
سنة النشر: 2024
المجموعة: Institute of Automation: CASIA OpenIR (Chinese Academy of Sciences) / 中国科学院自动化研究所机构知识库
مصطلحات موضوعية: 3D human mesh recovery, body modeling, computer vision, deep learning, Computer Science, Software Engineering
الوصف: Parametric human modeling are limited to either single-view frameworks or simple multi-view frameworks, failing to fully leverage the advantages of easily trainable single-view networks and the occlusion-resistant capabil ities of multi-view images. The prevalent presence of object occlusion and self-occlusion in real-world scenarios leads to issues of robustness and accuracy in predicting human body parameters. Additionally, many methods overlook the spatial connectivity of human joints in the global estimation of model pose parameters, resulting in cumulative errors in continuous joint parameters.To address these challenges, we propose a flexible and efficient iterative decoding strategy. By extending from single-view images to multi-view video inputs, we achieve local-to-global optimization. We utilize attention mechanisms to cap ture the rotational dependencies between any node in the human body and all its ancestor nodes, thereby enhancing pose decoding capability. We employ a parameter-level iterative fusion of multi-view image data to achieve flexible integration of global pose information, rapidly obtaining appropriate projection features from different viewpoints, ultimately resulting in precise parameter estimation. Through experiments, we validate the effectiveness of the HIDE method on the Human3.6M and 3DPW datasets, demonstrating significantly improved visualization results compared to previous methods.
نوع الوثيقة: report
اللغة: English
Relation: Computer Animation and Virtual Worlds; http://ir.ia.ac.cn/handle/173211/57341
DOI: 10.1002/cav.2266
الاتاحة: http://ir.ia.ac.cn/handle/173211/57341
https://doi.org/10.1002/cav.2266
رقم الانضمام: edsbas.E04B9895
قاعدة البيانات: BASE