Human motion prediction via spatio-temporal inpainting

Bibliographic details
Title:	Human motion prediction via spatio-temporal inpainting
Authors:	Hernández Ruiz, Alejandro José, Gall, Juergen, Moreno-Noguer, Francesc
Contributors:	Universitat Politècnica de Catalunya. Doctorat en Automàtica, Robòtica i Visió, Institut de Robòtica i Informàtica Industrial, Universitat Politècnica de Catalunya. ROBiri - Grup de Robòtica de l'IRI
Publisher information:	Institute of Electrical and Electronics Engineers (IEEE)
Publication year:	2019
Collection:	Universitat Politècnica de Catalunya, BarcelonaTech: UPCommons - Global access to UPC knowledge
Subject terms:	Àrees temàtiques de la UPC::Informàtica, Computer vision, Pattern recognition, Reconeixement de formes (Informàtica), Visió per ordinador
Description:	© 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works ; We propose a Generative Adversarial Network (GAN) to forecast 3D human motion given a sequence of past 3D skeleton poses. While recent GANs have shown promising results, they can only forecast plausible motion over relatively short periods of time (a few hundred milliseconds) and typically ignore the absolute position of the skeleton with respect to the camera. Our scheme provides long-term predictions (two seconds or more) for both the body pose and its absolute position. Our approach builds upon three main contributions. First, we represent the data using a spatio-temporal tensor of 3D skeleton coordinates, which allows formulating the prediction problem as an inpainting one, for which GANs work particularly well. Second, we design an architecture to learn the joint distribution of body poses and global motion, capable of hypothesizing large chunks of the input 3D tensor with missing data. Finally, we argue that the L2 metric, considered so far by most approaches, fails to capture the actual distribution of long-term human motion. We propose two alternative metrics, based on the distribution of frequencies, that are able to capture more realistic motion patterns. Extensive experiments demonstrate that our approach significantly improves the state of the art, while also handling situations in which past observations are corrupted by occlusions, noise and missing frames. ; Peer Reviewed ; Postprint (author's final draft)
Document type:	conference object
File description:	10 p.; application/pdf
Language:	English
ISBN:	978-1-72814-803-8 1-72814-803-0
Relation:	https://ieeexplore.ieee.org/document/9008530; info:eu-repo/grantAgreement/MINECO/2PE/MDM-2016-0656; info:eu-repo/grantAgreement/MINECO/2PE/TIN2017-900-R; Hernandez, A.; Gall, J.; Moreno-Noguer, F. Human motion prediction via spatio-temporal inpainting. A: IEEE International Conference on Computer Vision. "2019 IEEE/CVF International Conference on Computer Vision (ICCV)". Institute of Electrical and Electronics Engineers (IEEE), 2019, p. 7133-7142.; http://hdl.handle.net/2117/187445
DOI:	10.1109/ICCV.2019.00723
Availability:	http://hdl.handle.net/2117/187445 https://doi.org/10.1109/ICCV.2019.00723
Rights:	Attribution-NonCommercial-NoDerivs 3.0 Spain ; http://creativecommons.org/licenses/by-nc-nd/3.0/es/ ; Open Access
Accession number:	edsbas.27B6BA6A
Database:	BASE
