Get a Grip: Reconstructing Hand-Object Stable Grasps in Egocentric Videos

We address in-the-wild hand-object reconstruction for a known object category in egocentric videos, focusing on temporal periods of stable grasps. We propose the task of Hand-Object Stable Grasp Reconstruction (HO-SGR), the joint reconstruction of frames during which the hand is stably holding the o...

Bibliographic Details
Main Authors: Zhu, Zhifan, Damen, Dima
Format: Text
Language: unknown
Published: 2023
Subjects: Computer Science - Computer Vision and Pattern Recognition
Online Access: http://arxiv.org/abs/2312.15719
id ftarxivpreprints:oai:arXiv.org:2312.15719
record_format openpolar
spelling ftarxivpreprints:oai:arXiv.org:2312.15719 2024-01-28T10:03:57+01:00 Get a Grip: Reconstructing Hand-Object Stable Grasps in Egocentric Videos Zhu, Zhifan Damen, Dima 2023-12-25 http://arxiv.org/abs/2312.15719 unknown http://arxiv.org/abs/2312.15719 Computer Science - Computer Vision and Pattern Recognition text 2023 ftarxivpreprints 2023-12-31T02:11:58Z We address in-the-wild hand-object reconstruction for a known object category in egocentric videos, focusing on temporal periods of stable grasps. We propose the task of Hand-Object Stable Grasp Reconstruction (HO-SGR), the joint reconstruction of frames during which the hand is stably holding the object. We thus can constrain the object motion relative to the hand, effectively regularising the reconstruction and improving performance. By analysing the 3D ARCTIC dataset, we identify temporal periods where the contact area between the hand and object vertices remains stable. We showcase that objects within stable grasps move within a single degree of freedom (1 DoF). We thus propose a method for jointly optimising all frames within a stable grasp by minimising the object's rotation to that within a latent 1 DoF. We then extend this knowledge to in-the-wild egocentric videos by labelling 2.4K clips of stable grasps from the EPIC-KITCHENS dataset. Our proposed EPIC-Grasps dataset includes 390 object instances of 9 categories, featuring stable grasps from videos of daily interactions in 141 environments. Our method achieves significantly better HO-SGR, both qualitatively and by computing the stable grasp area and 2D projection labels of mask overlaps. Comment: webpage: https://zhifanzhu.github.io/getagrip Text Arctic ArXiv.org (Cornell University Library) Arctic
institution Open Polar
collection ArXiv.org (Cornell University Library)
op_collection_id ftarxivpreprints
language unknown
topic Computer Science - Computer Vision and Pattern Recognition
spellingShingle Computer Science - Computer Vision and Pattern Recognition
Zhu, Zhifan
Damen, Dima
Get a Grip: Reconstructing Hand-Object Stable Grasps in Egocentric Videos
topic_facet Computer Science - Computer Vision and Pattern Recognition
description We address in-the-wild hand-object reconstruction for a known object category in egocentric videos, focusing on temporal periods of stable grasps. We propose the task of Hand-Object Stable Grasp Reconstruction (HO-SGR), the joint reconstruction of frames during which the hand is stably holding the object. We thus can constrain the object motion relative to the hand, effectively regularising the reconstruction and improving performance. By analysing the 3D ARCTIC dataset, we identify temporal periods where the contact area between the hand and object vertices remains stable. We showcase that objects within stable grasps move within a single degree of freedom (1 DoF). We thus propose a method for jointly optimising all frames within a stable grasp by minimising the object's rotation to that within a latent 1 DoF. We then extend this knowledge to in-the-wild egocentric videos by labelling 2.4K clips of stable grasps from the EPIC-KITCHENS dataset. Our proposed EPIC-Grasps dataset includes 390 object instances of 9 categories, featuring stable grasps from videos of daily interactions in 141 environments. Our method achieves significantly better HO-SGR, both qualitatively and by computing the stable grasp area and 2D projection labels of mask overlaps. Comment: webpage: https://zhifanzhu.github.io/getagrip
format Text
author Zhu, Zhifan
Damen, Dima
author_facet Zhu, Zhifan
Damen, Dima
author_sort Zhu, Zhifan
title Get a Grip: Reconstructing Hand-Object Stable Grasps in Egocentric Videos
title_short Get a Grip: Reconstructing Hand-Object Stable Grasps in Egocentric Videos
title_full Get a Grip: Reconstructing Hand-Object Stable Grasps in Egocentric Videos
title_fullStr Get a Grip: Reconstructing Hand-Object Stable Grasps in Egocentric Videos
title_full_unstemmed Get a Grip: Reconstructing Hand-Object Stable Grasps in Egocentric Videos
title_sort get a grip: reconstructing hand-object stable grasps in egocentric videos
publishDate 2023
url http://arxiv.org/abs/2312.15719
geographic Arctic
geographic_facet Arctic
genre Arctic
genre_facet Arctic
op_relation http://arxiv.org/abs/2312.15719
_version_ 1789329533226188800