Get a Grip: Reconstructing Hand-Object Stable Grasps in Egocentric Videos
We address in-the-wild hand-object reconstruction for a known object category in egocentric videos, focusing on temporal periods of stable grasps. We propose the task of Hand-Object Stable Grasp Reconstruction (HO-SGR), the joint reconstruction of frames during which the hand is stably holding the o...
Main Authors: | , |
---|---|
Format: | Text |
Language: | unknown |
Published: |
2023
|
Subjects: | |
Online Access: | http://arxiv.org/abs/2312.15719 |
id |
ftarxivpreprints:oai:arXiv.org:2312.15719 |
---|---|
record_format |
openpolar |
spelling |
ftarxivpreprints:oai:arXiv.org:2312.15719 2024-01-28T10:03:57+01:00 Get a Grip: Reconstructing Hand-Object Stable Grasps in Egocentric Videos Zhu, Zhifan Damen, Dima 2023-12-25 http://arxiv.org/abs/2312.15719 unknown http://arxiv.org/abs/2312.15719 Computer Science - Computer Vision and Pattern Recognition text 2023 ftarxivpreprints 2023-12-31T02:11:58Z We address in-the-wild hand-object reconstruction for a known object category in egocentric videos, focusing on temporal periods of stable grasps. We propose the task of Hand-Object Stable Grasp Reconstruction (HO-SGR), the joint reconstruction of frames during which the hand is stably holding the object. We thus can constrain the object motion relative to the hand, effectively regularising the reconstruction and improving performance. By analysing the 3D ARCTIC dataset, we identify temporal periods where the contact area between the hand and object vertices remain stable. We showcase that objects within stable grasps move within a single degree of freedom (1~DoF). We thus propose a method for jointly optimising all frames within a stable grasp by minimising the object's rotation to that within a latent 1 DoF. We then extend this knowledge to in-the-wild egocentric videos by labelling 2.4K clips of stable grasps from the EPIC-KITCHENS dataset. Our proposed EPIC-Grasps dataset includes 390 object instances of 9 categories, featuring stable grasps from videos of daily interactions in 141 environments. Our method achieves significantly better HO-SGR, both qualitatively and by computing the stable grasp area and 2D projection labels of mask overlaps. Comment: webpage: https://zhifanzhu.github.io/getagrip Text Arctic ArXiv.org (Cornell University Library) Arctic |
institution |
Open Polar |
collection |
ArXiv.org (Cornell University Library) |
op_collection_id |
ftarxivpreprints |
language |
unknown |
topic |
Computer Science - Computer Vision and Pattern Recognition |
spellingShingle |
Computer Science - Computer Vision and Pattern Recognition Zhu, Zhifan Damen, Dima Get a Grip: Reconstructing Hand-Object Stable Grasps in Egocentric Videos |
topic_facet |
Computer Science - Computer Vision and Pattern Recognition |
description |
We address in-the-wild hand-object reconstruction for a known object category in egocentric videos, focusing on temporal periods of stable grasps. We propose the task of Hand-Object Stable Grasp Reconstruction (HO-SGR), the joint reconstruction of frames during which the hand is stably holding the object. We thus can constrain the object motion relative to the hand, effectively regularising the reconstruction and improving performance. By analysing the 3D ARCTIC dataset, we identify temporal periods where the contact area between the hand and object vertices remain stable. We showcase that objects within stable grasps move within a single degree of freedom (1~DoF). We thus propose a method for jointly optimising all frames within a stable grasp by minimising the object's rotation to that within a latent 1 DoF. We then extend this knowledge to in-the-wild egocentric videos by labelling 2.4K clips of stable grasps from the EPIC-KITCHENS dataset. Our proposed EPIC-Grasps dataset includes 390 object instances of 9 categories, featuring stable grasps from videos of daily interactions in 141 environments. Our method achieves significantly better HO-SGR, both qualitatively and by computing the stable grasp area and 2D projection labels of mask overlaps. Comment: webpage: https://zhifanzhu.github.io/getagrip |
format |
Text |
author |
Zhu, Zhifan Damen, Dima |
author_facet |
Zhu, Zhifan Damen, Dima |
author_sort |
Zhu, Zhifan |
title |
Get a Grip: Reconstructing Hand-Object Stable Grasps in Egocentric Videos |
title_short |
Get a Grip: Reconstructing Hand-Object Stable Grasps in Egocentric Videos |
title_full |
Get a Grip: Reconstructing Hand-Object Stable Grasps in Egocentric Videos |
title_fullStr |
Get a Grip: Reconstructing Hand-Object Stable Grasps in Egocentric Videos |
title_full_unstemmed |
Get a Grip: Reconstructing Hand-Object Stable Grasps in Egocentric Videos |
title_sort |
get a grip: reconstructing hand-object stable grasps in egocentric videos |
publishDate |
2023 |
url |
http://arxiv.org/abs/2312.15719 |
geographic |
Arctic |
geographic_facet |
Arctic |
genre |
Arctic |
genre_facet |
Arctic |
op_relation |
http://arxiv.org/abs/2312.15719 |
_version_ |
1789329533226188800 |