Abstract: Accurate alignment of virtual objects in Augmented Reality (AR) is essential for precision-critical applications such as surgery, infrastructure inspection, and digital twin systems. However ...
We introduce TASTE-Rob: 1) a dataset with 100,856 task-oriented hand-object interaction videos, 2) a three-stage pose-refinement video generation pipeline. With the above contributions, TASTE-Rob is ...