Abstract: Enabling robotic systems to perform long-horizon manipulation planning in real-world environments based on multimodal embodied perception and comprehension remains a longstanding challenge.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results