Abstract: This paper proposes a multi-scale fusion attention-enhanced multi-task learning algorithm (MS-FA-MTL) framework for real-time 3D object detection in autonomous driving scenarios. The ...
Enabling robotic systems to perform long-horizon manipulation planning in real-world environments based on multimodal embodied perception and comprehension remains a longstanding challenge. Recent ...