Abstract: In unknown environments lacking prior maps, achieving effective visual understanding is crucial for building highly efficient task - driven autonomous navigation systems. In this paper, we ...