Abstract: Large Vision-Language Models (VLMs), such as GPT-4, have achieved remarkable success across various fields. However, there are few studies on 3D indoor scene generation with VLMs. This paper ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results