[ICLR 2026] VIST3A: Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator
The official code for the paper: "Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator". VIST3A is a framework for text-to-3D generation that combines a multi-view ...