Official code for the paper "Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator". VIST3A is a framework for text-to-3D generation that combines a multi-view ...