New large multimodal models (LMMs) are released frequently, but finetuning these models is not always straightforward. This codebase aims to provide a unified, minimal ...
3D-LLaVA (CVPR 2025) is a 3D Large Multimodal Model that takes point clouds and text instructions as input to perform VQA, Dense Captioning, and 3D Referring Segmentation. At the core of 3D-LLaVA is a new ...
Abstract: We introduce LLaVA-Critic, the first open-source large multimodal model (LMM) designed as a generalist evaluator to assess performance across a wide range of multimodal tasks. LLaVA-Critic ...