CSS Align - Search News

Aligning Vision-Language Model with Fine-grained Semantics for Open-Vocabulary Segmentation

Abstract: Pre-trained Vision-Language Models (VLMs) are often used to tackle the challenging task of Open-vocabulary Segmentation (OVS). To preserve the valuable pre-trained knowledge of VLM-based ...

IEEE

Align-A-Video: Deterministic Reward Tuning of Image Diffusion Models for Consistent Video Editing

Abstract: Due to control limitations in the denoising process and the lack of training, zero-shot video editing methods often struggle to meet user instructions, resulting in generated videos that are ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Aligning Vision-Language Model with Fine-grained Semantics for Open-Vocabulary Segmentation

Align-A-Video: Deterministic Reward Tuning of Image Diffusion Models for Consistent Video Editing

Trending now