Improve model card: update pipeline tag, add paper/code links, and expand description

by nielsr HF Staff - opened Nov 19, 2025

←

This PR significantly enhances the model card for this Sa2VA-i model by:

Updating the pipeline_tag to image-segmentation. This change more accurately reflects the model's core functionality of language-guided dense grounding and referring video object segmentation, ensuring better discoverability on the Hugging Face Hub.
Adding explicit links to the Hugging Face paper page (Sa2VA-i: Improving Sa2VA Results with Consistent Training and Inference) and the official GitHub repository (https://github.com/kumuji/sa2va-i).
Expanding the model description with a detailed overview, performance highlights, competition results, model zoo, quick start guidance, and key improvements, all sourced directly from the project's GitHub README. This provides comprehensive information to users.
Including authors and a teaser image for better presentation.
Consolidating citations to include both the Sa2VA-i paper and the original Sa2VA paper.
Removing irrelevant "File information" from the public model card.

These updates aim to provide a more complete, accurate, and user-friendly model card.

Owner Nov 20, 2025

Awesome! Thank you so much!

kumuji changed pull request status to merged Nov 20, 2025

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment