Video-Text-to-Text
Transformers
Safetensors
English
moss_vl
image-feature-extraction
SFT
Video-Understanding
Image-Understanding
MOSS-VL
OpenMOSS
multimodal
video
vision-language
custom_code
Instructions to use OpenMOSS-Team/MOSS-VL-Instruct-0408 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use OpenMOSS-Team/MOSS-VL-Instruct-0408 with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("OpenMOSS-Team/MOSS-VL-Instruct-0408", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Xet hash:
- 4591586d7e4c77a493bab1231c0f25e7261b3ed0a65ee29cc3bd65c58d5250e6
- Size of remote file:
- 1.31 GB
- SHA256:
- 5e10b79d49b76df0be11e36b83e0628adfd6197cf247c9f8d48fad5651b7234c
·
Xet efficiently stores Large Files inside Git, intelligently splitting files into unique chunks and accelerating uploads and downloads. More info.