Video-Text-to-Text
Transformers
Safetensors
English
videochat_flash_qwen
feature-extraction
multimodal
custom_code
Eval Results (legacy)
Instructions to use OpenGVLab/VideoChat-Flash-Qwen2_5-2B_res448 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use OpenGVLab/VideoChat-Flash-Qwen2_5-2B_res448 with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("OpenGVLab/VideoChat-Flash-Qwen2_5-2B_res448", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -34,7 +34,7 @@ model-index:
|
|
| 34 |
- task:
|
| 35 |
type: multimodal
|
| 36 |
dataset:
|
| 37 |
-
name:
|
| 38 |
type: percepTest
|
| 39 |
metrics:
|
| 40 |
- type: accuracy
|
|
|
|
| 34 |
- task:
|
| 35 |
type: multimodal
|
| 36 |
dataset:
|
| 37 |
+
name: Perception Test
|
| 38 |
type: percepTest
|
| 39 |
metrics:
|
| 40 |
- type: accuracy
|