Video-Text-to-Text
Transformers
Safetensors
English
videochat_flash_qwen
feature-extraction
multimodal
custom_code
Eval Results (legacy)
Instructions to use OpenGVLab/VideoChat-Flash-Qwen2_5-2B_res448 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use OpenGVLab/VideoChat-Flash-Qwen2_5-2B_res448 with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("OpenGVLab/VideoChat-Flash-Qwen2_5-2B_res448", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -95,7 +95,7 @@ VideoChat-Flash-2B is constructed upon UMT-L (300M) and Qwen2_5-2B, employing on
|
|
| 95 |
|
| 96 |
First, you need to install [flash attention2](https://github.com/Dao-AILab/flash-attention) and some other modules. We provide a simple installation example below:
|
| 97 |
```
|
| 98 |
-
pip install transformers==4.
|
| 99 |
pip install timm
|
| 100 |
pip install av
|
| 101 |
pip install imageio
|
|
|
|
| 95 |
|
| 96 |
First, you need to install [flash attention2](https://github.com/Dao-AILab/flash-attention) and some other modules. We provide a simple installation example below:
|
| 97 |
```
|
| 98 |
+
pip install transformers==4.40.1
|
| 99 |
pip install timm
|
| 100 |
pip install av
|
| 101 |
pip install imageio
|