Instructions to use Y-J-Ju/SaHa-Qwen2-VL-2B-Instruct with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Y-J-Ju/SaHa-Qwen2-VL-2B-Instruct with Transformers:
# Load model directly from transformers import AutoProcessor, AutoModelForMultimodalLM processor = AutoProcessor.from_pretrained("Y-J-Ju/SaHa-Qwen2-VL-2B-Instruct") model = AutoModelForMultimodalLM.from_pretrained("Y-J-Ju/SaHa-Qwen2-VL-2B-Instruct") - Notebooks
- Google Colab
- Kaggle
update readme
Browse files
README.md
CHANGED
|
@@ -142,7 +142,7 @@ The model's performance was evaluated on the MMEB evaluation set, which includes
|
|
| 142 |
The SaHa-Qwen2-VL-2B-Instruct model achieves state-of-the-art performance in its parameter class on the MMEB benchmark, outperforming methods that rely on large-scale contrastive pre-training.
|
| 143 |
|
| 144 |
| Model | Params | Classification | Retrieval | VQA | Grounding | **IND** | **OND** | **Overall Avg.** |
|
| 145 |
-
| :--- | :---: | :---: | :---: | :---: | :---: | :---: |
|
| 146 |
| Ours (SaHa-Qwen2-VL-2B) | 2.2B | 65.4 | 70.0 | 59.1 | 83.0 | **71.2** | **62.1** | **67.1** |
|
| 147 |
|
| 148 |
|
|
|
|
| 142 |
The SaHa-Qwen2-VL-2B-Instruct model achieves state-of-the-art performance in its parameter class on the MMEB benchmark, outperforming methods that rely on large-scale contrastive pre-training.
|
| 143 |
|
| 144 |
| Model | Params | Classification | Retrieval | VQA | Grounding | **IND** | **OND** | **Overall Avg.** |
|
| 145 |
+
| :--- | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
|
| 146 |
| Ours (SaHa-Qwen2-VL-2B) | 2.2B | 65.4 | 70.0 | 59.1 | 83.0 | **71.2** | **62.1** | **67.1** |
|
| 147 |
|
| 148 |
|