---
library_name: transformers
license: cc-by-nc-sa-4.0
tags: []
---

## Model Details

- Model: ReasonCLIP-B32-S1
- Base model: [openai/clip-vit-base-patch32](https://huggingface.co/openai/clip-vit-base-patch32)
- Architecture: CLIP ViT-B/32
- Image resolution: 224 × 224 pixels
- Training stage: Stage 1
- Training data: [ReasonLite-42M](https://huggingface.co/datasets/RISys-Lab/ReasonCLIPLite-42M)

## Method

![Method overview](https://raw.githubusercontent.com/RISys-Lab/ReasonCLIP/main/doc/method.png)

## Resources

- GitHub: [RISys-Lab/ReasonCLIP](https://github.com/RISys-Lab/ReasonCLIP)
- Paper: [arXiv:2606.26794](https://arxiv.org/abs/2606.26794)

## Usage

```python
from transformers import CLIPModel, CLIPProcessor

model_id = "RISys-Lab/ReasonCLIP-B32-S1"
model = CLIPModel.from_pretrained(model_id)
processor = CLIPProcessor.from_pretrained(model_id)
```
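
Once the model and processor are loaded, a typical next step is scoring an image against a few candidate captions. The following is a minimal sketch, assuming this checkpoint is used the same way as the base CLIP model through the standard `CLIPModel` / `CLIPProcessor` interface. The helper name `zero_shot_probs` is hypothetical and not part of the ReasonCLIP repository.

```python
import torch


def zero_shot_probs(model, processor, image, labels):
    """Return a {label: probability} dict for one image.

    Hypothetical helper: `model` and `processor` are assumed to be the
    CLIPModel and CLIPProcessor loaded in the snippet above, `image` a
    PIL image, and `labels` a list of candidate captions.
    """
    # Tokenize the captions and preprocess the image in a single call.
    inputs = processor(text=labels, images=image, return_tensors="pt", padding=True)

    # Inference only, so gradient tracking is disabled.
    with torch.no_grad():
        outputs = model(**inputs)

    # logits_per_image has shape (num_images, num_labels); softmax over the
    # label axis turns the similarity scores into probabilities.
    probs = outputs.logits_per_image.softmax(dim=-1)[0]
    return dict(zip(labels, probs.tolist()))
```

With `model` and `processor` loaded as above, a call might look like `zero_shot_probs(model, processor, Image.open("example.jpg"), ["a photo of a cat", "a photo of a dog"])`, where `Image` comes from `PIL` and both the file path and the captions are placeholders.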

For the full checkpoint list, see the [ReasonCLIP model card](https://github.com/RISys-Lab/ReasonCLIP/blob/main/doc/model_card.md).