Instructions to use apple/DFN5B-CLIP-ViT-H-14-378 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- OpenCLIP
How to use apple/DFN5B-CLIP-ViT-H-14-378 with OpenCLIP:
import open_clip model, preprocess_train, preprocess_val = open_clip.create_model_and_transforms('hf-hub:apple/DFN5B-CLIP-ViT-H-14-378') tokenizer = open_clip.get_tokenizer('hf-hub:apple/DFN5B-CLIP-ViT-H-14-378') - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -4,7 +4,7 @@ license_name: apple-sample-code-license
|
|
| 4 |
license_link: LICENSE
|
| 5 |
---
|
| 6 |
A CLIP (Contrastive Language-Image Pre-training) model trained on DFN-5B.
|
| 7 |
-
Data Filtering Networks (DFNs) are small used to automatically filter large pools of uncurated data.
|
| 8 |
This model was trained on 5B images that were filtered from a pool of 43B uncurated image-text pairs
|
| 9 |
(12.8B image-text pairs from CommonPool-12.8B + 30B additional public image-text pairs).
|
| 10 |
|
|
|
|
| 4 |
license_link: LICENSE
|
| 5 |
---
|
| 6 |
A CLIP (Contrastive Language-Image Pre-training) model trained on DFN-5B.
|
| 7 |
+
Data Filtering Networks (DFNs) are small networks used to automatically filter large pools of uncurated data.
|
| 8 |
This model was trained on 5B images that were filtered from a pool of 43B uncurated image-text pairs
|
| 9 |
(12.8B image-text pairs from CommonPool-12.8B + 30B additional public image-text pairs).
|
| 10 |
|