Article nanoVLM: The simplest repository to train your VLM in pure PyTorch +5 ariG23498, lusxvr, andito, sergiopaniego, merve, pcuenq, reach-vb • May 21, 2025 • 258
D-FINE Collection State-of-the-art real-time object detection model with Apache 2.0 licence • 15 items • Updated May 5, 2025 • 56
Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM +2 ariG23498, merve, pcuenq, reach-vb • Mar 12, 2025 • 497
Article Open-source DeepResearch – Freeing our search agents +3 m-ric, albertvillanova, merve, thomwolf, clefourrier • Feb 4, 2025 • 1.32k
DataGemma Release Collection A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated Mar 12 • 89
SigLIP Collection Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 10 items • Updated Mar 12 • 66
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. • 32 items • Updated Mar 12 • 153
Turkish Vision-Language Datasets Collection Collection of Turkish vision-language datasets. • 29 items • Updated Mar 2 • 11
Article Assisted Generation: a new direction toward low-latency text generation joaogante • May 11, 2023 • 79
Article Llama can now see and run on your device - welcome Llama 3.2 +5 merve, philschmid, osanseviero, reach-vb, lewtun, ariG23498, pcuenq • Sep 25, 2024 • 191
Article Fine-Tune ViT for Image Classification with 🤗 Transformers nateraw • Feb 11, 2022 • 61
Article CodeGemma - an official Google release for code LLMs +4 pcuenq, osanseviero, reach-vb, philschmid, mishig, loubnabnl • Apr 9, 2024 • 107