metadata
title: Street Descriptions using MuniVis
emoji: 🛣️
colorFrom: indigo
colorTo: blue
sdk: gradio
sdk_version: 5.26.0
app_file: app.py
pinned: false
Street Descriptions using MuniVis Model
This Space provides a simple Gradio interface for generating image captions using a fine-tuned BLIP model hosted at saja003/MuniVis. It also includes the option to translate the generated English captions into Arabic using a MarianMT translation model.
Features
- Image captioning with a fine-tuned BLIP model.
- Language selection: English or Arabic.
- Upload any street or urban image to generate a descriptive caption.
- Translation powered by Helsinki-NLP/opus-mt-en-ar.
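The features above map onto a small Gradio interface with an image input, a language selector, and a text output. The sketch below is only an illustration of that wiring, not the Space's actual app.py: `placeholder_caption`, `build_demo`, and the widget labels are hypothetical names, and the real captioning function would replace the placeholder.

```python
LANGUAGES = ["English", "Arabic"]


def placeholder_caption(image, language):
    """Hypothetical stand-in for the real captioning function."""
    if image is None:
        return "Please upload an image."
    return f"[{language}] caption goes here"


def build_demo(fn=placeholder_caption):
    """Assemble the interface; `fn` receives (PIL image, language)."""
    # Imported lazily so the helper above can be used without Gradio installed.
    import gradio as gr

    return gr.Interface(
        fn=fn,
        inputs=[
            gr.Image(type="pil", label="Street image"),
            gr.Radio(choices=LANGUAGES, value="English", label="Caption language"),
        ],
        outputs=gr.Textbox(label="Caption"),
        title="Street Descriptions using MuniVis",
    )


# To start the app locally: build_demo().launch()
```

Passing `type="pil"` asks Gradio to hand the upload to the function as a PIL image, which is the form image processors in transformers accept directly.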
How it works
- Upload an image.
- Select a language (English or Arabic).
- Receive a descriptive caption of the scene.
- If Arabic is selected, the caption is translated automatically.
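The steps above amount to a short routing function: caption first, then translate only when Arabic is selected. Here is a minimal sketch of that control flow, assuming the captioner and translator are passed in as plain callables. The names `describe_image`, `fake_caption`, and `fake_translate` are hypothetical, and the stubs stand in for the real models so the logic can be run without downloading anything.

```python
from typing import Any, Callable


def describe_image(
    image: Any,
    language: str,
    caption_fn: Callable[[Any], str],
    translate_fn: Callable[[str], str],
) -> str:
    """Caption an image in English, translating only if Arabic is requested."""
    if language not in ("English", "Arabic"):
        raise ValueError(f"Unsupported language: {language!r}")
    caption = caption_fn(image).strip()
    if language == "Arabic":
        return translate_fn(caption)
    return caption


# Stub models (assumptions for illustration only).
def fake_caption(image: Any) -> str:
    return "a wide street with parked cars "


def fake_translate(text: str) -> str:
    return f"<ar>{text}</ar>"


english = describe_image(None, "English", fake_caption, fake_translate)
arabic = describe_image(None, "Arabic", fake_caption, fake_translate)
```

Keeping the models behind callables means the translation model is only invoked on the Arabic path, and the routing can be tested independently of either model.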
Models Used
- BLIP (Bootstrapping Language-Image Pre-training):
  - Fine-tuned on street and infrastructure images.
  - Model repo: saja003/MuniVis
- MarianMT translation model:
  - Helsinki-NLP/opus-mt-en-ar, for English-to-Arabic translation.
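As a rough sketch, both models can be loaded and run with the standard transformers classes for BLIP and MarianMT. This assumes the saja003/MuniVis repo ships its processor files alongside the weights; if it does not, the processor would have to come from the base BLIP checkpoint instead. The function names and the `max_new_tokens` default are illustrative choices, not taken from the Space's code. Note that `MarianTokenizer` also needs the `sentencepiece` package installed.

```python
CAPTION_REPO = "saja003/MuniVis"
TRANSLATION_REPO = "Helsinki-NLP/opus-mt-en-ar"


def load_models():
    """Download (or load from cache) the captioning and translation models."""
    # Imported lazily so the helpers below stay importable without transformers.
    from transformers import (
        BlipForConditionalGeneration,
        BlipProcessor,
        MarianMTModel,
        MarianTokenizer,
    )

    # Assumption: the fine-tuned repo includes its own processor config.
    processor = BlipProcessor.from_pretrained(CAPTION_REPO)
    captioner = BlipForConditionalGeneration.from_pretrained(CAPTION_REPO)
    tokenizer = MarianTokenizer.from_pretrained(TRANSLATION_REPO)
    translator = MarianMTModel.from_pretrained(TRANSLATION_REPO)
    return processor, captioner, tokenizer, translator


def caption_image(image, processor, captioner, max_new_tokens=50):
    """Generate an English caption for a PIL image."""
    inputs = processor(images=image, return_tensors="pt")
    output_ids = captioner.generate(**inputs, max_new_tokens=max_new_tokens)
    return processor.decode(output_ids[0], skip_special_tokens=True).strip()


def translate_to_arabic(text, tokenizer, translator):
    """Translate an English caption to Arabic."""
    batch = tokenizer([text], return_tensors="pt", padding=True, truncation=True)
    output_ids = translator.generate(**batch)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

A typical call sequence would be `processor, captioner, tokenizer, translator = load_models()`, then `caption_image(img, processor, captioner)`, followed by `translate_to_arabic(...)` only when Arabic output is wanted.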
Requirements
This Space uses the following libraries:
- transformers
- torch
- gradio
- Pillow