---
license: apache-2.0
tags:
- echocardiography
- medical
- report-generation
- multimodal
- cardiology
- ultrasound
library_name: pytorch
pipeline_tag: text-generation
base_model: google/medgemma-1.5-4b-it
---

# EchoGemma

Multimodal echocardiography report generation from DICOM studies. EchoGemma combines an **EchoPrime video encoder** and a **LoRA-fine-tuned MedGemma language model** to process full echocardiographic studies and generate clinical text reports.


## Input

A folder of DICOM echocardiography video files (a complete study). The model processes all video clips, extracts embeddings and view classifications, then generates a structured clinical report.

## Output

A structured echocardiography text report.

## Requirements

- Python >= 3.10
- PyTorch 2.10+
- CUDA-capable GPU (recommended)
- ~18 GB disk space for model weights