---
license: apache-2.0
tags:
- echocardiography
- medical
- report-generation
- multimodal
- cardiology
- ultrasound
library_name: pytorch
pipeline_tag: text-generation
base_model: google/medgemma-1.5-4b-it
---

# EchoGemma

Multimodal echocardiography report generation from DICOM studies.

EchoGemma combines an **EchoPrime video encoder** and a **LoRA-fine-tuned MedGemma language model** to process full echocardiographic studies and generate clinical text reports.

## Input

A folder of DICOM echocardiography video files (a complete study). The model processes all video clips, extracts embeddings and view classifications, then generates a structured clinical report.

## Output

A structured echocardiography text report.

## Requirements

- Python >= 3.10
- PyTorch 2.10+
- CUDA-capable GPU (recommended)
- ~18 GB disk space for model weights
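
The requirements above can be checked before downloading the weights. The following is a minimal sketch and not part of EchoGemma itself: the names `check_environment` and `parse_major_minor` are hypothetical, and the 18 GB threshold is simply the approximate figure listed above. A missing GPU is reported as a note rather than a failure, since CUDA is only recommended.

```python
import importlib.util
import re
import shutil
import sys

# Thresholds copied from the Requirements list; adjust if they change.
MIN_PYTHON = (3, 10)
MIN_TORCH = (2, 10)
REQUIRED_DISK_GB = 18  # approximate size of the model weights


def parse_major_minor(version: str) -> tuple[int, int]:
    """Turn a version string such as '2.10.0+cu128' into (2, 10)."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return int(match.group(1)), int(match.group(2))


def check_environment(weights_dir: str = ".") -> list[str]:
    """Return human-readable notes for every requirement that is not met."""
    notes = []

    if sys.version_info[:2] < MIN_PYTHON:
        notes.append(f"Python {sys.version_info[0]}.{sys.version_info[1]} is older than 3.10")

    if importlib.util.find_spec("torch") is None:
        notes.append("PyTorch is not installed")
    else:
        import torch  # imported lazily so the script still runs without it

        if parse_major_minor(torch.__version__) < MIN_TORCH:
            notes.append(f"PyTorch {torch.__version__} is older than 2.10")
        if not torch.cuda.is_available():
            notes.append("no CUDA device found (GPU is recommended, not required)")

    # Free space on the filesystem that will hold the weights.
    free_gb = shutil.disk_usage(weights_dir).free / 1e9
    if free_gb < REQUIRED_DISK_GB:
        notes.append(f"only {free_gb:.1f} GB free, about {REQUIRED_DISK_GB} GB needed")

    return notes


if __name__ == "__main__":
    for note in check_environment():
        print("warning:", note)
```

Versions are compared as integer tuples rather than strings because `"2.9" < "2.10"` is false as a string comparison, while `(2, 9) < (2, 10)` gives the intended result.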