Insights
MedGemma 1.5 is an open medical AI model that adds support for 3D CT/MRI volumes and whole‑slide histopathology while remaining available as a compact 4B multimodal option. It is useful for prototyping image‑plus‑text tasks, but the developer documentation stresses that it is not cleared for direct clinical decision‑making without local validation.
Key Facts
- MedGemma 1.5 supports multimodal inputs including 3D CT/MRI slices and whole‑slide histopathology in a developer release.
- The compact 4B multimodal variant is designed to run on a single modern GPU, while the 27B variants require substantially more memory and sharded inference.
- Official benchmarks report improvements (e.g., CT macro accuracy ~61.1%), but many evaluations mix public and internal datasets and require independent validation.
Introduction
Who: Google published MedGemma 1.5 as part of its Health AI developer resources. What: the release brings multimodal capabilities for medical images plus text, notably handling 3D volumes. Why it matters now: open weights and example notebooks let developers experiment locally, but practical use needs careful preprocessing and safety checks.
What is new
MedGemma 1.5 extends the Gemma family with a 4B multimodal model that accepts text and images and adds explicit support for multi‑slice CT and MRI volumes plus whole‑slide images. The release includes a medical image encoder called MedSigLIP, example notebooks for CT and WSI preprocessing, and model weights packaged for Hugging Face and Google’s Model Garden. Reported numbers in the model card show improved performance over prior MedGemma versions on several medical tasks — for example a CT macro accuracy around 61.1% on an internal benchmark — but the documentation also notes many tests use a mix of public and licensed internal datasets.
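The CT preprocessing covered by the example notebooks revolves around steps like converting raw Hounsfield units into the 8‑bit pixel range an image encoder expects. A minimal sketch of such intensity windowing is shown below; the function name and the soft‑tissue window values (center 40, width 400) are illustrative assumptions, not taken from the release.

```python
import numpy as np

def window_ct(volume_hu: np.ndarray, center: float = 40.0, width: float = 400.0) -> np.ndarray:
    """Clip a CT volume (in Hounsfield units) to a display window and
    rescale to uint8 — a common step before tiling slices for an image encoder."""
    lo = center - width / 2.0
    hi = center + width / 2.0
    clipped = np.clip(volume_hu, lo, hi)
    scaled = (clipped - lo) / (hi - lo) * 255.0
    return scaled.astype(np.uint8)

# Toy 2-slice "volume" in Hounsfield units: air (-1000), water (0), tissue, bone.
fake_volume = np.array([[[-1000.0, 0.0], [40.0, 240.0]],
                        [[-160.0, 240.0], [500.0, 1000.0]]])
windowed = window_ct(fake_volume)  # values outside [-160, 240] saturate to 0 or 255
```

Real pipelines would first apply the DICOM RescaleSlope/RescaleIntercept to recover Hounsfield units, but the windowing step itself is this simple.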
What it means
For developers and researchers, MedGemma 1.5 lowers the barrier to building multimodal medical prototypes: you can feed images and clinical text into a single model and get free‑text outputs, summaries or visual question answers. The compact 4B model is intentionally compute‑efficient and can run on a modern GPU with roughly 10–16 GB of VRAM in practical setups; the larger 27B variants offer stronger text reasoning but need sharded inference. An important caveat: the model card explicitly warns that the model is not a clinical device. Its outputs may be helpful for drafting reports or triage support, but they should not replace clinician judgement or validated diagnostic software without local trials and regulatory review.
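The VRAM figures above can be sanity‑checked with back‑of‑the‑envelope arithmetic: the weights of a 4B‑parameter model in bfloat16 alone take about 8 GB, before activations and KV cache. A rough estimator follows; the 1.3× overhead factor is an assumption for illustration, not an official figure.

```python
def estimate_vram_gb(params_billions: float, bytes_per_param: int = 2,
                     overhead_factor: float = 1.3) -> float:
    """Rough VRAM estimate: weight bytes (params * bytes/param) times a
    fudge factor covering activations, KV cache, and framework overhead."""
    weights_gb = params_billions * 1e9 * bytes_per_param / 1e9
    return weights_gb * overhead_factor

# 4B model in bf16: ~8 GB of weights, ~10.4 GB with overhead —
# consistent with the 10-16 GB range quoted above.
print(round(estimate_vram_gb(4), 1))
# 27B model in bf16: ~70 GB with overhead, hence the need for sharded inference.
print(round(estimate_vram_gb(27), 1))
```

Quantized weights (e.g. 1 byte per parameter for int8) roughly halve the weight term, which is why 4‑bit and 8‑bit checkpoints are popular for single‑GPU experimentation.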
What comes next
Teams that want to use MedGemma 1.5 should first reproduce official examples on public datasets, then run independent validation on their own clinical data to catch distribution shifts or hidden biases. Engineering work will focus on DICOM ingestion, tiling/normalizing slices into the model’s image tokens, and handling long contexts or KV cache limits — full‑volume CTs often require chunking or offloading. Regulators and hospitals will likely insist on documented validation and safety testing before any clinical deployment; for research and internal tools, expect short‑term prototypes and longer validation phases for production use.
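Chunking a full volume into context‑sized batches of slices, as described above, is mostly bookkeeping. A minimal sketch is below; the chunk size of 8 slices is an illustrative assumption, not a documented model limit.

```python
import numpy as np

def chunk_slices(volume: np.ndarray, slices_per_chunk: int = 8) -> list:
    """Split a CT volume of shape (depth, H, W) into chunks along the slice
    axis so that each chunk fits within the model's image-token budget."""
    depth = volume.shape[0]
    return [volume[i:i + slices_per_chunk] for i in range(0, depth, slices_per_chunk)]

volume = np.zeros((20, 64, 64), dtype=np.uint8)  # toy 20-slice volume
chunks = chunk_slices(volume)
print([c.shape[0] for c in chunks])  # [8, 8, 4]
```

In practice each chunk would be encoded separately and the per‑chunk outputs aggregated (or the KV cache offloaded between chunks), but the slicing itself stays this trivial.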
Conclusion
MedGemma 1.5 brings practical multimodal capabilities to an open medical AI model family and makes 3D image + text experimentation easier for developers. It is a strong prototype and research tool, but its outputs require careful validation before any clinical use.
Join the conversation: share your experience with MedGemma 1.5 or ask questions to compare notes and best practices.