---
library_name: ilex
tags:
- jax
- equinox
- ilex
- neuroimaging
- 3d-mri
license: mit
license_link: https://github.com/wangshansong1/Triad/blob/main/LICENSE
---

# Triad-Swin (3D-MRI self-supervised Swin-B backbone) -- Triad Swin-B encoder (SimMIM-pretrained)

## Description

Triad vision foundation model for 3D MRI, ported to JAX / Equinox from the upstream PyTorch release. Triad is an nnUNet PlainConvEncoder pretrained self-supervised on Triad-131K (131,170 3D MRI volumes spanning brain, breast, and prostate; T1/T2/FLAIR/DWI/fMRI/DCE) and serves as a transfer-learning backbone for downstream MRI segmentation, classification, and registration. The published checkpoints are encoder-only (the self-supervised decoder / mask token are stripped); this port exposes the pretrained encoder, whose multi-scale features are the transfer representation. Two backbone families are ported: the nnUNet PlainConvUNet encoder (TriadPlainConvUNet) and the 3D Swin Transformer encoder (TriadSwinViT, the Swin-B variant, via the shared nimox SwinViT primitive). Each is released under two self-supervised objectives -- masked autoencoding (MAE) and SimMIM -- as separate bundles (four in total).

## Intended use

As the MAE variant, but pretrained with the SimMIM masked-image-modelling objective. Same Swin-B encoder architecture and Triad-131K corpus; provided so downstream users can pick whichever SSL objective transfers better for their task. Encoder-only.

## Usage

```python
from ilex.models.triad import TriadSwinViT
model = TriadSwinViT.from_pretrained('ilex-hub/triad.swinb-simmim.1')
```

## Authors

Wang S., et al.

## Citation

Wang S., Safari M., Li Q., Chang C.-W., Qiu R. L. J., Roper J., Yu D. S., Yang X. (2025). Triad: Vision Foundation Model for 3D Magnetic Resonance Imaging. arXiv:2502.14064. The Swin backbone is MONAI's SwinUNETR swinViT (use_v2): Hatamizadeh A., Nath V., Tang Y., Yang D., Roth H., Xu D. (2022). Swin UNETR: Swin Transformers for Semantic Segmentation of Brain Tumors in MRI Images. BrainLes 2021. arXiv:2201.01266. Tang Y., et al. (2022). Self-Supervised Pre-Training of Swin Transformers for 3D Medical Image Analysis. CVPR. arXiv:2111.14791.

### References

- Wang S., et al. (2025). Triad: Vision Foundation Model for 3D Magnetic Resonance Imaging. arXiv:2502.14064. https://arxiv.org/abs/2502.14064
- Codebase: https://github.com/wangshansong1/Triad

## License

HF Hub license tag: `mit`

**Effective terms:** MIT (Shansong Wang et al.) on both the Triad code (https://github.com/wangshansong1/Triad) and the released pretrained checkpoints. No commercial restrictions; no gating required. The arXiv preprint (2502.14064) is separately distributed under CC BY 4.0, but the code and weights the ilex bundle re-expresses are MIT. The ilex JAX / Equinox port code is separately licensed under Apache-2.0 / GPL-3.0; that does not alter the upstream MIT terms governing the weights.

Upstream license reference: https://github.com/wangshansong1/Triad/blob/main/LICENSE

### Copyright

Network architecture and pretrained weights: copyright (c) the Triad authors, released under the MIT License. JAX / Equinox port: copyright (c) the ilex authors, released under the Apache-2.0 / GPL-3.0 dual license used by ilex itself.

## Upstream source

Original weights / reference implementation: https://github.com/wangshansong1/Triad

## Provenance

This artefact was produced by [ilex](https://github.com/hypercoil/ilex)'s
save/load pipeline. The architecture is implemented in
`ilex.models.triad.TriadSwinViT` and the weights have been converted
from their upstream format. See the upstream source above
for the canonical reference.