SONAR SAEs β€” wandb logs

Raw wandb run directories (config, summary, system metrics, scalar training curves) matching the SAE checkpoints in:

Run IDs in directory names match the checkpoint run IDs in those repos.

Companion paper

Interpretability of Text Auto-Encoders using Sparse Auto-Encoders: A Sandbox for Interpreting Neuralese. Nicky Pochinkov & Jason Rich Darmawan, EACL 2026 (submitted).

Use

The original wandb runs were logged to the seperability wandb team. This repo mirrors the on-disk run directories for archival reproducibility; for live dashboards, see the wandb project directly.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Collection including nickypro/sonar-saes-wandb-logs