SONAR SAEs
Collection
Sparse Auto-Encoders for SONAR sentence embeddings, from Pochinkov & Darmawan (2025) (EACL submission).
Raw wandb run directories (config, summary, system metrics, scalar training curves) matching the SAE checkpoints in:
- nickypro/sonar-saes-large: scaled-up BatchTopK
- nickypro/sonar-saes-comparison: four-variant comparison

Run IDs in directory names match the checkpoint run IDs in those repos.
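Because the run IDs are embedded in the directory names, the mirrored runs can be indexed by ID and joined against the checkpoint repos. The following is a minimal sketch, not the authors' tooling. It assumes the directories follow wandb's default local naming (`run-<YYYYMMDD_HHMMSS>-<run_id>`) and keep the summary at `files/wandb-summary.json`; the layout of this repo may differ, and the helper names `run_id_from_dir` and `load_summaries` are hypothetical.

```python
import json
import re
from pathlib import Path

# Assumption: wandb's default local naming, "run-<YYYYMMDD_HHMMSS>-<run_id>"
# (optionally prefixed with "offline-"). Adjust if this repo names runs differently.
RUN_DIR_PATTERN = re.compile(
    r"^(?:offline-)?run-\d{8}_\d{6}-(?P<run_id>[A-Za-z0-9]+)$"
)


def run_id_from_dir(name):
    """Return the run ID embedded in a run directory name, or None if no match."""
    match = RUN_DIR_PATTERN.match(name)
    return match.group("run_id") if match else None


def load_summaries(root):
    """Map run ID -> parsed summary dict for every run directory under `root`.

    Assumption: each run stores its final scalar metrics in
    files/wandb-summary.json, as wandb does by default.
    """
    summaries = {}
    for path in sorted(Path(root).iterdir()):
        if not path.is_dir():
            continue
        run_id = run_id_from_dir(path.name)
        if run_id is None:
            continue  # not a run directory under the assumed naming scheme
        summary_file = path / "files" / "wandb-summary.json"
        if summary_file.is_file():
            summaries[run_id] = json.loads(summary_file.read_text())
    return summaries
```

Calling `load_summaries(".")` from the root of a local clone would then return one entry per run, keyed by the same ID that appears in the checkpoint repos, provided the layout assumptions hold.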
Interpretability of Text Auto-Encoders using Sparse Auto-Encoders: A Sandbox for Interpreting Neuralese. Nicky Pochinkov & Jason Rich Darmawan, EACL 2026 (submitted).
The original wandb runs were logged to the
seperability wandb team. This repo
mirrors the on-disk run directories for archival reproducibility; for
live dashboards, see the wandb project directly.