VisualEars FastConformer FA 69M 256x40 warm-start

This repository is auto-updated from the live VisualEars 69M warm-start training run.

Current best checkpoint

  • Source checkpoint: /workspace/train69_runs/fa69m_256x40_warmstart_full9669_20260615T015714Z/fa69m_256x40_warmstart_bpe1024/2026-06-15_01-57-22/checkpoints/fa69m_256x40_warmstart_bpe1024--val_wer=0.1551-epoch=0.ckpt
  • Hub path: checkpoints/fa69m_256x40_warmstart_bpe1024--val_wer=0.1551-epoch=0.ckpt
  • RNNT validation WER (read from the checkpoint name): 0.1551, i.e. 15.51%
  • Size: 837980369 bytes
  • SHA256: 5cca73aee0f42d9e577eb4c9421a3d8c4e09777a55d18264cbd7553ab4401354
  • Uploaded at UTC: 2026-06-15T16:21:08Z
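The validation WER above is taken from the checkpoint file name rather than from a separate metrics file. A minimal sketch of how that value could be recovered programmatically is shown here. The pattern `<run>--val_wer=<float>-epoch=<int>.ckpt` is inferred from the two checkpoint names in this card, and `parse_checkpoint_name` is a hypothetical helper, not part of the training code.

```python
import re
from pathlib import PurePosixPath

# Assumed naming scheme, inferred from the file names listed in this card:
#   <run_name>--val_wer=<float>-epoch=<int>.ckpt
CKPT_PATTERN = re.compile(
    r"^(?P<run>.+)--val_wer=(?P<wer>\d+\.\d+)-epoch=(?P<epoch>\d+)\.ckpt$"
)


def parse_checkpoint_name(path: str) -> dict:
    """Extract run name, validation WER, and epoch from a checkpoint path."""
    name = PurePosixPath(path).name  # drop any directory prefix
    match = CKPT_PATTERN.match(name)
    if match is None:
        raise ValueError(f"unrecognized checkpoint name: {name}")
    return {
        "run": match["run"],
        "val_wer": float(match["wer"]),
        "epoch": int(match["epoch"]),
    }


info = parse_checkpoint_name(
    "checkpoints/fa69m_256x40_warmstart_bpe1024--val_wer=0.1551-epoch=0.ckpt"
)
print(info)
```

The same helper applies to the warm-start checkpoint name, which follows the same scheme with `val_wer=0.1720`.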

Architecture

  • d_model: 256
  • n_heads: 4
  • n_layers: 40
  • pred_hidden: 640
  • joint_hidden: 640
  • Parameter count: 69.627906M

Warm-start

  • Warm-start checkpoint: /workspace/train32_runs/fa32m_streaming_bpe1024_full9669_20260614T130932Z/fa32m_streaming_bpe1024/2026-06-14_13-09-40/checkpoints/fa32m_streaming_bpe1024--val_wer=0.1720-epoch=0.ckpt
  • Matched keys: 619
  • Matched params: 31648226
  • Matched encoder layers: [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15]
  • Shape mismatches: 0

Encoder layers with no counterpart in the warm-start checkpoint are near-identity initialized; see launch_stats.json and best_checkpoint.json for details.
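To illustrate what the warm-start statistics above could mean, here is a minimal sketch of name-and-shape matching between a smaller source model and a larger target model. It is an assumption about the general technique, not the actual warm-start code of this run: `match_warm_start` is a hypothetical helper, the parameter names in the toy example are made up, and the near-identity initialization of the new layers is not reproduced.

```python
from math import prod
from typing import Dict, Tuple

Shape = Tuple[int, ...]


def match_warm_start(source: Dict[str, Shape], target: Dict[str, Shape]) -> dict:
    """Compare parameter names and shapes of a source and a target model.

    A key counts as matched when it exists in both models with the same
    shape; such tensors could be copied from source to target as-is.
    """
    matched, mismatched = [], []
    for name, shape in source.items():
        if name not in target:
            continue  # source-only parameter, nothing to copy into
        if target[name] == shape:
            matched.append(name)
        else:
            mismatched.append(name)
    return {
        "matched_keys": len(matched),
        "matched_params": sum(prod(source[name]) for name in matched),
        "shape_mismatches": len(mismatched),
        # Target-only parameters need a fresh initialization.
        "new_keys": sorted(set(target) - set(source)),
    }


# Toy example: a 2-layer source warm-starting a 3-layer target.
source_shapes = {
    "encoder.layers.0.w": (4, 4),
    "encoder.layers.1.w": (4, 4),
    "decoder.w": (8,),
}
target_shapes = dict(source_shapes, **{"encoder.layers.2.w": (4, 4)})

stats = match_warm_start(source_shapes, target_shapes)
print(stats)
```

With PyTorch state dicts, the shape mappings could be built as `{k: tuple(v.shape) for k, v in state_dict.items()}`. Using the figures reported in this card, 31648226 matched parameters out of roughly 69.63M total means a bit under half of the model starts from the warm-start checkpoint.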

Final exported artifact

  • Final source: /workspace/train69_runs/fa69m_256x40_warmstart_full9669_20260615T015714Z/fa69m_256x40_warmstart_bpe1024_final.nemo
  • Hub path: final/fa69m_256x40_warmstart_bpe1024_final.nemo
  • Size: 279511040 bytes
  • SHA256: c6cbaf40d12f635024be0c748fe9b4613604bd4decc31289ff66c55adbda6ea0
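The sizes and SHA256 digests listed in this card can be used to check a downloaded file for corruption or truncation. The following is a minimal sketch using only the Python standard library; `verify_artifact` is a hypothetical helper, and the local paths assume the files were downloaded with the same relative layout as the Hub paths above.

```python
import hashlib
import os

# Expected values copied from this card, keyed by Hub path.
EXPECTED = {
    "checkpoints/fa69m_256x40_warmstart_bpe1024--val_wer=0.1551-epoch=0.ckpt": (
        837980369,
        "5cca73aee0f42d9e577eb4c9421a3d8c4e09777a55d18264cbd7553ab4401354",
    ),
    "final/fa69m_256x40_warmstart_bpe1024_final.nemo": (
        279511040,
        "c6cbaf40d12f635024be0c748fe9b4613604bd4decc31289ff66c55adbda6ea0",
    ),
}


def verify_artifact(path: str, expected_size: int, expected_sha256: str,
                    chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the expected size and SHA256."""
    if os.path.getsize(path) != expected_size:
        return False  # cheap check first, avoids hashing a truncated file
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read in chunks so large checkpoints are not loaded into memory at once.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_sha256.lower()


for local_path, (size, sha256) in EXPECTED.items():
    if os.path.exists(local_path):  # only check files that were downloaded
        ok = verify_artifact(local_path, size, sha256)
        print(f"{local_path}: {'OK' if ok else 'MISMATCH'}")
```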