TemporalMesh Transformer: 29.4 PPL at 48% compute — beats Mamba, new open-source architecture
#11 opened 11 days ago
by
vigneshwar234
Is MTP supported by this model?
#10 opened 2 months ago
by
Bronyaaa
Please, we need NVFP4 versions that are official and tested against original Qwen3.5 model(s)
#8 opened 3 months ago
by
GabrielaCats
## Real-World Performance on 4x RTX PRO 6000 (SM120) -- Honest Numbers
❤️ 5
3
#7 opened 3 months ago
by
brandonmusic
27B NVFP4 PLEASE
👀 1
2
#6 opened 4 months ago
by
berkerdooo
NVFP4 for Qwen3.5-27B
➕ 1
9
#5 opened 4 months ago
by
faheemraza1
Please release a checkpoint for Qwen/Qwen3.5-122B-A10B
➕👍 4
3
#4 opened 4 months ago
by
Qnibbles
How to run on SM120
❤️ 3
#3 opened 4 months ago
by
catid
Support SM120
❤️👍 19
6
#2 opened 4 months ago
by
darkstar3537