Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

nvidia
/
Qwen3.5-397B-A17B-NVFP4

Text Generation
Safetensors
Model Optimizer
qwen3_5_moe
nvidia
ModelOpt
Qwen3.5
quantized
FP4
fp4
conversational
modelopt
Model card Files Files and versions
xet
Community
11
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

TemporalMesh Transformer: 29.4 PPL at 48% compute — beats Mamba, new open-source architecture

#11 opened 11 days ago by
vigneshwar234

Is MTP supported by this model?

#10 opened 2 months ago by
Bronyaaa

Please, we need NVFP4 versions that are official and tested against original Qwen3.5 model(s)

#8 opened 3 months ago by
GabrielaCats

## Real-World Performance on 4x RTX PRO 6000 (SM120) -- Honest Numbers

❤️ 5
3
#7 opened 3 months ago by
brandonmusic

27B NVFP4 PLEASE

👀 1
2
#6 opened 4 months ago by
berkerdooo

NVFP4 for Qwen3.5-27B

➕ 1
9
#5 opened 4 months ago by
faheemraza1

Please release a checkpoint for Qwen/Qwen3.5-122B-A10B

➕👍 4
3
#4 opened 4 months ago by
Qnibbles

How to run on SM120

❤️ 3
#3 opened 4 months ago by
catid

Support SM120

❤️👍 19
6
#2 opened 4 months ago by
darkstar3537

Getting nvidia/Qwen3.5-397B-A17B-NVFP4 running with SGLang (requires transformers v5) on RTX PRO 6000 (blackwell) CUDA 12.9

🔥 1
6
#1 opened 4 months ago by
bullpoint
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs