mlx-community/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-mxfp4

This model was converted to MLX format from nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16 using mlx-vlm version 0.4.5. Refer to the original model card for more details on the model.

Use with mlx

pip install -U mlx-vlm

python -m mlx_vlm.generate --model mlx-community/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-mxfp4 --max-tokens 100 --temperature 0.0 --prompt "Describe this image." --image <path_to_image>

Downloads last month: 353

Safetensors

Model size

7B params

Tensor type

U32

BF16

MLX

Hardware compatibility

4-bit

Model tree for mlx-community/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-mxfp4

Base model

nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16

Quantized

(51)

this model

Dataset used to train mlx-community/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-mxfp4

Collection including mlx-community/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-mxfp4

Nvidia Nemotron-3-Nano-Omni

Collection

8 items • Updated Apr 28 • 2