⚠️ WORK IN PROGRESS

Quantization Format: NVFP4 (compressed-tensors layout)
Status: Testing inference engine runtime and tensor mapping alignment.

This repository is currently a work in progress and is undergoing active calibration/testing for deployment on Blackwell hardware architectures.