Free, CPU-optimized on-prem ticket tagging models for helpdesk systems.
Note: CPU-optimized models use INT8 quantization.
Note: GPU-optimized models use BF16 precision.