adamo1139's picture
Update README.md
78269b2 verified
|
raw
history blame
231 Bytes
metadata
base_model:
  - deepseek-ai/DeepSeek-V2.5-1210

AWQ quantization of DeepSeek-V2.5-1210

To run on 8xH100 80GB, you can use vLLM with:

vllm serve adamo1139/DeepSeek-V2.5-1210-AWQ --tensor-parallel 8 --trust-remote-code