octen-embedding-4b-w4a16 / generation_config.json
groxaxo's picture
Add W4A16 quantized Octen-Embedding-4B (auto-round-auto-gptq)
5864f58 verified
Raw
History Blame Contribute Delete
162 Bytes
{
"_from_model_config": true,
"bos_token_id": 151643,
"do_sample": true,
"eos_token_id": 151645,
"transformers_version": "5.6.2",
"use_cache": true
}