How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="speakleash/Bielik-11B-v2.6-Instruct-AWQ")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("speakleash/Bielik-11B-v2.6-Instruct-AWQ")
model = AutoModelForCausalLM.from_pretrained("speakleash/Bielik-11B-v2.6-Instruct-AWQ")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
Quick Links

Bielik-11B-v2.6-Instruct-AWQ

This repo contains AWQ format model files for SpeakLeash's Bielik-11B-v2.6-Instruct.

DISCLAIMER: Be aware that quantised models show reduced response quality and possible hallucinations!

Model description:

Responsible for model quantization

  • Remigiusz KinasSpeakLeash - team leadership, conceptualizing, calibration data preparation, process creation and quantized model delivery.

Contact Us

If you have any questions or suggestions, please use the discussion tab. If you want to contact us directly, join our Discord SpeakLeash.

Downloads last month
108
Safetensors
Model size
11B params
Tensor type
I32
·
F16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for speakleash/Bielik-11B-v2.6-Instruct-AWQ

Quantized
(8)
this model

Collection including speakleash/Bielik-11B-v2.6-Instruct-AWQ