How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="speakleash/Bielik-11B-v2.6-Instruct-bnb-4bit")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("speakleash/Bielik-11B-v2.6-Instruct-bnb-4bit")
model = AutoModelForCausalLM.from_pretrained("speakleash/Bielik-11B-v2.6-Instruct-bnb-4bit")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
Quick Links

Bielik-11B-v2.6-Instruct-bnb-4bit

This repo contains Bitsandbytes format model files for SpeakLeash's Bielik-11B-v2.6-Instruct.

DISCLAIMER: Be aware that quantised models show reduced response quality and possible hallucinations!

Model description:

Responsible for model quantization

  • Remigiusz KinasSpeakLeash - team leadership, conceptualizing, calibration data preparation, process creation and quantized model delivery.

Contact Us

If you have any questions or suggestions, please use the discussion tab. If you want to contact us directly, join our Discord SpeakLeash.

Downloads last month
2
Safetensors
Model size
12B params
Tensor type
F32
F16
U8
Inference Providers NEW
This model isn't deployed by any Inference Provider. 馃檵 Ask for provider support

Model tree for speakleash/Bielik-11B-v2.6-Instruct-bnb-4bit

Quantized
(8)
this model