A newer version of this model is available: LSX-UniWue/LLaMmlein_1B

LLäMmlein 1B

LLäMmlein 1B is a German language model with 1B parameters, trained from scratch with the TinyLlama codebase on the German portion of RedPajama V2. More details are available on our project page and in our preprint.

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer

# Load the model weights and the matching tokenizer from the Hugging Face Hub
model = AutoModelForCausalLM.from_pretrained("LSX-UniWue/LLaMmlein_1B")
tokenizer = AutoTokenizer.from_pretrained("LSX-UniWue/LLaMmlein_1B")
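
Once the model and tokenizer are loaded, text can be generated with the standard `generate` method. The following is a minimal sketch, not an official recipe from the authors: the helper names `generate_text` and `main`, the German prompt, greedy decoding, and the 40-token limit are all assumptions chosen for illustration.

```python
MODEL_ID = "LSX-UniWue/LLaMmlein_1B"  # model id taken from the Usage snippet


def generate_text(model, tokenizer, prompt, max_new_tokens=40):
    """Continue `prompt` with greedy decoding and return the decoded text.

    Hypothetical helper: the name and defaults are illustrative assumptions.
    """
    # Tokenize the prompt into PyTorch tensors
    inputs = tokenizer(prompt, return_tensors="pt")
    # do_sample=False selects greedy decoding, so the output is deterministic
    output_ids = model.generate(
        **inputs, max_new_tokens=max_new_tokens, do_sample=False
    )
    # The first returned sequence contains the prompt followed by the continuation
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


def main():
    # Imported here so that the helper above has no import-time dependency
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)
    # Example prompt (an assumption): "The capital of Bavaria is"
    print(generate_text(model, tokenizer, "Die Hauptstadt von Bayern ist"))
```

Calling `main()` downloads the weights on first use and prints the prompt together with its continuation. Because this is a base language model, plain text completion prompts like the one above are the natural input format.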

Evaluation

We evaluated our model on the SuperGLEBer benchmark.

Data Take Down
