Text Generation
Transformers
Safetensors
German
llama
text-generation-inference
How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="LSX-UniWue/LLaMmlein_120M_prerelease")
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("LSX-UniWue/LLaMmlein_120M_prerelease")
model = AutoModelForCausalLM.from_pretrained("LSX-UniWue/LLaMmlein_120M_prerelease")
Quick Links
A newer version of this model is available: LSX-UniWue/LLaMmlein_120M

LLäMmlein 120M

This is a German Tinyllama 120M language model trained from scratch using the Tinyllama codebase on the German portion of RedPajama V2. Find more details on our page and our preprint

Next to the final model, we publish intermediate training checkpoints for our base models as separate branches of the model repository. These can be accessed via the drop-down menu labeled "main" in the top left corner of the "Files and versions" section.

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("LSX-UniWue/LLaMmlein_120M")

tokenizer = AutoTokenizer.from_pretrained("LSX-UniWue/LLaMmlein_120M")

Performance

We evaluated our model on the SuperGLEBer benchmark. Data Take Down

Downloads last month
12
Safetensors
Model size
0.1B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Datasets used to train LSX-UniWue/LLaMmlein_120M_prerelease

Collection including LSX-UniWue/LLaMmlein_120M_prerelease

Paper for LSX-UniWue/LLaMmlein_120M_prerelease